DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5
Hits 1 – 20 of 83

1
Benchmarking Data-driven Automatic Text Simplification for German
In: Säuberli, Andreas; Ebling, Sarah; Volk, Martin (2020). Benchmarking Data-driven Automatic Text Simplification for German. In: Gala, Nuria; Wilkens, Rodrigo. Proceedings of the 1st Workshop on Tools and Resources to Empower People with REAding DIfficulties (READI). Marseille: European Language Resources Association, 41-48. (2020)
Abstract: Automatic text simplification is an active research area, and there are first systems for English, Spanish, Portuguese, and Italian. For German, no data-driven approach exists to this date, due to a lack of training data. In this paper, we present a parallel corpus of news items in German with corresponding simplifications on two complexity levels. The simplifications have been produced according to a well-documented set of guidelines. We then report on experiments in automatically simplifying the German news items using state-of-the-art neural machine translation techniques. We demonstrate that despite our small parallel corpus, our neural models were able to learn essential features of simplified language, such as lexical substitutions, deletion of less relevant words and phrases, and sentence shortening.
Keyword: 000 Computer science; 410 Linguistics; Digital Society Initiative; Institute of Computational Linguistics; knowledge & systems
URL: https://www.zora.uzh.ch/id/eprint/188569/
https://www.zora.uzh.ch/id/eprint/188569/1/saeuberli-et-al-2020.pdf
https://doi.org/10.5167/uzh-188569
https://www.aclweb.org/anthology/2020.readi-1.7
BASE
Hide details
2
Modelling Large Parallel Corpora: The Zurich Parallel Corpus Collection
In: Graën, Johannes; Kew, Tannon; Shaitarova, Anastassia; Volk, Martin (2019). Modelling Large Parallel Corpora: The Zurich Parallel Corpus Collection. In: Challenges in the Management of Large Corpora (CMLC-7), Cardiff, Wales, 22 July 2019 - 22 July 2019. (2019)
BASE
Show details
3
Post-editing Productivity with Neural Machine Translation: An Empirical Assessment of Speed and Quality in the Banking and Finance Domain
In: Läubli, Samuel; Amrhein, Chantal; Düggelin, Patrick; Gonzalez, Beatriz; Zwahlen, Alena; Volk, Martin (2019). Post-editing Productivity with Neural Machine Translation: An Empirical Assessment of Speed and Quality in the Banking and Finance Domain. In: Machine Translation Summit XVII, Dublin, Ireland, 19 August 2019 - 23 August 2019, 267-272. (2019)
BASE
Show details
4
Crowdsourcing the OCR Ground Truth of a German and French Cultural Heritage Corpus
In: Clematide, Simon; Furrer, Lenz; Volk, Martin (2018). Crowdsourcing the OCR Ground Truth of a German and French Cultural Heritage Corpus. Journal for Language Technology and Computational Linguistics (JLCL), 33(1):25-47. (2018)
BASE
Show details
5
Parallel Corpora, Terminology Extraction and Machine Translation
In: Volk, Martin (2018). Parallel Corpora, Terminology Extraction and Machine Translation. In: 16. DTT-Symposion. Terminologie und Text(e), Mannheim, 22 March 2018 - 24 March 2018, 3-14. (2018)
BASE
Show details
6
Multi-word Adverbs – How well are they handled in Parsing and Machine Translation?
In: Volk, Martin; Graën, Johannes (2017). Multi-word Adverbs – How well are they handled in Parsing and Machine Translation? In: The 3rd Workshop on Multi-word Units in Machine Translation and Translation Technology (MUMTTT 2017), London, 14 November 2017 - 14 November 2017. (2017)
BASE
Show details
7
Lessons from a Massive Open Online Course (MOOC) on Natural Language Processing for Digital Humanitie
In: Clematide, Simon; Meraner, Isabel; Bubenhofer, Noah; Volk, Martin (2017). Lessons from a Massive Open Online Course (MOOC) on Natural Language Processing for Digital Humanitie. In: Teaching NLP for Digital Humanitie, Berlin, 12 September 2017 - 12 September 2017, 17-22. (2017)
BASE
Show details
8
Crossing Sentence Boundaries in Statistical Machine Translation
In: Mascarell, Laura; Rios, Annette; Volk, Martin. Crossing Sentence Boundaries in Statistical Machine Translation. In: MultiLingual, December 2016, p.50-52. (2016)
BASE
Show details
9
Efficient Exploration of Translation Variants in Large Multiparallel Corpora Using a Relational Database
In: Graën, Johannes; Clematide, Simon; Volk, Martin (2016). Efficient Exploration of Translation Variants in Large Multiparallel Corpora Using a Relational Database. In: 4th Workshop on the Challenges in the Management of Large Corpora, Portorož, 28 May 2016 - 28 May 2016, 20-23. (2016)
BASE
Show details
10
Multilingwis – A Multilingual Search Tool for Multi-Word Units in Multiparallel Corpora
In: Clematide, Simon; Graën, Johannes; Volk, Martin (2016). Multilingwis – A Multilingual Search Tool for Multi-Word Units in Multiparallel Corpora. In: Corpas Pastor, Gloria. Computerised and Corpus-based Approaches to Phraseology: Monolingual and Multilingual Perspectives/Fraseología computacional y basada en corpus: perspectivas monolingües y multilingües. Geneva: Tradulex, n/a. (2016)
BASE
Show details
11
Rule-based Automatic Text Simplification for German
In: Suter, Julia; Ebling, Sarah; Volk, Martin (2016). Rule-based Automatic Text Simplification for German. In: 13th Conference on Natural Language Processing (KONVENS 2016), Bochum, Germany, 19 September 2016 - 21 September 2016. (2016)
BASE
Show details
12
Building a Parallel Corpus on the World's Oldest Banking Magazine
In: Volk, Martin; Amrhein, Chantal; Aepli, Noëmi; Müller, Mathias; Ströbel, Phillip (2016). Building a Parallel Corpus on the World's Oldest Banking Magazine. In: KONVENS, Bochum, 19 September 2016 - 21 September 2016. (2016)
BASE
Show details
13
Bi-particle adverbs, PoS-tagging and the recognition of german separable prefix verbs
In: Volk, Martin; Clematide, Simon; Graën, Johannes; Ströbel, Phillip (2016). Bi-particle adverbs, PoS-tagging and the recognition of german separable prefix verbs. In: KONVENS 2016, Bochum, 19 September 2016 - 21 September 2016. (2016)
BASE
Show details
14
Reflections and a Proposal for a Query and Reporting Language for Richly Annotated Multiparallel Corpora
In: Clematide, Simon (2015). Reflections and a Proposal for a Query and Reporting Language for Richly Annotated Multiparallel Corpora. In: Gintare, Grigonyte; Clematide, Simon; Utka, Andrius; Volk, Martin. Proceedings of the Workshop on Innovative Corpus Query and Visualization Tools at NODALIDA 2015, May 11-13, 2015, Vilnius, Lithuania. Linköping, Sweden: Linköping University Electronic Press, Linköpings universitet, 6-16. (2015)
BASE
Show details
15
Cleaning the Europarl Corpus for Linguistic Applications
In: Graën, Johannes; Batinić, Dolores; Volk, Martin (2014). Cleaning the Europarl Corpus for Linguistic Applications. In: Konvens 2014, Hildesheim, 8 October 2014 - 10 October 2014. (2014)
BASE
Show details
16
Enforcing Consistent Translation of German Compound Coreferences
In: Mascarell, Laura; Fishel, Mark; Korchagina, Natalia; Volk, Martin (2014). Enforcing Consistent Translation of German Compound Coreferences. In: Konvens, Hildesheim, Germany, 8 October 2014 - 10 October 2014. (2014)
BASE
Show details
17
Combining statistical machine translation and translation memories with domain adaptation
In: Läubli, Samuel; Fishel, Mark; Volk, Martin; Weibel, Manuela (2013). Combining statistical machine translation and translation memories with domain adaptation. In: NODALIDA 2013, Nordic Conference of Computational Linguistics, Oslo, Norway, 22 May 2013 - 24 May 2013, 331-341. (2013)
BASE
Show details
18
Mining for Domain-specific Parallel Text from Wikipedia
In: Plamada, Magdalena; Volk, Martin (2013). Mining for Domain-specific Parallel Text from Wikipedia. In: Proceedings of the Sixth Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 2013 - August 2013, 112-120. (2013)
BASE
Show details
19
Assessing post-editing efficiency in a realistic translation environment
In: Läubli, Samuel; Fishel, Mark; Massey, Gary; Ehrensberger-Dow, Maureen; Volk, Martin (2013). Assessing post-editing efficiency in a realistic translation environment. In: MT Summit XIV Workshop on Post-editing Technology and Practice, Nice, 2 September 2013 - 2 September 2013, 83-91. (2013)
BASE
Show details
20
Statistical machine translation for automobile marketing texts
In: Läubli, Samuel; Fishel, Mark; Weibel, Manuela; Volk, Martin (2013). Statistical machine translation for automobile marketing texts. In: Proceedings of the XIV Machine Translation Summit, Nice, 2 September 2013 - 6 September 2013, 265-272. (2013)
BASE
Show details

Page: 1 2 3 4 5

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
83
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern