Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 7 of 7

1	Modelling Large Parallel Corpora: The Zurich Parallel Corpus Collection
	Graën, Johannes; Kew, Tannon; Shaitarova, Anastassia; Volk, Martin
	In: Graën, Johannes; Kew, Tannon; Shaitarova, Anastassia; Volk, Martin (2019). Modelling Large Parallel Corpora: The Zurich Parallel Corpus Collection. In: Challenges in the Management of Large Corpora (CMLC-7), Cardiff, Wales, 22 July 2019 - 22 July 2019. (2019)
	Abstract: Text corpora come in many different shapes and sizes and carry heterogeneous annotations, depending on their purpose and design. The true benefit of corpora is rooted in their annotation and the method by which this data is encoded is an important factor in their interoperability. We have accumulated a large collection of multilingual and parallel corpora and encoded it in a unified format which is compatible with a broad range of NLP tools and corpus linguistic applications. In this paper, we present our corpus collection and describe a data model and the extensions to the popular CoNLL-U format that enable us to encode it.
	Keyword: 000 Computer science; 410 Linguistics; Institute of Computational Linguistics; knowledge & systems
	URL: https://doi.org/10.14618/ids-pub-9020 https://www.zora.uzh.ch/id/eprint/175081/ https://doi.org/10.5167/uzh-175081 https://www.zora.uzh.ch/id/eprint/175081/1/Graen_Kew_Shaitarova_Volk_2019.pdf
	BASE
	Hide details

2	Multi-word Adverbs – How well are they handled in Parsing and Machine Translation?
	Volk, Martin; Graën, Johannes
	In: Volk, Martin; Graën, Johannes (2017). Multi-word Adverbs – How well are they handled in Parsing and Machine Translation? In: The 3rd Workshop on Multi-word Units in Machine Translation and Translation Technology (MUMTTT 2017), London, 14 November 2017 - 14 November 2017. (2017)
	BASE
	Show details

3	Efficient Exploration of Translation Variants in Large Multiparallel Corpora Using a Relational Database
	Graën, Johannes; Clematide, Simon; Volk, Martin
	In: Graën, Johannes; Clematide, Simon; Volk, Martin (2016). Efficient Exploration of Translation Variants in Large Multiparallel Corpora Using a Relational Database. In: 4th Workshop on the Challenges in the Management of Large Corpora, Portorož, 28 May 2016 - 28 May 2016, 20-23. (2016)
	BASE
	Show details

4	Multilingwis – A Multilingual Search Tool for Multi-Word Units in Multiparallel Corpora
	Clematide, Simon; Graën, Johannes; Volk, Martin
	In: Clematide, Simon; Graën, Johannes; Volk, Martin (2016). Multilingwis – A Multilingual Search Tool for Multi-Word Units in Multiparallel Corpora. In: Corpas Pastor, Gloria. Computerised and Corpus-based Approaches to Phraseology: Monolingual and Multilingual Perspectives/Fraseología computacional y basada en corpus: perspectivas monolingües y multilingües. Geneva: Tradulex, n/a. (2016)
	BASE
	Show details

5	Bi-particle adverbs, PoS-tagging and the recognition of german separable prefix verbs
	Volk, Martin; Clematide, Simon; Graën, Johannes...
	In: Volk, Martin; Clematide, Simon; Graën, Johannes; Ströbel, Phillip (2016). Bi-particle adverbs, PoS-tagging and the recognition of german separable prefix verbs. In: KONVENS 2016, Bochum, 19 September 2016 - 21 September 2016. (2016)
	BASE
	Show details

6	Cleaning the Europarl Corpus for Linguistic Applications
	Graën, Johannes; Batinic, Dolores; Volk, Martin. - 2014
	BASE
	Show details

7	Cleaning the Europarl Corpus for Linguistic Applications
	Graën, Johannes; Batinić, Dolores; Volk, Martin
	In: Graën, Johannes; Batinić, Dolores; Volk, Martin (2014). Cleaning the Europarl Corpus for Linguistic Applications. In: Konvens 2014, Hildesheim, 8 October 2014 - 10 October 2014. (2014)
	BASE
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern