DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 21

1
The German reference corpus DeReKo: New developments – new opportunities
In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (2018), 4353-4360
IDS Bibliografie zur deutschen Grammatik
Show details
2
Proceedings of the 4th Workshop on Challenges in the Management of Large Corpora
Bański, Piotr (Hrsg.); Kupietz, Marc (Hrsg.); Lüngen, Harald (Hrsg.). - 2016
IDS Bibliografie zur deutschen Grammatik
Show details
3
Efficient Exploration of Translation Variants in Large Multiparallel Corpora Using a Relational Database
In: Graën, Johannes; Clematide, Simon; Volk, Martin (2016). Efficient Exploration of Translation Variants in Large Multiparallel Corpora Using a Relational Database. In: 4th Workshop on the Challenges in the Management of Large Corpora, Portorož, 28 May 2016 - 28 May 2016, 20-23. (2016)
Abstract: We present an approach for searching and exploring translation variants of multi-word units in large multiparallel corpora based on a relational database management system. Our web-based application Multilingwis, which allows for multilingual lookups of phrases and words in English, French, German, Italian and Spanish, is of interest to anybody who wants to quickly compare expressions across several languages, such as language learners without linguistic knowledge. In this paper, we focus on the technical aspects of how to represent and efficiently retrieve all occurrences that match the user’s query in one of five languages simultaneously with their translations into the other four languages. In order to identify such translations in our corpus of 220 million tokens in total, we use statistical sentence and word alignment. By using materialized views, composite indexes, and pre-planned search functions, our relational database management system handles large result sets with only moderate requirements to the underlying hardware. As our systematic evaluation on 200 search terms per language shows, we can achieve retrieval times below 1 second in 75 % of the cases for multi-word expressions.
Keyword: 000 Computer science; 410 Linguistics; Institute of Computational Linguistics; knowledge & systems
URL: https://www.zora.uzh.ch/id/eprint/124373/1/cmlc4.pdf
https://doi.org/10.5167/uzh-124373
https://www.zora.uzh.ch/id/eprint/124373/
http://www.lrec-conf.org/proceedings/lrec2016/workshops/LREC2016Workshop-CMLC_Proceedings.pdf
BASE
Hide details
4
Challenges in the alignment, management and exploitation of large and richly annotated multi-parallel corpora
In: Graën, Johannes; Clematide, Simon (2015). Challenges in the alignment, management and exploitation of large and richly annotated multi-parallel corpora. In: 3rd Workshop on the Challenges in the Management of Large Corpora, Lancaster, 20 July 2015 - 20 July 2015, 15-20. (2015)
BASE
Show details
5
Modelling, Learning and Processing of Text Technological Data Structures
Mehler, Alexander (Hrsg.); Kühnberger, Kai-Uwe (Hrsg.); Lobin, Henning (Hrsg.). - Dordrecht : Springer, 2011
IDS OBELEX meta
Show details
6
The Morphosyntactic Annotation of DeReKo: Interpretation, Opportunities, and Pitfalls
In: Grammatik und Korpora 2009. Dritte internationale Konferenz (2011), 451-472
IDS Bibliografie zur deutschen Grammatik
Show details
7
Unification of XML documents with concurrent markup
In: LLC. - Oxford : Oxford Univ. Press 20 (2005) 1, 103-116
BLLDB
Show details
8
Unification of XML documents with concurrent markup
In: Literary & linguistic computing. - Oxford : Oxford Univ. Press 20 (2005) 1, 103-116
OLC Linguistik
Show details
9
Unification of XML Documents with Concurrent Markup
Witt, Andreas; Goecke, Daniela; Sasaki, Felix. - : Oxford University Press, 2005
BASE
Show details
10
Unification of XML Documents with Concurrent Markup
Witt, Andreas; Goecke, Daniela; Sasaki, Felix. - : Oxford University Press, 2005
BASE
Show details
11
Enhancing speech corpus resources with multiple lexical tag layers [Online resource]
IDS-Repository
Show details
12
Introduction: Modeling, Learning and Processing of Text-Technological Data Structures [Online resource]
IDS-Repository
Show details
13
GOLD and Discourse: Domain- and Community-Specific Extensions [Online resource]
IDS-Repository
Show details
14
Unification of XML Documents with Concurrent Markup [Online resource]
IDS-Repository
Show details
15
Different Views on Markup [Online resource]
IDS-Repository
Show details
16
The German reference corpus DeReKo: new developments – new opportunities [Online resource]
IDS-Repository
Show details
17
The Morphosyntactic Annotation of DeReKo: Interpretation, Opportunities, and Pitfalls [Online resource]
IDS-Repository
Show details
18
Unification of XML Documents with Concurrent Markup [Online resource]
IDS-Repository
Show details
19
Multi-Dimensional Markup: N-way relations as a generalisation over possible relations between annotation layers [Online resource]
IDS-Repository
Show details
20
Methods for the semantic analysis of document markup [Online resource]
IDS-Repository
Show details

Page: 1 2

Catalogues
0
0
1
0
0
0
0
Bibliographies
1
0
3
0
0
0
1
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
4
0
11
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern