Hits 1 – 14 of 14

1	Proceedings of the Workshop on Challenges in the Management of Large Corpora (CMLC-9) 2021. Limerick, 12 July 2021 (Online-Event) ...
	Lüngen, Harald; Kupietz, Marc; Bański, Piotr. - : Leibniz-Institut für Deutsche Sprache, 2021
	BASE

2	Proceedings of the LREC 2020: 8th Workshop on Challenges in the Management of Large Corpora (CMLC-8)
	In: Proceedings of the LREC 2020: 8th Workshop on Challenges in the Management of Large Corpora (CMLC-8). Edited by: Bański, Piotr; Barbaresi, Adrien; Clematide, Simon; Kupietz, Marc; Lüngen, Harald; Pisetta, Ines (2020). Marseille, France: European Language Resources Association. (2020)
	BASE

3	Increasing Interoperability for Embedding Corpus Annotation Pipelines in Wmatrix and other corpus retrieval tools
	Rayson, Paul Edward. - 2018
	BASE

4	Challenges in the Management of Large Corpora (CMLC-6)
	In: Challenges in the Management of Large Corpora (CMLC-6). Edited by: Bański, Piotr; Kupietz, Marc; Barbaresi, Adrien; Biber, Hanno; Breiteneder, Evelyn; Clematide, Simon; Witt, Andreas (2018). Paris: European Language Resources Association (ELRA). (2018)
	Abstract: Large corpora require careful design, licensing, collecting, cleaning, encoding, annotation, management, storage, retrieval, analysis, and curation to unfold their potential for a wide range of research questions and users, across a number of disciplines. Apart from the usual CMLC topics that fall into these areas, the 6th edition of the CMLC workshop features a special focus on corpus query and analysis systems and specifically on goals concerning their interoperability. In the past 5 years, a whole new generation of corpus query engines that overcome limitations on the number of tokens and annotation layers has started to emerge at several research centers. While there seems to be a consensus that there can be no single corpus tool that fulfills the need of all communities and that a degree of heterogeneity is required, the time seems ripe to discuss whether (further, unrestricted) divergence should be avoided in order to allow for some interoperability and reusability – and how this can be achieved. The two most prominent areas where interoperability seems highly desirable are query languages and software components for corpus analysis. The former issue is already partially addressed by the proposed ISO standard Corpus Query Lingua Franca (CQLF). Components for corpus analysis and further processing of results (e.g. for visualization), on the other hand, should in an ideal world be exchangeable and reusable across different platforms, not only to avoid redundancies, but also to foster replicability and a canonization of methodology in NLP and corpus linguistics. The 6th edition of the workshop is meant to address these issues, notably by including an expert panel discussion with representatives of tool development teams and power users.
	Keyword: 000 Computer science; 410 Linguistics; Institute of Computational Linguistics; knowledge & systems
	URL: https://www.zora.uzh.ch/id/eprint/162636/1/BanskiKupietz2018.pdf https://doi.org/10.5167/uzh-162636 http://lrec-conf.org/workshops/lrec2018/W17/index.html https://www.zora.uzh.ch/id/eprint/162636/
	BASE

5	Proceedings of the 4th Workshop on Challenges in the Management of Large Corpora
	Bański, Piotr (ed.); Kupietz, Marc (ed.); Lüngen, Harald (ed.). - 2016
	IDS Bibliografie zur deutschen Grammatik

6	Efficient Exploration of Translation Variants in Large Multiparallel Corpora Using a Relational Database
	Graën, Johannes; Clematide, Simon; Volk, Martin
	In: Graën, Johannes; Clematide, Simon; Volk, Martin (2016). Efficient Exploration of Translation Variants in Large Multiparallel Corpora Using a Relational Database. In: 4th Workshop on the Challenges in the Management of Large Corpora, Portorož, 28 May 2016, 20-23. (2016)
	BASE

7	Challenges in the alignment, management and exploitation of large and richly annotated multi-parallel corpora
	Graën, Johannes; Clematide, Simon
	In: Graën, Johannes; Clematide, Simon (2015). Challenges in the alignment, management and exploitation of large and richly annotated multi-parallel corpora. In: 3rd Workshop on the Challenges in the Management of Large Corpora, Lancaster, 20 July 2015, 15-20. (2015)
	BASE

8	KorAP: the new corpus analysis platform at IDS Mannheim [Online resource]
	Bański, Piotr; Bingel, Joachim; Diewald, Nils.
	IDS-Repository

9	Robust corpus architecture: a new look at virtual collections and data access [Online resource]
	Bański, Piotr; Frick, Elena; Hanl, Michael.
	IDS-Repository

10	KorAP architecture – diving in the deep sea of corpus data [Online resource]
	Diewald, Nils; Hanl, Michael; Margaretha, Eliza.
	IDS-Repository

11	The New IDS Corpus Analysis Platform: Challenges and Prospects [Online resource]
	Bański, Piotr; Fischer, Peter M.; Frick, Elena.
	IDS-Repository

12	Maximizing the potential of very large corpora: 50 years of big language data at IDS Mannheim [Online resource]
	Kupietz, Marc; Lüngen, Harald; Bański, Piotr.
	IDS-Repository

13	EuReCo - Joining Forces for a European Reference Corpus as a sustainable base for cross-linguistic research [Online resource]
	Kupietz, Marc; Witt, Andreas; Bański, Piotr.
	IDS-Repository

14	Access control by query rewriting: the case of KorAP [Online resource]
	Bański, Piotr; Diewald, Nils; Hanl, Michael.
	IDS-Repository
