1. The Lannang Corpus (LanCorp): A POS-tagged, sociolinguistic corpus containing recordings and transcriptions of Lannang speech collected from the metropolitan Manila Lannangs between 2016 and 2020 ...
BASE
2. Catalan United Nations v1.0 test set ...
Abstract:
Catalan version [1] of the test set from the United Nations v1.0 [2]. The translation was performed in two steps: the Spanish version of the test set was first translated automatically into Catalan, and a professional translator then post-edited the output.
[1] Marta R. Costa-Jussà, Noé Casas, Carlos Escolano, and José A. R. Fonollosa. 2019. Chinese-Catalan: A Neural Machine Translation Approach Based on Pivoting and Attention Mechanisms. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 18, 4, Article 43 (August 2019), 8 pages. DOI: https://doi.org/10.1145/3312575
[2] Michal Ziemski, Marcin Junczys-Dowmunt, and Bruno Pouliquen. 2016. The United Nations parallel corpus v1.0. In Proceedings of the LREC, 2016 ...
This work is supported by the Spanish Ministerio de Economía y Competitividad and the European Regional Development Fund, through the postdoctoral senior grant Ramón y Cajal. ...
Keyword:
Benchmark; Catalan; Multilingual Parallel Data; United Nations
URL: https://zenodo.org/record/3888413 https://dx.doi.org/10.5281/zenodo.3888413
6. MLM: A Benchmark Dataset for Multitask Learning with Multiple Languages and Modalities ...
7. MLM: A Benchmark Dataset for Multitask Learning with Multiple Languages and Modalities ...
9. Evaluating the Impact of Bilingual Lexical Resources on Cross-lingual Sentiment Projection in the Pharmaceutical Domain ...
10. Evaluating the Impact of Bilingual Lexical Resources on Cross-lingual Sentiment Projection in the Pharmaceutical Domain ...
11. When humans and machines collaborate: Cross-lingual Label Editing in Wikidata ...
12. Talk of Europe - The debates of the European Parliament as Linked Open Data ...
13. Context-based ontology matching and data interlinking
In: https://hal.archives-ouvertes.fr/hal-01180927 ; [Contract] Lindicle. 2015, pp.21 (2015)
14. Talk of Europe - The debates of the European Parliament as Linked Open Data ...
17. Hétérogénéité des corpus et textométrie [Corpus heterogeneity and textometry]
In: Langages, no. 187, 3, 2012-09-01, pp. 13-26 (2012)
18. 多言語平行コーパスのための「言語学的におもしろい100の文」 ; 100 linguistically interesting sentences for a multilingual parallel corpus