Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2

Hits 1 – 20 of 30

1	Shapley Idioms: Analysing BERT Sentence Embeddings for General Idiom Token Identification
	Nedumpozhimana, Vasudevan; Klubička, Filip; Kelleher, John D.
	In: Front Artif Intell (2022)
	BASE
	Show details

2	Semantic Relatedness and Taxonomic Word Embeddings ...
	Kacmajor, Magdalena; Kelleher, John D.; Klubicka, Filip. - : arXiv, 2020
	BASE
	Show details

3	English WordNet Taxonomic Random Walk Pseudo-Corpora
	Klubicka, Filip; Maldonado, Alfredo; Mahalunkar, Abhijit...
	In: Conference papers (2020)
	BASE
	Show details

4	Language related issues for machine translation between closely related south Slavic languages
	Arcan, Mihael; Klubicka, Filip; Popovic, Maja. - : The COLING 2016 Organizing Committee, 2019
	BASE
	Show details

5	Synthetic, Yet Natural: Properties of WordNet Random Walk Corpora and the impact of rare words on embedding performance
	Klubicka, Filip; Mahalunkar, Abhijit; Maldonado, Alfredo...
	In: Conference papers (2019)
	BASE
	Show details

6	Size Matters: The Impact of Training Size in Taxonomically-Enriched Word Embeddings
	Maldonado, Alfredo; Klubicka, Filip; Kelleher, John D.
	In: Articles (2019)
	BASE
	Show details

7	Training corpus hr500k 1.0
	Ljubešić, Nikola; Agić, Željko; Klubička, Filip. - : Jožef Stefan Institute, 2018
	BASE
	Show details

8	Quantitative Fine-Grained Human Evaluation of Machine Translation Systems: a Case Study on English to Croatian ...
	Klubička, Filip; Toral, Antonio; Sánchez-Cartagena, Víctor M.. - : arXiv, 2018
	Abstract: This paper presents a quantitative fine-grained manual evaluation approach to comparing the performance of different machine translation (MT) systems. We build upon the well-established Multidimensional Quality Metrics (MQM) error taxonomy and implement a novel method that assesses whether the differences in performance for MQM error types between different MT systems are statistically significant. We conduct a case study for English-to-Croatian, a language direction that involves translating into a morphologically rich language, for which we compare three MT systems belonging to different paradigms: pure phrase-based, factored phrase-based and neural. First, we design an MQM-compliant error taxonomy tailored to the relevant linguistic phenomena of Slavic languages, which made the annotation process feasible and accurate. Errors in MT outputs were then annotated by two annotators following this taxonomy. Subsequently, we carried out a statistical analysis which showed that the best-performing system (neural) ... : 22 pages, 2 figures, 9 tables, 1 equation. This is a post-peer-review, pre-copyedit version of an article published in Machine Translation Journal. The final authenticated version will be available online at the journal page. arXiv admin note: substantial text overlap with arXiv:1706.04389 ...
	Keyword: 68T50; Artificial Intelligence cs.AI; Computation and Language cs.CL; FOS Computer and information sciences
	URL: https://arxiv.org/abs/1802.01451 https://dx.doi.org/10.48550/arxiv.1802.01451
	BASE
	Hide details

9	Is it worth it? Budget-related evaluation metrics for model selection ...
	Klubička, Filip; Salton, Giancarlo D.; Kelleher, John D.. - : arXiv, 2018
	BASE
	Show details

10	Quantitative Fine-grained Human Evaluation of Machine Translation Systems: a Case Study on English to Croatian
	Sanchez-Cartagena, Victor Manuel; Toral, Antonio; Klubicka, Filip
	In: Articles (2018)
	BASE
	Show details

11	Is it worth it? Budget-related evaluation metrics for model selection
	Klubicka, Filip; Salton, Giancarlo; Kelleher, John D.
	In: Conference papers (2018)
	BASE
	Show details

12	hr500k – A Reference Training Corpus of Croatian.
	Erjavec, Tomaž; Ljubešić, Nikola; Klubicka, Filip...
	In: Conference papers (2018)
	BASE
	Show details

13	Croatian Twitter training corpus ReLDI-NormTag-hr 1.1
	Ljubešić, Nikola; Farkaš, Daša; Klubička, Filip. - : Jožef Stefan Institute, 2017
	BASE
	Show details

14	Serbian Twitter training corpus ReLDI-NormTag-sr 1.0
	Ljubešić, Nikola; Farkaš, Daša; Klubička, Filip. - : Jožef Stefan Institute, 2017
	BASE
	Show details

15	Croatian Twitter training corpus ReLDI-NormTag-hr 1.0
	Ljubešić, Nikola; Farkaš, Daša; Klubička, Filip. - : Jožef Stefan Institute, 2017
	BASE
	Show details

16	Serbian Twitter training corpus ReLDI-NormTag-sr 1.1
	Ljubešić, Nikola; Farkaš, Daša; Klubička, Filip. - : Jožef Stefan Institute, 2017
	BASE
	Show details

17	Fine-grained human evaluation of neural versus phrase-based machine translation ...
	Klubička, Filip; Toral, Antonio; Sánchez-Cartagena, Víctor M.. - : arXiv, 2017
	BASE
	Show details

18	Fine-Grained Human Evaluation of Neural Versus Phrase-Based Machine Translation
	Klubička Filip; Toral Antonio; Sánchez-Cartagena Víctor M.
	In: Prague Bulletin of Mathematical Linguistics , Vol 108, Iss 1, Pp 121-132 (2017) (2017)
	BASE
	Show details

19	Serbian-English parallel corpus srenWaC 1.0
	Ljubešić, Nikola; Esplà-Gomis, Miquel; Ortiz Rojas, Sergio. - : Jožef Stefan Institute, 2016
	BASE
	Show details

20	Finnish-English parallel corpus fienWaC 1.0
	Ljubešić, Nikola; Esplà-Gomis, Miquel; Ortiz Rojas, Sergio. - : Jožef Stefan Institute, 2016
	BASE
	Show details

Page: 1 2

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern