1 |
How much context span is enough? Examining context-related issues for document-level MT
|
|
|
|
In: Castilho, Sheila orcid:0000-0002-8416-6555 (2022) How much context span is enough? Examining context-related issues for document-level MT. In: 13th Language Resources and Evaluation Conference, 21-23 June 2022, Marseille, France. (In Press) (2022)
|
|
Abstract:
This paper analyses how much context span is necessary to solve different context-related issues, namely, reference, ellipsis, gender, number, lexical ambiguity, and terminology when translating from English into Portuguese. We use the DELA corpus, which consists of 60 documents and six different domains (subtitles, literary, news, reviews, medical, and legislation). We find that the shortest context span to disambiguate issues can appear in different positions in the document including preceding, following, global, world knowledge; and that the average length depends on the issue types as well as the domain. Additionally, we show that the standard approach of relying on only two preceding sentences as context might not be enough depending on the domain and issue types.
|
|
Keyword:
Computational linguistics; context span; document-level; Language; Linguistics; Machine translating; Translating and interpreting
|
|
URL: http://doras.dcu.ie/27009/
|
|
BASE
|
|
Hide details
|
|
2 |
An investigation of English-Irish machine translation and associated resources
|
|
Dowling, Meghan. - : Dublin City University. School of Computing, 2022. : Dublin City University. ADAPT, 2022
|
|
In: Dowling, Meghan orcid:0000-0003-1637-4923 (2022) An investigation of English-Irish machine translation and associated resources. PhD thesis, Dublin City University. (2022)
|
|
BASE
|
|
Show details
|
|
3 |
The role of machine translation in translation education: A thematic analysis of translator educators’ beliefs
|
|
|
|
In: Translation and Interpreting : the International Journal of Translation and Interpreting Research, Vol 14, Iss 1, Pp 177-197 (2022) (2022)
|
|
BASE
|
|
Show details
|
|
4 |
DELA Corpus - A Document-Level Corpus Annotated with Context-Related Issues
|
|
|
|
In: Castilho, Sheila orcid:0000-0002-8416-6555 , Cavalheiro Camargo, João Lucas orcid:0000-0003-3746-1225 , Menezes, Miguel and Way, Andy orcid:0000-0001-5736-5930 (2021) DELA Corpus - A Document-Level Corpus Annotated with Context-Related Issues. In: Sixth Conference on Machine Translation (WMT21), 10-11 Nov 2021, Punta Cana, Dominican Republic (Online). ISBN 978-1-954085-94-7 (2021)
|
|
BASE
|
|
Show details
|
|
5 |
Can Google Translate Rewire Your L2 English Processing?
|
|
|
|
In: Resende, Natália orcid:0000-0002-5248-2457 and Way, Andy orcid:0000-0001-5736-5930 (2021) Can Google Translate Rewire Your L2 English Processing? Digital, 1 (1). pp. 66-85. ISSN 2673-6470 (2021)
|
|
BASE
|
|
Show details
|
|
6 |
Towards document-level human MT evaluation: On the Issues of annotator agreement, effort and misevaluation
|
|
|
|
In: Castilho, Sheila orcid:0000-0002-8416-6555 (2021) Towards document-level human MT evaluation: On the Issues of annotator agreement, effort and misevaluation. In: 16th Conference of the European Chapter of the Association for Computational Linguistics - EACL 2021., 19-23 April 2021, Online. (In Press) (2021)
|
|
BASE
|
|
Show details
|
|
7 |
Meaning and translation : theory and practice of machine translation as exemplified by applicative and cognitive grammars
|
|
|
|
BASE
|
|
Show details
|
|
8 |
On the differences between human translations
|
|
|
|
In: Popović, Maja orcid:0000-0001-8234-8745 (2020) On the differences between human translations. In: 22nd Annual Conference of the European Association for Machine Translation (EAMT 2020), 3 -5 Nov 2020, Lisbon, Portugal (Online). (2020)
|
|
BASE
|
|
Show details
|
|
9 |
MTrill project: machine translation impact on language learning
|
|
|
|
In: Resende, Natália orcid:0000-0002-5248-2457 and Way, Andy orcid:0000-0001-5736-5930 (2020) MTrill project: machine translation impact on language learning. In: European Association for Machine Translation (EAMT) 2020, 3-5 Nov 2020, Lisbon, Portugal (Online). ISBN 978-989-33-0589-8 (2020)
|
|
BASE
|
|
Show details
|
|
10 |
MT syntactic priming effects on L2 English speakers
|
|
|
|
In: Resende, Natália orcid:0000-0002-5248-2457 , Cowan, Benjamin orcid:0000-0002-8595-8132 and Way, Andy orcid:0000-0001-5736-5930 (2020) MT syntactic priming effects on L2 English speakers. In: European Association for Machine Translation (EAMT) 2020, 2-6 Nov 2010, Lisbon, Portugal (Online). ISBN 978-989-33-0589-8 (2020)
|
|
BASE
|
|
Show details
|
|
11 |
Automatic Translation, Context, and Supervised Learning in Comparative Politics
|
|
|
|
In: Courtney, Michael, Breen, Michael orcid:0000-0002-5857-9938 , McMenamin, Iain orcid:0000-0002-1704-390X and McNulty, Gemma orcid:0000-0002-6909-6958 (2020) Automatic Translation, Context, and Supervised Learning in Comparative Politics. Journal of Information Technology and Politics . ISSN 1933-1681 (2020)
|
|
BASE
|
|
Show details
|
|
12 |
Rapid development of competitive translation engines for access to multilingual COVID-19 information
|
|
|
|
In: Way, Andy orcid:0000-0001-5736-5930 , Haque, Rejwanul orcid:0000-0003-1680-0099 , Xie, Guodong, Gaspari, Federico orcid:0000-0003-3808-8418 , Popović, Maja orcid:0000-0001-8234-8745 and Poncelas, Alberto orcid:0000-0002-5089-1687 (2020) Rapid development of competitive translation engines for access to multilingual COVID-19 information. Informatics . ISSN 2227-9709 (2020)
|
|
BASE
|
|
Show details
|
|
13 |
A reception study of machine translated subtitles for MOOCs
|
|
Hu, Ke. - : Dublin City University. School of Applied Language and Intercultural Studies, 2020. : Dublin City University. Centre for Translation and Textual Studies (CTTS), 2020. : Dublin City University. ADAPT, 2020
|
|
In: Hu, Ke (2020) A reception study of machine translated subtitles for MOOCs. PhD thesis, Dublin City University. (2020)
|
|
BASE
|
|
Show details
|
|
14 |
“A tiny cog in a large machine”: Digital Taylorism in the Translation Industry
|
|
|
|
In: Moorkens, Joss orcid:0000-0003-0766-0071 (2020) “A tiny cog in a large machine”: Digital Taylorism in the Translation Industry. Translation Spaces, 9 (1). pp. 12-34. ISSN 2211-3711 (2020)
|
|
BASE
|
|
Show details
|
|
15 |
Comprehension and trust in crises: investigating the impact of machine translation and post-editing
|
|
|
|
In: Rossetti, Alessandra orcid:0000-0002-2162-9639 , O'Brien, Sharon orcid:0000-0003-4864-5986 and Cadwell, Patrick orcid:0000-0002-2371-4378 (2020) Comprehension and trust in crises: investigating the impact of machine translation and post-editing. In: 22nd Annual Conference of the European Association for Machine Translation (EAMT 2020), 6 Nov 2020, Lisbon, Portugal. ISBN 978-989-33-0589-8 (2020)
|
|
BASE
|
|
Show details
|
|
16 |
Machine translation of user-generated content
|
|
Lohar, Pintu. - : Dublin City University. School of Computing, 2020. : Dublin City University. ADAPT, 2020
|
|
In: Lohar, Pintu (2020) Machine translation of user-generated content. PhD thesis, Dublin City University. (2020)
|
|
BASE
|
|
Show details
|
|
17 |
Neural machine translation and the evolution of the localisation sector: implications for training
|
|
|
|
In: O'Brien, Sharon orcid:0000-0003-4864-5986 and Rossetti, Alessandra orcid:0000-0002-2162-9639 (2020) Neural machine translation and the evolution of the localisation sector: implications for training. Journal of Internationalization and Localization, 7 (1/2). pp. 95-121. ISSN 2032-6904 (2020)
|
|
BASE
|
|
Show details
|
|
18 |
A human evaluation of English-Irish statistical and neural machine translation
|
|
|
|
In: Dowling, Meghan orcid:0000-0003-1637-4923 , Castilho, Sheila orcid:0000-0002-8416-6555 , Moorkens, Joss orcid:0000-0003-4864-5986 , Lynn, Teresa and Way, Andy orcid:0000-0001-5736-5930 (2020) A human evaluation of English-Irish statistical and neural machine translation. In: Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 6 Nov 2020, Lisbon, Portugal. (2020)
|
|
BASE
|
|
Show details
|
|
19 |
A set of recommendations for assessing human-machine parity in language translation
|
|
|
|
In: Läubli, Samuel orcid:0000-0001-5362-4106 , Castilho, Sheila orcid:0000-0002-8416-6555 , Neubig, Graham, Sennrich, Rico orcid:0000-0002-1438-4741 , Shen, Qinlan and Toral, Antonio orcid:0000-0003-2357-2960 (2020) A set of recommendations for assessing human-machine parity in language translation. Journal of Artificial Intelligence Research, 67 . pp. 653-672. ISSN 1076-9757 (2020)
|
|
BASE
|
|
Show details
|
|
20 |
On the same page? Comparing inter-annotator agreement in sentence and document level human machine translation evaluation
|
|
|
|
In: Castilho, Sheila orcid:0000-0002-8416-6555 (2020) On the same page? Comparing inter-annotator agreement in sentence and document level human machine translation evaluation. In: Fifth Conference on Machine Translation, 19-20 Nov 2020, Dominican Republic (Online). (2020)
|
|
BASE
|
|
Show details
|
|
|
|