1 |
DELA Corpus - A Document-Level Corpus Annotated with Context-Related Issues
|
|
|
|
In: Castilho, Sheila orcid:0000-0002-8416-6555 , Cavalheiro Camargo, João Lucas orcid:0000-0003-3746-1225 , Menezes, Miguel and Way, Andy orcid:0000-0001-5736-5930 (2021) DELA Corpus - A Document-Level Corpus Annotated with Context-Related Issues. In: Sixth Conference on Machine Translation (WMT21), 10-11 Nov 2021, Punta Cana, Dominican Republic (Online). ISBN 978-1-954085-94-7 (2021)
|
|
BASE
|
|
Show details
|
|
2 |
Can Google Translate Rewire Your L2 English Processing?
|
|
|
|
In: Resende, Natália orcid:0000-0002-5248-2457 and Way, Andy orcid:0000-0001-5736-5930 (2021) Can Google Translate Rewire Your L2 English Processing? Digital, 1 (1). pp. 66-85. ISSN 2673-6470 (2021)
|
|
BASE
|
|
Show details
|
|
3 |
DELA Corpus - A Document-Level Corpus Annotated with Context-Related Issues ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Modelling source- and target-language syntactic Information as conditional context in interactive neural machine translation
|
|
|
|
In: Gupta, Kamal Kumar, Haque, Rejwanul orcid:0000-0003-1680-0099 , Ekbal, Asif, Bhattacharyya, Pushpak and Way, Andy orcid:0000-0001-5736-5930 (2020) Modelling source- and target-language syntactic Information as conditional context in interactive neural machine translation. In: Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2-6 Nov 2020, Lisboa, Portugal. (2020)
|
|
BASE
|
|
Show details
|
|
7 |
Selecting Backtranslated Data from Multiple Sources for Improved Neural Machine Translation
|
|
|
|
In: Soto, Xabier orcid:0000-0002-3622-6496 , Shterionov, Dimitar orcid:0000-0001-6300-797X , Poncelas, Alberto orcid:0000-0002-5089-1687 and Way, Andy orcid:0000-0001-5736-5930 (2020) Selecting Backtranslated Data from Multiple Sources for Improved Neural Machine Translation. In: Annual Conference of the Association for Computational Linguistics, ACL, 5-10 July 2020, Seattle, WA, USA (Online). (2020)
|
|
Abstract:
Machine translation (MT) has benefited from using synthetic training data originating from translating monolingual corpora, a technique known as backtranslation. Combining backtranslated data from different sources has led to better results than when using such data in isolation. In this work we analyse the impact that data translated with rule-based, phrase-based statistical and neural MT systems has on new MT systems. We use a real-world low-resource use-case (Basque-to-Spanish in the clinical domain) as well as a high-resource language pair (German-to-English) to test different scenarios with backtranslation and employ data selection to optimise the synthetic corpora. We exploit different data selection strategies in order to reduce the amount of data used, while at the same time maintaining high-quality MT systems. We further tune the data selection method by taking into account the quality of the MT systems used for backtranslation and lexical diversity of the resulting corpora. Our experiments show that incorporating backtranslated data from different sources can be beneficial, and that availing of data selection can yield improved performance.
|
|
Keyword:
Machine translating
|
|
URL: http://doras.dcu.ie/24425/
|
|
BASE
|
|
Hide details
|
|
8 |
MTrill project: machine translation impact on language learning
|
|
|
|
In: Resende, Natália orcid:0000-0002-5248-2457 and Way, Andy orcid:0000-0001-5736-5930 (2020) MTrill project: machine translation impact on language learning. In: European Association for Machine Translation (EAMT) 2020, 3-5 Nov 2020, Lisbon, Portugal (Online). ISBN 978-989-33-0589-8 (2020)
|
|
BASE
|
|
Show details
|
|
9 |
Syntax-informed interactive neural machine translation
|
|
|
|
In: Gupta, Kamal Kumar, Haque, Rejwanul orcid:0000-0003-1680-0099 , Ekbal, Asif, Bhattacharyya, Pushpak and Way, Andy orcid:0000-0001-5736-5930 (2020) Syntax-informed interactive neural machine translation. In: The International Joint Conference on Neural Networks (IJCNN), 19-24 July 2020, Glasgow, UK (Online). (2020)
|
|
BASE
|
|
Show details
|
|
10 |
MT syntactic priming effects on L2 English speakers
|
|
|
|
In: Resende, Natália orcid:0000-0002-5248-2457 , Cowan, Benjamin orcid:0000-0002-8595-8132 and Way, Andy orcid:0000-0001-5736-5930 (2020) MT syntactic priming effects on L2 English speakers. In: European Association for Machine Translation (EAMT) 2020, 2-6 Nov 2010, Lisbon, Portugal (Online). ISBN 978-989-33-0589-8 (2020)
|
|
BASE
|
|
Show details
|
|
11 |
Rapid development of competitive translation engines for access to multilingual COVID-19 information
|
|
|
|
In: Way, Andy orcid:0000-0001-5736-5930 , Haque, Rejwanul orcid:0000-0003-1680-0099 , Xie, Guodong, Gaspari, Federico orcid:0000-0003-3808-8418 , Popović, Maja orcid:0000-0001-8234-8745 and Poncelas, Alberto orcid:0000-0002-5089-1687 (2020) Rapid development of competitive translation engines for access to multilingual COVID-19 information. Informatics . ISSN 2227-9709 (2020)
|
|
BASE
|
|
Show details
|
|
12 |
Machine translation of user-generated content
|
|
Lohar, Pintu. - : Dublin City University. School of Computing, 2020. : Dublin City University. ADAPT, 2020
|
|
In: Lohar, Pintu (2020) Machine translation of user-generated content. PhD thesis, Dublin City University. (2020)
|
|
BASE
|
|
Show details
|
|
13 |
Parallel data extraction using word embeddings
|
|
|
|
In: Lohar, Pintu and Way, Andy orcid:0000-0001-5736-5930 (2020) Parallel data extraction using word embeddings. In: NLPTA 2020 : International Conference on NLP Techniques and Applications, 28-29 Nov 2020, London, UK (Online). (2020)
|
|
BASE
|
|
Show details
|
|
14 |
The ADAPT’s submissions to the WMT20 biomedical translation task
|
|
|
|
In: Nayak, Prashanth, Haque, Rejwanul orcid:0000-0003-1680-0099 and Way, Andy orcid:0000-0001-5736-5930 (2020) The ADAPT’s submissions to the WMT20 biomedical translation task. In: The Fifrth Conference on Machine Translation (The Biomedical Shared Task), 19-20 Nov 2020, Dominican Republic (Online). (In Press) (2020)
|
|
BASE
|
|
Show details
|
|
15 |
Terminology-aware sentence mining for NMT domain adaptation: ADAPT’s submission to the Adap-MT 2020 English-to-Hindi AI translation shared task
|
|
|
|
In: Haque, Rejwanul orcid:0000-0003-1680-0099 , Moslem, Yasmin orcid:0000-0003-4595-6877 and Way, Andy orcid:0000-0001-5736-5930 (2020) Terminology-aware sentence mining for NMT domain adaptation: ADAPT’s submission to the Adap-MT 2020 English-to-Hindi AI translation shared task. In: Workshop on Low Resource Domain Adaptation for Indic Machine Translation (Adap-MT 2020), 18-21 Dec 2020, Patna, India (Online). (2020)
|
|
BASE
|
|
Show details
|
|
16 |
Investigating query expansion and coreference resolution in question answering on BERT
|
|
|
|
In: Bhattacharjee, Santanu, Haque, Rejwanul orcid:0000-0003-1680-0099 , Maillette de Buy Wenniger, Gideon and Way, Andy orcid:0000-0001-5736-5930 (2020) Investigating query expansion and coreference resolution in question answering on BERT. In: 25th International Conference on Natural Language & Information Systems (NLDB 2020)), 24 - 26 June 2020, Saarbrücken, Germany (Online). ISBN 978-3-030-51309-2 (2020)
|
|
BASE
|
|
Show details
|
|
17 |
Identifying complaints from product reviews: a case study on Hindi
|
|
|
|
In: Singh, Raghvendra Pratap, Haque, Rejwanul orcid:0000-0003-1680-0099 , Hasanuzzaman, Mohammed orcid:0000-0003-1838-0091 and Way, Andy orcid:0000-0001-5736-5930 (2020) Identifying complaints from product reviews: a case study on Hindi. In: 28th Irish Conference on Artificial Intelligence and Cognitive Science, 7-8 Dec 2020, Dublin, Ireland. (2020)
|
|
BASE
|
|
Show details
|
|
18 |
The ADAPT Centre’s neural MT systems for the WAT 2020 document-level translation task
|
|
|
|
In: Jooste, Wandri, Haque, Rejwanul orcid:0000-0003-1680-0099 and Way, Andy orcid:0000-0001-5736-5930 (2020) The ADAPT Centre’s neural MT systems for the WAT 2020 document-level translation task. In: 7th Workshop on Asian Translation (WAT2020), 4 Dec 2020, Suzhou, China (Online). (2020)
|
|
BASE
|
|
Show details
|
|
19 |
Investigating low-resource machine translation for English-to-Tamil
|
|
|
|
In: Ramesh, Akshai, Parthasarathy, Venkatesh Balavadhani, Haque, Rejwanul orcid:0000-0003-1680-0099 and Way, Andy orcid:0000-0001-5736-5930 (2020) Investigating low-resource machine translation for English-to-Tamil. In: Proceedings of the 3rd Workshop on Technologies for MT of Low Resource Languages (LoResMT 2020) AACL-IJCNLP, December 4-7, 2020, Suzhou, China (Online). (2020)
|
|
BASE
|
|
Show details
|
|
20 |
The ADAPT system description for the STAPLE 2020 English-to-Portuguese translation task
|
|
|
|
In: Haque, Rejwanul orcid:0000-0003-1680-0099 , Moslem, Yasmin and Way, Andy orcid:0000-0001-5736-5930 (2020) The ADAPT system description for the STAPLE 2020 English-to-Portuguese translation task. In: 4th Workshop on Neural Generation and Translation (WNGT 2020), 10 July 2020, Seattle, WA, USA (Online). (2020)
|
|
BASE
|
|
Show details
|
|
|
|