1 |
How much context span is enough? Examining context-related issues for document-level MT
|
|
|
|
In: Castilho, Sheila orcid:0000-0002-8416-6555 (2022) How much context span is enough? Examining context-related issues for document-level MT. In: 13th Language Resources and Evaluation Conference, 21-23 June 2022, Marseille, France. (In Press) (2022)
|
|
BASE
|
|
Show details
|
|
2 |
DELA Corpus - A Document-Level Corpus Annotated with Context-Related Issues
|
|
|
|
In: Castilho, Sheila orcid:0000-0002-8416-6555 , Cavalheiro Camargo, João Lucas orcid:0000-0003-3746-1225 , Menezes, Miguel and Way, Andy orcid:0000-0001-5736-5930 (2021) DELA Corpus - A Document-Level Corpus Annotated with Context-Related Issues. In: Sixth Conference on Machine Translation (WMT21), 10-11 Nov 2021, Punta Cana, Dominican Republic (Online). ISBN 978-1-954085-94-7 (2021)
|
|
Abstract:
Recently, the Machine Translation (MT) community has become more interested in document-level evaluation especially in light of reactions to claims of "human parity", since examining the quality at the level of the document rather than at the sentence level allows for the assessment of suprasentential context, providing a more reliable evaluation. This paper presents a document-level corpus annotated in English with context-aware issues that arise when translating from English into Brazilian Portuguese, namely ellipsis, gender, lexical ambiguity, number, reference, and terminology, with six different domains. The corpus can be used as a challenge test set for evaluation and as a training/testing corpus for MT as well as for deep linguistic analysis of context issues. To the best of our knowledge, this is the first corpus of its kind.
|
|
Keyword:
annotation; Computational linguistics; corpus; document-level MT; Language; Machine translating; machine translation evaluation; Translating and interpreting
|
|
URL: http://doras.dcu.ie/26256/
|
|
BASE
|
|
Hide details
|
|
3 |
Towards document-level human MT evaluation: On the Issues of annotator agreement, effort and misevaluation
|
|
|
|
In: Castilho, Sheila orcid:0000-0002-8416-6555 (2021) Towards document-level human MT evaluation: On the Issues of annotator agreement, effort and misevaluation. In: 16th Conference of the European Chapter of the Association for Computational Linguistics - EACL 2021., 19-23 April 2021, Online. (In Press) (2021)
|
|
BASE
|
|
Show details
|
|
4 |
Contextualization of Web contents through semantic enrichment from linked open data ; Contextualisation des contenus Web par l'enrichissement sémantique à partir de données
|
|
|
|
In: https://tel.archives-ouvertes.fr/tel-03561788 ; Databases [cs.DB]. Normandie Université, 2021. English. ⟨NNT : 2021NORMC243⟩ (2021)
|
|
BASE
|
|
Show details
|
|
5 |
A Novel Deep Learning ArCAR System for Arabic Text Recognition with Character-Level Representation
|
|
|
|
In: Computer Sciences & Mathematics Forum; Volume 2; Issue 1; Pages: 14 (2021)
|
|
BASE
|
|
Show details
|
|
6 |
What cultural aspects should be taught in FL lessons? – A model for evaluating the cultural content in FL course-books
|
|
|
|
In: Acta Scientiarum. Language and Culture; Vol 43 No 2 (2021): July-Dec.; e52066 ; Acta Scientiarum. Language and Culture; v. 43 n. 2 (2021): July-Dec.; e52066 ; 1983-4683 ; 1983-4675 (2021)
|
|
BASE
|
|
Show details
|
|
7 |
Redonner du sens à l’accord interannotateurs : vers une interprétation des mesures d’accord en termes de reproductibilité de l’annotation
|
|
|
|
In: ISSN: 1248-9433 ; EISSN: 1965-0906 ; Revue TAL ; https://hal.archives-ouvertes.fr/hal-02375240 ; Revue TAL, ATALA (Association pour le Traitement Automatique des Langues), 2019, 60 (2), pp.23 (2019)
|
|
BASE
|
|
Show details
|
|
8 |
Εntity-level Εvent Ιmpact Αnalytics ; Analyse de l’Impact des Événements au Niveau des Entités
|
|
|
|
In: https://hal.archives-ouvertes.fr/tel-02102795 ; Document and Text Processing. Normandie Université, Unicaen, EnsiCaen, CNRS, GREYC UMR 6072, 2019. English (2019)
|
|
BASE
|
|
Show details
|
|
9 |
A CNN-BiLSTM Model for Document-Level Sentiment Analysis
|
|
|
|
In: Machine Learning and Knowledge Extraction ; Volume 1 ; Issue 3 ; Pages 48-847 (2019)
|
|
BASE
|
|
Show details
|
|
10 |
ЛЕКСИЧЕСКАЯ КАТЕГОРИЯ «ДОКУМЕНТ» В КОГНИТИВНОМ АСПЕКТЕ
|
|
СЛАУТИНА МАРИНА ВАСИЛЬЕВНА. - : Федеральное государственное бюджетное образовательное учреждение высшего профессионального образования «Волгоградский государственный университет», 2016
|
|
BASE
|
|
Show details
|
|
11 |
Semantic Hierarchical Document Signature For Determining Sentence Similarity
|
|
|
|
In: Proceedings of the 19th international conference on Fuzzy Systems (2015)
|
|
BASE
|
|
Show details
|
|
12 |
DAnIEL, parsimonious yet high-coverage multilingual epidemic surveillance ; DAnIEL : Veille épidémiologique multilingue parcimonieuse
|
|
|
|
In: 20ème conférence du Traitement Automatique du Langage Naturel 2013 (TALN 2013) ; https://hal.archives-ouvertes.fr/hal-01074881 ; 20ème conférence du Traitement Automatique du Langage Naturel 2013 (TALN 2013), Jun 2013, Sables d'Olonne, France. p.787-788 (2013)
|
|
BASE
|
|
Show details
|
|
13 |
Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization
|
|
|
|
In: http://users.cis.fiu.edu/%7Etaoli/pub/sigir08-p307-wang.pdf (2008)
|
|
BASE
|
|
Show details
|
|
15 |
Script-Independent Text Line Segmentation in Freestyle Handwritten Documents
|
|
|
|
In: DTIC (2006)
|
|
BASE
|
|
Show details
|
|
16 |
Towards a Quantitative Theory of Variability ; Towards a Quantitative Theory of Variability: Language, brain and computation
|
|
|
|
In: UG and External Systems ; https://hal.archives-ouvertes.fr/hal-00134205 ; Ana-Maria Di Sciullo. UG and External Systems, John Benjamins, pp.375-388, 2005 (2005)
|
|
BASE
|
|
Show details
|
|
17 |
Theater Strategy and the Theater Campaign Plan: Both Are Essential
|
|
|
|
In: DTIC (1988)
|
|
BASE
|
|
Show details
|
|
18 |
Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization
|
|
|
|
In: http://users.cs.fiu.edu/~taoli/tenure/fp557-Wang.pdf
|
|
BASE
|
|
Show details
|
|
19 |
Handwritten Text Image Compression for Indic Script
|
|
|
|
In: http://research.ijcaonline.org/volume47/number5/pxc3879888.pdf
|
|
BASE
|
|
Show details
|
|
|
|