1. Towards Arabic Sentence Simplification via Classification and Generative Approaches ...
BASE

2. Overview of the Fourth BUCC Shared Task: Bilingual Dictionary Induction from Comparable Corpora
In: 13th Workshop on Building and Using Comparable Corpora (BUCC) ; https://hal.archives-ouvertes.fr/hal-03100822 ; 13th Workshop on Building and Using Comparable Corpora (BUCC), May 2020, Marseille, France. pp. 6-13 (2020)
BASE

3. Know thy corpus! Robust methods for digital curation of Web corpora ...
BASE

4. Recognizing semantic relations by combining transformers and fully connected models
BASE

5. A Multilingual Dataset for Evaluating Parallel Sentence Extraction from Comparable Corpora
In: International Conference on Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-01898362 ; International Conference on Language Resources and Evaluation, May 2018, Miyazaki, Japan (2018)
BASE

8. Language Adaptation for Extending Post-Editing Estimates for Closely Related Languages
In: Prague Bulletin of Mathematical Linguistics, Vol 106, Iss 1, Pp 181-192 (2016)
Abstract:
This paper presents an open-source toolkit for predicting human post-editing efforts for closely related languages. At the moment, training resources for the Quality Estimation task are available for very few language directions and domains. Available resources can be expanded on the assumption that MT errors and the amount of post-editing required to correct them are comparable across related languages, even if the feature frequencies differ. In this paper we report a toolkit for achieving language adaptation, which is based on learning new feature representation using transfer learning methods. In particular, we report performance of a method based on Self-Taught Learning which adapts the English-Spanish pair to produce Quality Estimation models for translation from English into Portuguese, Italian and other Romance languages using the publicly available Autodesk dataset.
Keyword:
Computational linguistics. Natural language processing; P98-98.5
URL: https://doi.org/10.1515/pralin-2016-0017 ; https://doaj.org/article/929b4b847e784ff58e42fa7bb30fa2df
BASE

17. Terminology Extraction, Translation Tools and Comparable Corpora: TTC concept, midterm progress and achieved results
In: LREC 2012 Workshop on Creating Cross-language Resources for Disconnected Languages and Styles (CREDISLAS) ; https://hal.archives-ouvertes.fr/hal-00819909 ; LREC 2012 Workshop on Creating Cross-language Resources for Disconnected Languages and Styles (CREDISLAS), May 2012, Istanbul, Turkey. 4 p (2012)
BASE

19. User-centred Views on Terminology Extraction Tools: Usage Scenarios and Integration into MT and CAT Tools.
In: Actes du colloque Tralogy : Anticiper les technologies pour la traduction ; Tralogy I. Métiers et technologies de la traduction : quelles convergences pour l'avenir ? ; https://hal.archives-ouvertes.fr/hal-00818657 ; Tralogy I. Métiers et technologies de la traduction : quelles convergences pour l'avenir ?, Mar 2011, Paris, France. 10 p (2011)
BASE