1 |
Revisiting Multi-Domain Machine Translation
|
|
|
|
In: EISSN: 2307-387X ; Transactions of the Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-03159743 ; Transactions of the Association for Computational Linguistics, The MIT Press, 2021, 9, pp.17-35 (2021)
|
|
BASE
|
|
Show details
|
|
2 |
MEDLINE as a parallel corpus: a survey to gain insight on French-, Spanish- and Portuguese-speaking authors' abstract writing practice
|
|
|
|
In: International Conference on Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-03023950 ; International Conference on Language Resources and Evaluation, ELRA, May 2020, Marseille, France (2020)
|
|
BASE
|
|
Show details
|
|
3 |
What is best for Spoken Language Understanding: Small but Task-dependant Embeddings or Huge but Out-of-domain Embeddings?
|
|
|
|
In: IEEE International Conference on Acoustics, Speech, and Signal Processing ; https://hal.archives-ouvertes.fr/hal-02503694 ; IEEE International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelone, Spain (2020)
|
|
BASE
|
|
Show details
|
|
4 |
Generative latent neural models for automatic word alignment
|
|
|
|
In: Association for Machine Translation in the Americas ; https://hal.archives-ouvertes.fr/hal-02949042 ; Association for Machine Translation in the Americas, Oct 2020, Miami, Florida, United States. pp.64-77 ; https://amtaweb.org/amta-2020-conference-program-and-registration-are-available/ (2020)
|
|
BASE
|
|
Show details
|
|
5 |
Experimenting the Automatic Recognition of Non-Conventionalized Units in Sign Language
|
|
|
|
In: ISSN: 1999-4893 ; Algorithms ; https://hal.archives-ouvertes.fr/hal-03060271 ; Algorithms, MDPI, 2020, 13, pp.310-345. ⟨10.3390/a13120310⟩ (2020)
|
|
BASE
|
|
Show details
|
|
6 |
Towards Continuous Recognition of Illustrative and Spatial Structures in Sign Language
|
|
|
|
In: ECCV Sign Language Recognition, Translation and Production Workshop ; https://hal.archives-ouvertes.fr/hal-03060270 ; ECCV Sign Language Recognition, Translation and Production Workshop, Springer, Aug 2020, Glasgow, United Kingdom (2020)
|
|
BASE
|
|
Show details
|
|
7 |
SimAlign: High Quality Word Alignments Without Parallel Training Data Using Static and Contextualized Embeddings
|
|
|
|
In: EMNLP 2020 ; https://hal.archives-ouvertes.fr/hal-03013194 ; EMNLP 2020, Association for Computational Linguistics, Nov 2020, Online, United States. pp.1627 - 1643 (2020)
|
|
Abstract:
International audience ; Word alignments are useful for tasks like statistical and neural machine translation (NMT) and cross-lingual annotation projection. Statistical word aligners perform well, as do methods that extract alignments jointly with translations in NMT. However, most approaches require parallel training data and quality decreases as less training data is available. We propose word alignment methods that require no parallel data. The key idea is to leverage multilingual word embeddings {--} both static and contextualized {--} for word alignment. Our multilingual embeddings are created from monolingual data only without relying on any parallel data or dictionaries. We find that alignments created from embeddings are superior for four and comparable for two language pairs compared to those produced by traditional statistical aligners {--} even with abundant parallel data; e.g., contextualized embeddings achieve a word alignment F1 for English-German that is 5 percentage points higher than eflomal, a high-quality statistical aligner, trained on 100k parallel sentences.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO]Computer Science [cs]; contextualized embeddings; machine translation; word alignement
|
|
URL: https://hal.archives-ouvertes.fr/hal-03013194/file/2020.findings-emnlp.147.pdf https://hal.archives-ouvertes.fr/hal-03013194/document https://hal.archives-ouvertes.fr/hal-03013194
|
|
BASE
|
|
Hide details
|
|
8 |
Vocapia-LIMSI System for 2020 Shared Task on Code-switched Spoken Language Identification
|
|
|
|
In: The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities ; https://hal.archives-ouvertes.fr/hal-03091792 ; The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities, Oct 2020, Shanghai, China (2020)
|
|
BASE
|
|
Show details
|
|
9 |
TL-Explorer: A Digital Humanities Tool for Mapping and Analyzing Translated Literature
|
|
|
|
In: Proceedings of the The 4th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature ; https://hal.archives-ouvertes.fr/hal-03090881 ; Proceedings of the The 4th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, Dec 2020, Barcelona, Spain ; https://www.aclweb.org/anthology/2020.latechclfl-1.20/ (2020)
|
|
BASE
|
|
Show details
|
|
10 |
Boosting Neural Machine Translation with Similar Translations
|
|
|
|
In: Annual Meeting of the Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-02956324 ; Annual Meeting of the Association for Computational Linguistics, Jul 2020, Seattle, United States. pp.1570-1579, ⟨10.18653/v1/2020.acl-main.143⟩ (2020)
|
|
BASE
|
|
Show details
|
|
11 |
MEDIAPI-SKEL -A 2D-Skeleton Video Database of French Sign Language With Aligned French Subtitles
|
|
|
|
In: Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020) ; Proceedings of the 12th Language Resources and Evaluation Conference ; 12th Conference on Language Resources and Evaluation (LREC 2020) ; https://hal.archives-ouvertes.fr/hal-02952340 ; 12th Conference on Language Resources and Evaluation (LREC 2020), Jun 2020, Marseille, France. pp.6063-6068 (2020)
|
|
BASE
|
|
Show details
|
|
12 |
The Synthesis of Complex Shape Deployments in Sign Language
|
|
|
|
In: Proceedings of the 9th workshop on the Representation and Processing of Sign Languages ; https://hal.archives-ouvertes.fr/hal-02924673 ; Proceedings of the 9th workshop on the Representation and Processing of Sign Languages, May 2020, Marseille, France (2020)
|
|
BASE
|
|
Show details
|
|
13 |
Use cases for a Sign Language Concordancer
|
|
|
|
In: Proceedings of the 9th workshop on the Representation and Processing of Sign Languages ; https://hal.archives-ouvertes.fr/hal-02944491 ; Proceedings of the 9th workshop on the Representation and Processing of Sign Languages, May 2020, Marseille, France (2020)
|
|
BASE
|
|
Show details
|
|
14 |
Elicitation and corpus of spontaneous Sign Language discourse representation diagrams
|
|
|
|
In: Proceedings of the 9th workshop on the Representation and Processing of Sign Languages ; https://hal.archives-ouvertes.fr/hal-02924676 ; Proceedings of the 9th workshop on the Representation and Processing of Sign Languages, May 2020, Marseille, France (2020)
|
|
BASE
|
|
Show details
|
|
15 |
Person Identification Based on Sign Language Motion: Insights from Human Perception and Computational Modeling
|
|
|
|
In: International Conference on Movement and Computing ; https://hal.archives-ouvertes.fr/hal-03078733 ; International Conference on Movement and Computing, ACM, Jul 2020, Jersey City / Virtual, United States. ⟨10.1145/3401956.3404187⟩ (2020)
|
|
BASE
|
|
Show details
|
|
16 |
RNN embeddings for identifying difficult to understand medical words
|
|
|
|
In: ACL Workshop on Biomedical Natural Language Processing ; https://hal.archives-ouvertes.fr/hal-02371219 ; ACL Workshop on Biomedical Natural Language Processing, Aug 2019, Florence, Italy (2019)
|
|
BASE
|
|
Show details
|
|
17 |
The NLP4NLP Corpus (II): 50 Years of Research in Speech and Language Processing
|
|
|
|
In: ISSN: 2504-0537 ; EISSN: 2504-0537 ; Frontiers in Research Metrics and Analytics ; https://hal.archives-ouvertes.fr/hal-02413749 ; Frontiers in Research Metrics and Analytics, Frontiers Media, 2019, 3, pp.1-30 (2019)
|
|
BASE
|
|
Show details
|
|
18 |
Challenges in Audio Processing of Terrorist-Related Data
|
|
|
|
In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02415176 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
|
|
BASE
|
|
Show details
|
|
19 |
French Levothyrox® Crisis: Retrospective Analysis of Social Media
|
|
|
|
In: International Society of Pharmacovigilance ; https://hal.archives-ouvertes.fr/hal-02411632 ; International Society of Pharmacovigilance, Springer International Publishing, Oct 2019, Bogota, Colombia (2019)
|
|
BASE
|
|
Show details
|
|
20 |
The NLP4NLP Corpus (I): 50 Years of Publication, Collaboration and Citation in Speech and Language Processing
|
|
|
|
In: ISSN: 2504-0537 ; EISSN: 2504-0537 ; Frontiers in Research Metrics and Analytics ; https://hal.archives-ouvertes.fr/hal-02413751 ; Frontiers in Research Metrics and Analytics, Frontiers Media, 2019, 3, pp.1-30 (2019)
|
|
BASE
|
|
Show details
|
|
|
|