1 |
Books of Hours: the First Liturgical Corpus for Text Segmentation
|
|
|
|
In: Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020) ; 12th Language Resources and Evaluation Conference ; https://hal.archives-ouvertes.fr/hal-02931294 ; 12th Language Resources and Evaluation Conference, May 2020, Marseille (Virtual), France. pp.776-784 (2020)
|
|
BASE
|
|
Show details
|
|
2 |
Hierarchical Text Segmentation for Medieval Manuscripts
|
|
|
|
In: COLING'2020 The 28th International Conference on Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-03100170 ; COLING'2020 The 28th International Conference on Computational Linguistics, Dec 2020, Barcelona, Spain. pp.6240-6251 ; https://www.aclweb.org/anthology/2020.coling-main.549.pdf (2020)
|
|
BASE
|
|
Show details
|
|
4 |
Towards Automatic Variant Analysis of Ancient Devotional Texts
|
|
|
|
In: 1st International Workshop on Computational Approaches to Historical Language Change ; http://hal.univ-nantes.fr/hal-02401711 ; 1st International Workshop on Computational Approaches to Historical Language Change, co-located with ACL 2019, Aug 2019, Florence, Italy. pp.240-249, ⟨10.18653/v1/W19-4730⟩ (2019)
|
|
BASE
|
|
Show details
|
|
5 |
Leveraging Meta-Embeddings for Bilingual Lexicon Extraction from Specialized Comparable Corpora
|
|
|
|
In: 27th International Conference on Computational Linguistics (COLING) ; https://hal.archives-ouvertes.fr/hal-02181815 ; 27th International Conference on Computational Linguistics (COLING), Aug 2018, Santa Fe, United States. pp.937-949 (2018)
|
|
BASE
|
|
Show details
|
|
6 |
Bilingual Word Embeddings for Bilingual Terminology Extraction from Specialized Comparable Corpora
|
|
|
|
In: 8th International Joint Conference on Natural Language Processing (IJCNLP) ; https://hal.archives-ouvertes.fr/hal-01757418 ; 8th International Joint Conference on Natural Language Processing (IJCNLP), Nov 2017, Taipei, Taiwan (2017)
|
|
BASE
|
|
Show details
|
|
7 |
Improving Bilingual Terminology Extraction from Comparable Corpora via Multiple Word-Space Models
|
|
|
|
In: 10th International Conference on Language Resources and Evaluation (LREC) ; https://hal.archives-ouvertes.fr/hal-02434073 ; 10th International Conference on Language Resources and Evaluation (LREC), May 2016, Portorož, Slovenia (2016)
|
|
BASE
|
|
Show details
|
|
8 |
Efficient Data Selection for Bilingual Terminology Extraction from Comparable Corpora
|
|
|
|
In: 26th International Conference on Computational Linguistics (COLING) ; https://hal.archives-ouvertes.fr/hal-02001789 ; 26th International Conference on Computational Linguistics (COLING), Dec 2016, Osaka, Japan. pp.3401-3411 ; https://www.aclweb.org/anthology/C16-1321 (2016)
|
|
BASE
|
|
Show details
|
|
9 |
Exploiting Unbalanced Specialized Comparable Corpora for Bilingual Lexicon Extraction
|
|
|
|
In: ISSN: 1351-3249 ; EISSN: 1469-8110 ; Natural Language Engineering ; https://hal.archives-ouvertes.fr/hal-01188579 ; Natural Language Engineering, Cambridge University Press (CUP), 2016, Special Issue: Machine Translation Using Comparable Corpora, pp.27 (2016)
|
|
BASE
|
|
Show details
|
|
10 |
Indexation d'articles scientifiques Présentation et résultats du défi fouille de textes DEFT 2016
|
|
|
|
In: Atelier DEFT 2016 ; https://hal.archives-ouvertes.fr/hal-01693785 ; Atelier DEFT 2016, Jul 2016, Paris, France (2016)
|
|
BASE
|
|
Show details
|
|
11 |
Continuous Adaptation to User Feedback for Statistical Machine Translation
|
|
|
|
In: North American Chapter of the Association for Computational Linguistics – Human Language Technologies (NAACL HLT 2015) ; https://hal.archives-ouvertes.fr/hal-01454944 ; North American Chapter of the Association for Computational Linguistics – Human Language Technologies (NAACL HLT 2015), 2015, Denver (Colorado, USA), Unknown Region (2015)
|
|
BASE
|
|
Show details
|
|
12 |
Continuous adaptation to user feedback for statistical machine translation
|
|
|
|
In: 1001 ; 1005 (2015)
|
|
BASE
|
|
Show details
|
|
13 |
Extraction de lexiques bilingues à partir de corpus comparables spécialisés : étude du contexte lexical
|
|
|
|
In: ISSN: 1248-9433 ; EISSN: 1965-0906 ; Revue TAL ; https://hal.archives-ouvertes.fr/hal-01097566 ; Revue TAL, ATALA (Association pour le Traitement Automatique des Langues), 2014, Varia, 55 (1), pp.13-44 (2014)
|
|
BASE
|
|
Show details
|
|
14 |
Looking at Unbalanced Specialized Comparable Corpora for Bilingual Lexicon Extraction
|
|
|
|
In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL) ; https://hal.archives-ouvertes.fr/hal-01097565 ; Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL), Jun 2014, Baltimore, United States. pp.1284-1293 (2014)
|
|
BASE
|
|
Show details
|
|
15 |
Bilingual lexicon extraction from comparable corpora ; Extraction de lexiques bilingues à partir de corpus comparables
|
|
|
|
In: https://tel.archives-ouvertes.fr/tel-00946914 ; Traitement du texte et du document. Université de Nantes, 2013. Français (2013)
|
|
BASE
|
|
Show details
|
|
16 |
Word Co-occurrence Counts Prediction for Bilingual Terminology Extraction from Comparable Corpora.
|
|
|
|
In: 6th International Joint Conference on Natural Language Processing. IJCNLP 2013. ; https://hal.archives-ouvertes.fr/hal-00949198 ; 6th International Joint Conference on Natural Language Processing. IJCNLP 2013., Oct 2013, Nagoya, Japan. pp.12 (2013)
|
|
BASE
|
|
Show details
|
|
18 |
QAlign: A New Method for Bilingual Lexicon Extraction from Comparable Corpora.
|
|
|
|
In: the 13th Conference on Intelligent Text Processing and Computational Linguistics. CICLing 2012. ; https://hal.archives-ouvertes.fr/hal-00949335 ; the 13th Conference on Intelligent Text Processing and Computational Linguistics. CICLing 2012., Mar 2012, New Delhi, India. pp.12 (2012)
|
|
BASE
|
|
Show details
|
|
19 |
Adaptive Dictionary for Bilingual Lexicon Extraction from Comparable Corpora
|
|
|
|
In: 12th International Conference on Language Resources and Evaluation. LREC 2012 (Short paper) ; https://hal.archives-ouvertes.fr/hal-00949215 ; 12th International Conference on Language Resources and Evaluation. LREC 2012 (Short paper), May 2012, Istanbul, Turkey. pp.6 (2012)
|
|
Abstract:
International audience ; One of the main resources used for the task of bilingual lexicon extraction from comparable corpora is : the bilingual dictionary, which is considered as a bridge between two languages. However, no particular attention has been given to this lexicon, except its coverage, and the fact that it can be issued from the general language, the specialized one, or a mix of both. In this paper, we want to highlight the idea that a better consideration of the bilingual dictionary by studying its entries and filtering the non-useful ones, leads to a better lexicon extraction and thus, reach a higher precision. The experiments are conducted on a medical domain corpora. The French-English specialized corpus 'breast cancer' of 1 million words. We show that the empirical results obtained with our filtering process improve the standard approach traditionally dedicated to this task and are promising for future work.
|
|
Keyword:
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; Bilingual lexicon extraction; Comparable corpora; Words filtering
|
|
URL: https://hal.archives-ouvertes.fr/hal-00949215
|
|
BASE
|
|
Hide details
|
|
20 |
Métarecherche pour l'extraction lexicale bilingue à partir de corpus comparables
|
|
|
|
In: 18e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) ; https://hal.archives-ouvertes.fr/hal-00608210 ; 18e Conférence sur le Traitement Automatique des Langues Naturelles (TALN), Jun 2011, Montpellier, France. pp.283-293 (2011)
|
|
BASE
|
|
Show details
|
|
|
|