21 |
Unsupervised acquisition of morphological resources for Ukrainian
|
|
|
|
In: Computational Linguistics and Intelligent Systems ; https://hal.archives-ouvertes.fr/hal-01736400 ; Computational Linguistics and Intelligent Systems, Apr 2017, Kharkiv, Ukraine (2017)
|
|
BASE
|
|
Show details
|
|
22 |
Understanding of unknown medical words
|
|
|
|
In: Biomedical NLP Workshop associated with RANLP 2017 ; https://hal.archives-ouvertes.fr/hal-01736408 ; Biomedical NLP Workshop associated with RANLP 2017, Sep 2017, Varna, Bulgaria (2017)
|
|
BASE
|
|
Show details
|
|
23 |
Information Extraction for the Seed Development Regulatory Networks of Arabidopsis Thaliana. ; Extraction d’Information pour les réseaux de régulation de la graine chez Arabidopsis Thaliana.
|
|
|
|
In: https://tel.archives-ouvertes.fr/tel-01613508 ; Computation and Language [cs.CL]. Université Paris Saclay (COmUE), 2017. English. ⟨NNT : 2017SACLS027⟩ (2017)
|
|
BASE
|
|
Show details
|
|
24 |
CLEF eHealth 2017 Multilingual Information Extraction task Overview: ICD10 Coding of Death Certificates in English and French.
|
|
|
|
In: Workshop of the Cross-Language Evaluation Forum ; https://hal.archives-ouvertes.fr/hal-01665374 ; Workshop of the Cross-Language Evaluation Forum, CEUR-WS, Jan 2017, Dublin, Ireland (2017)
|
|
BASE
|
|
Show details
|
|
25 |
Representation of complex terms in a vector space structured by an ontology for a normalization task
|
|
|
|
In: BioNLP 2017 ; BioNLP 2017 Workshop, Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-01582292 ; BioNLP 2017 Workshop, Association for Computational Linguistics, Aug 2017, Vancouver, Canada. ⟨10.18653/v1/W17-2312⟩ ; http://aclweb.org/anthology/W17-2312 (2017)
|
|
BASE
|
|
Show details
|
|
26 |
A Study of Reuse and Plagiarism in Speech and Natural Language Processing papers
|
|
|
|
In: BIRNDL 2016 ; https://hal.archives-ouvertes.fr/hal-01840817 ; BIRNDL 2016, Jan 2016, Newark, United States (2016)
|
|
BASE
|
|
Show details
|
|
27 |
INEX Tweet Contextualization Task: Evaluation, Results and Lesson Learned
|
|
|
|
In: ISSN: 0306-4573 ; Information Processing and Management ; https://hal-amu.archives-ouvertes.fr/hal-01479297 ; Information Processing and Management, Elsevier, 2016, 52 (5), pp.801-819. ⟨10.1016/j.ipm.2016.03.002⟩ (2016)
|
|
BASE
|
|
Show details
|
|
28 |
Generating and executing complex natural language queries across linked data
|
|
|
|
In: International Congress on Medical Informatics ; https://hal.archives-ouvertes.fr/hal-01971222 ; International Congress on Medical Informatics, Jan 2015, Sao Paulo, Brazil (2015)
|
|
BASE
|
|
Show details
|
|
29 |
A Unified Kernel Approach For Learning Typed Sentence Rewritings
|
|
|
|
In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, ; Annual Meeting of the Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-02281919 ; Annual Meeting of the Association for Computational Linguistics, The Association for Computer Linguistics, Jan 2015, Beijing, China. pp.939 - 949, ⟨10.3115/v1/P15-1091⟩ ; https://www.aclweb.org/anthology/P15-1091 (2015)
|
|
BASE
|
|
Show details
|
|
30 |
Generative event schema induction with entity disambiguation
|
|
|
|
In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing ; https://hal-cea.archives-ouvertes.fr/cea-01844047 ; Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Jul 2015, Beijing, China. pp.188-197 (2015)
|
|
BASE
|
|
Show details
|
|
31 |
Reformatting clinical records based on global layout statistics
|
|
|
|
In: International Symposium on Semantic Mining in Biomedicine ; https://hal.archives-ouvertes.fr/hal-01831245 ; International Symposium on Semantic Mining in Biomedicine, Jan 2014, Aveiro, Portugal (2014)
|
|
BASE
|
|
Show details
|
|
32 |
Data-driven Synset Induction and Disambiguation for Wordnet Development
|
|
|
|
In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.inria.fr/hal-01088000 ; Language Resources and Evaluation, Springer Verlag, 2014, 48 (4), pp.655-677. ⟨10.1007/s10579-014-9291-2⟩ (2014)
|
|
Abstract:
International audience ; Automatic methods for wordnet development in languages other than English generally exploit information found in Princeton WordNet (PWN) and translations extracted from parallel corpora. A common approach consists in preserving the structure of PWN and transferring its content in new languages using alignments, possibly combined with information extracted from multilingual semantic resources. Even if the role of PWN remains central in this process, these automatic methods offer an alternative to the manual elaboration of new wordnets. However, their limited coverage has a strong impact on that of the resulting resources. Following this line of research, we apply a cross-lingual word sense disambiguation method to wordnet development. Our approach exploits the output of a data-driven sense induction method that generates sense clusters in new languages, similar to wordnet synsets, by identifying word senses and relations in parallel corpora. We apply our cross-lingual word sense disambiguation method to the task of enriching a French wordnet resource, the WOLF, and show how it can be efficiently used for increasing its coverage. Although our experiments involve the English-French language pair, the proposed methodology is general enough to be applied to the development of wordnet resources in other languages for which parallel corpora are available. Finally, we show how the disambiguation output can serve to reduce the granularity of new wordnets and the degree of polysemy present in PWN.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; ACM: I.: Computing Methodologies/I.2: ARTIFICIAL INTELLIGENCE/I.2.7: Natural Language Processing; cross-lingual word sense disambiguation; parallel corpora; sense clustering; word sense induction; WordNet
|
|
URL: https://hal.inria.fr/hal-01088000/file/LRE_Apidianaki_Sagot_camera_ready.pdf https://hal.inria.fr/hal-01088000 https://hal.inria.fr/hal-01088000/document https://doi.org/10.1007/s10579-014-9291-2
|
|
BASE
|
|
Hide details
|
|
33 |
Genetic algorithm-based tuning of the C-Value for term ranking
|
|
|
|
In: International Conference on Stochastic Modeling Techniques and Data Analysis ; https://hal.archives-ouvertes.fr/hal-01972757 ; International Conference on Stochastic Modeling Techniques and Data Analysis, Jan 2014, Lisbonne, Portugal (2014)
|
|
BASE
|
|
Show details
|
|
34 |
Disfluency analysis and automatic detection in conversational spontaneous speech ; Analyse et détection automatique de disfluences dans la parole spontanée conversationnelle
|
|
|
|
In: https://tel.archives-ouvertes.fr/tel-01164385 ; Informatique et langage [cs.CL]. Université Paris Sud - Paris XI, 2014. Français. ⟨NNT : 2014PA112415⟩ (2014)
|
|
BASE
|
|
Show details
|
|
35 |
Overview of INEX Tweet Contextualization 2014 track
|
|
|
|
In: Proceedings of Conference and Labs of the Evaluation Forum ; Conference on Multilingual and Multimodal Information Access Evaluation (CLEF) - 2014 ; https://hal.archives-ouvertes.fr/hal-01138069 ; Conference on Multilingual and Multimodal Information Access Evaluation (CLEF) - 2014, Sep 2014, Sheffield - UK, United Kingdom. pp. 1-6 (2014)
|
|
BASE
|
|
Show details
|
|
36 |
Tuning HeidelTime for identifying time expressions in clinical texts in English and French
|
|
|
|
In: International Workshop on Health Text Mining and Information Analysis ; https://hal.archives-ouvertes.fr/hal-01972761 ; International Workshop on Health Text Mining and Information Analysis, Jan 2014, Gothenburg, Sweden (2014)
|
|
BASE
|
|
Show details
|
|
37 |
Reducing VSM data sparseness by generalizing contexts: application to health text mining
|
|
|
|
In: International Workshop on Health Text Mining and Information Analysis ; https://hal.archives-ouvertes.fr/hal-01972762 ; International Workshop on Health Text Mining and Information Analysis, Jan 2014, Gothenburg, Sweden (2014)
|
|
BASE
|
|
Show details
|
|
38 |
Traitement automatique des entités nommées en arabe : détection et traduction
|
|
|
|
In: ISSN: 1248-9433 ; EISSN: 1965-0906 ; Revue TAL ; https://hal.archives-ouvertes.fr/hal-01663487 ; Revue TAL, ATALA (Association pour le Traitement Automatique des Langues), 2014, 54, pp.101-132 (2014)
|
|
BASE
|
|
Show details
|
|
39 |
Event role extraction using domain-relevant word representations
|
|
|
|
In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) ; https://hal-cea.archives-ouvertes.fr/cea-01844443 ; Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Oct 2014, Doha, Qatar. pp.1852-1857 (2014)
|
|
BASE
|
|
Show details
|
|
40 |
Building specialized bilingual lexicons using large-scale background knowledge
|
|
|
|
In: 2013 Conference on Empirical Methods in Natural Language Processing, EMNLP 2013 ; https://hal-cea.archives-ouvertes.fr/cea-01844695 ; 2013 Conference on Empirical Methods in Natural Language Processing, EMNLP 2013, Oct 2013, Seattle, United States. pp.479-489 (2013)
|
|
BASE
|
|
Show details
|
|
|
|