
Search in the Catalogues and Directories

Hits 1 – 5 of 5

1
Improved Named Entity Recognition Through SVM-Based Combination
In: https://hal.archives-ouvertes.fr/hal-01322867 ; [Research Report] Galatasaray University, Computer Science Department. 2013 (2013)
Abstract: Named Entity Recognition (NER) consists of identifying specific textual expressions that represent various types of concepts: persons, locations, organizations, etc. It is an important part of natural language processing because it is often used when building more advanced text-based tools, especially in the context of information extraction. Consequently, many NER tools are now available, designed to handle various sorts of texts, languages and entity types. A recent study on biographical texts showed that the overall indices used to assess the performance of these tools hide the fact that they can behave rather differently depending on the textual context, and could actually be complementary. In this work, we check this assumption by proposing two methods for combining several NER tools: one relies on a voting process and the other is SVM-based. Both take advantage of a global text feature to guide the combination process. We extend an existing corpus to provide enough data for training and testing. We implement a flexible open source platform for benchmarking NER tools. We apply our combination methods to a selection of NER tools, including state-of-the-art ones, as well as our custom tool specifically designed to process hyperlinked biographical texts. Our results show that both proposed combination approaches outperform all the considered standalone NER tools. Of the two, the SVM-based approach reaches the highest performance.
Keyword: [INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]; [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing
URL: https://hal.archives-ouvertes.fr/hal-01322867/file/report.pdf
https://hal.archives-ouvertes.fr/hal-01322867/document
https://hal.archives-ouvertes.fr/hal-01322867
BASE
2
Vers une étude comparative diachronique des mondes lexicaux du féminisme [Towards a diachronic comparative study of the lexical worlds of feminism]
In: https://hal-lirmm.ccsd.cnrs.fr/lirmm-00816322 ; [Rapport de recherche] RR-13010, Lirmm. 2013, pp.13 (2013)
BASE
3
Lexicon-Grammar, a method of linguistic description ; Le lexique-grammaire, une méthode de description linguistique ; O Léxico-Gramática, um método de descrição linguística
In: https://hal-upec-upem.archives-ouvertes.fr/hal-00823401 ; 2013, pp.1-125 (2013)
BASE
4
ProlexFeeder - Populating a Multilingual Ontology of Proper Names from Open Sources
In: https://hal.archives-ouvertes.fr/hal-01216002 ; [Research Report] 306, Laboratoire d'Informatique, Université François Rabelais Tours, France. 2013 (2013)
BASE
5
Proceedings of the Workshop "Language, Cognition and Computational Models"
In: https://hal.archives-ouvertes.fr/hal-00997337 ; 2013 (2013)
BASE
