Page: 1 2 3 4 5 6 7 8 9... 89
81 |
Grammatically Coded Corpus Of Spoken Lithuanian: Methodology And Development ...
|
|
|
|
BASE
|
|
Show details
|
|
82 |
Grammatically Coded Corpus Of Spoken Lithuanian: Methodology And Development ...
|
|
|
|
BASE
|
|
Show details
|
|
84 |
Crowdsourcing lexical semantic judgements from bilingual dictionary users
|
|
|
|
BASE
|
|
Show details
|
|
85 |
Concept and entity grounding using indirect supervision
|
|
|
|
Abstract:
Extracting and disambiguating entities and concepts is a crucial step toward understanding natural language text. In this thesis, we consider the problem of grounding concepts and entities mentioned in text to one or more knowledge bases (KBs). A well-studied scenario of this problem is the one in which documents are given in English and the goal is to identify concept and entity mentions, and find the corresponding entries the mentions refer to in Wikipedia. We extend this problem in two directions: First, we study identifying and grounding entities written in any language to the English Wikipedia. Second, we investigate using multiple KBs which do not contain rich textual and structural information Wikipedia does. These more involved settings pose a few additional challenges beyond those addressed in the standard English Wikification problem. Key among them is that no supervision is available to facilitate training machine learning models. The first extension, cross-lingual Wikification, introduces problems such as recognizing multilingual named entities mentioned in text, translating non-English names into English, and computing word similarity across languages. Since it is impossible to acquire manually annotated examples for all languages, building models for all languages in Wikipedia requires exploring indirect or incidental supervision signals which already exist in Wikipedia. For the second setting, we need to deal with the fact that most KBs do not contain the rich information Wikipedia has; consequently, the main supervision signal used to train Wikification rankers does not exist anymore. In this thesis, we show that supervision signals can be obtained by carefully examining the redundancy and relations between multiple KBs. By developing algorithms and models which harvest these incidental signals, we can achieve better performance on these tasks.
|
|
Keyword:
Concept disambiguation; Cross-lingual wikification; Entity disambiguation; Entity linking; Incidental supervision; Indirect supervision; Named entity recognition; Wikification
|
|
URL: http://hdl.handle.net/2142/98336
|
|
BASE
|
|
Hide details
|
|
86 |
Desambiguación Verbal Automática: un estudio sobre el rendimiento de la información semántica argumental ; Verb Sense Disambiguation: a study about the performance of argumental semantic information
|
|
|
|
BASE
|
|
Show details
|
|
87 |
Knowledge-driven entity recognition and disambiguation in biomedical text
|
|
|
|
BASE
|
|
Show details
|
|
88 |
Evaluation of word embedding vector averaging functions for biomedical word sense disambiguation
|
|
|
|
BASE
|
|
Show details
|
|
89 |
Unsupervised, knowledge-free, and interpretable word sense disambiguation
|
|
|
|
In: EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Proceedings (2017)
|
|
BASE
|
|
Show details
|
|
90 |
Domain-specific Named Entity Disambiguation in Historical Memoirs
|
|
|
|
BASE
|
|
Show details
|
|
91 |
Dalla Word Sense Disambiguation alla sintassi: il problema dell'articolo partitivo in italiano
|
|
|
|
BASE
|
|
Show details
|
|
92 |
Uso de representaciones vectoriales de las palabras para la detección de dobles sentidos (puns)
|
|
|
|
BASE
|
|
Show details
|
|
93 |
Approches d'analyse distributionnelle pour améliorer la désambiguïsation sémantique
|
|
|
|
In: Journées internationales d'Analyse statistique des Données Textuelles (JADT) ; https://hal.archives-ouvertes.fr/hal-01477502 ; Journées internationales d'Analyse statistique des Données Textuelles (JADT), Jun 2016, Nice, France ; https://jadt2016.sciencesconf.org/ (2016)
|
|
BASE
|
|
Show details
|
|
94 |
Ambiguity Diagnosis for Terms in Digital Humanities
|
|
|
|
In: Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-01423650 ; Language Resources and Evaluation Conference, May 2016, Portorož, Slovenia (2016)
|
|
BASE
|
|
Show details
|
|
95 |
СИНТАКСИЧЕСКАЯ МНОГОЗНАЧНОСТЬ И НЕОДНОЗНАЧНОСТЬ В ПЕРСПЕКТИВЕ МАШИННОГО ПЕРЕВОДА
|
|
КОЗЕРЕНКО ЕЛЕНА БОРИСОВНА. - : Федеральное государственное бюджетное образовательное учреждение высшего образования «Московский педагогический государственный университет», 2016
|
|
BASE
|
|
Show details
|
|
96 |
Building Semantic Trees from XML Documents
|
|
|
|
In: ISSN: 1570-8268 ; Journal of Web Semantics ; https://hal-univ-pau.archives-ouvertes.fr/hal-02079156 ; Journal of Web Semantics, Elsevier, 2016, 37-38, pp.1-24. ⟨10.1016/J.WEBSEM.2016.03.002⟩ (2016)
|
|
BASE
|
|
Show details
|
|
97 |
Semantic Interoperability of Multilingual Lexical Resources in Lexical Linked Data ; Interopérabilité Sémantique Multi-lingue des Ressources Lexicales en Données Liées Ouvertes
|
|
|
|
In: https://tel.archives-ouvertes.fr/tel-01681358 ; Informatique et langage [cs.CL]. Université Grenoble Alpes, 2016. Français. ⟨NNT : 2016GREAM067⟩ (2016)
|
|
BASE
|
|
Show details
|
|
98 |
Using Domain Ontologies for Classification and Semantic Interpretation of Documents
|
|
|
|
In: Proceedings of ALLDATA 2016 ; International Workshop on Knowledge Extraction and Semantic Annotation (KESA 2016) in ALLDATA 2016 : 2nd International Conference on Big Data, Small Data, Linked Data and Open Data ; https://hal.archives-ouvertes.fr/hal-01535945 ; International Workshop on Knowledge Extraction and Semantic Annotation (KESA 2016) in ALLDATA 2016 : 2nd International Conference on Big Data, Small Data, Linked Data and Open Data, Feb 2016, Lisbon, Portugal. pp. 76-81 (2016)
|
|
BASE
|
|
Show details
|
|
99 |
Automatic processing of Tunisian dialect: construction of linguistic resources ; TRAITEMENT AUTOMATIQUE DU DIALECTE TUNISIEN : CONSTRUCTION DE RESSOURCES LINGUISTIQUES
|
|
|
|
In: https://hal.archives-ouvertes.fr/tel-02869866 ; Informatique et langage [cs.CL]. Université de Sfax (Tunisie), 2016. Français (2016)
|
|
BASE
|
|
Show details
|
|
100 |
Prosodic disambiguation and attachment height
|
|
|
|
In: Speech Prosody 8 ; https://halshs.archives-ouvertes.fr/halshs-01422841 ; Speech Prosody 8, May 2016, Boston, United States. pp.1176-1180 (2016)
|
|
BASE
|
|
Show details
|
|
Page: 1 2 3 4 5 6 7 8 9... 89
|
|