1 |
Infusing Automatic Question Generation with Natural Language Understanding
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Automatic Language Identification for Metadata Records: Measuring the Effectiveness of Various Approaches
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Co-Training for Topic Classification of Scholarly Data
|
|
|
|
In: 2015 Conference on Empirical Methods in Natural Language Processing, September 17-21, 2015. Lisbon, Portugal. (2015)
|
|
BASE
|
|
Show details
|
|
4 |
Exploration of Visual, Acoustic, and Physiological Modalities to Complement Linguistic Representations for Sentiment Analysis
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Multilingual Word Sense Disambiguation Using Wikipedia
|
|
|
|
Abstract:
Ambiguity is inherent to human language. In particular, word sense ambiguity is prevalent in all natural languages, with a large number of the words in any given language carrying more than one meaning. Word sense disambiguation is the task of automatically assigning the most appropriate meaning to a polysemous word within a given context. Generally the problem of resolving ambiguity in literature has revolved around the famous quote “you shall know the meaning of the word by the company it keeps.” In this thesis, we investigate the role of context for resolving ambiguity through three different approaches. Instead of using a predefined monolingual sense inventory such as WordNet, we use a language-independent framework where the word senses and sense-tagged data are derived automatically from Wikipedia. Using Wikipedia as a source of sense-annotations provides the much needed solution for knowledge acquisition bottleneck. In order to evaluate the viability of Wikipedia based sense-annotations, we cast the task of disambiguating polysemous nouns as a monolingual classification task and experimented on lexical samples from four different languages (viz. English, German, Italian and Spanish). The experiments confirm that the Wikipedia based sense annotations are reliable and can be used to construct accurate monolingual sense classifiers. It is a long belief that exploiting multiple languages helps in building accurate word sense disambiguation systems. Subsequently, we developed two approaches that recast the task of disambiguating polysemous nouns as a multilingual classification task. The first approach for multilingual word sense disambiguation attempts to effectively use a machine translation system to leverage two relevant multilingual aspects of the semantics of text. First, the various senses of a target word may be translated into different words, which constitute unique, yet highly salient signal that effectively expand the target word’s feature space. Second, the translated context words themselves embed co-occurrence information that a translation engine gathers from very large parallel corpora. The second approach for multlingual word sense disambiguation attempts to reduce the reliance on the machine translation system during training by using the multilingual knowledge available in Wikipedia through its interlingual links. Finally, the experiments on a lexical sample from four different languages confirm that the multilingual systems perform better than the monolingual system and significantly improve the disambiguation accuracy.
|
|
Keyword:
multilingual; supervised learning; Wikipedia; word sense disambiguation
|
|
URL: https://digital.library.unt.edu/ark:/67531/metadc500036/
|
|
BASE
|
|
Hide details
|
|
8 |
Finding Meaning in Context Using Graph Algorithms in Mono- and Cross-lingual Settings
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Sentence Similarity Analysis with Applications in Automatic Short Answer Grading
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Measuring Semantic Relatedness Using Salient Encyclopedic Concepts
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Topic Modeling on Historical Newspapers
|
|
|
|
In: Association for Computational Linguistics (ACL) Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LATECH), 2011, Portland, Oregon, United States (2011)
|
|
BASE
|
|
Show details
|
|
12 |
Multilingual Subjectivity: Are More Languages Better?
|
|
|
|
In: International Conference on Computational Linguistics (COLING), 2010, Beijing, China (2010)
|
|
BASE
|
|
Show details
|
|
13 |
SemEval-2010 Task 2: Cross-Lingual Lexical Substitution
|
|
|
|
In: Association for Computational Linguistics (ACL) Workshop on Semantic Evaluations (SemEval), 2010, Uppsala, Sweden (2010)
|
|
BASE
|
|
Show details
|
|
14 |
Annotating and Identifying Emotions in Text
|
|
|
|
In: Intelligent Information Access, 2010. Berlin: Springer-Verlag, v. 301/2010, pp. 21-38. (2010)
|
|
BASE
|
|
Show details
|
|
15 |
Text Mining for Automatic Image Tagging
|
|
|
|
In: Twenty-third Annual International Conference on Computational Linguistics (COLING), 2010, Beijing, China (2010)
|
|
BASE
|
|
Show details
|
|
16 |
Amazon Mechanical Turk for Subjectivity Word Sense Disambiguation
|
|
|
|
In: North American Chapter of the Association for Computational Linguistics Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, 2010, Los Angeles, California, United States (2010)
|
|
BASE
|
|
Show details
|
|
17 |
Linguistic Ethnography: Identifying Dominant Word Classes in Text
|
|
|
|
In: Conference on Computational Linguistics and Intelligent Text Processing (CICLing), 2009, Mexico City, Mexico (2009)
|
|
BASE
|
|
Show details
|
|
18 |
Combining Lexical Resources for Contextual Synonym Expansion
|
|
|
|
In: International Conference in Recent Advances in Natural Language Processing (RANLP), 2009, Borovets, Bulgaria (2009)
|
|
BASE
|
|
Show details
|
|
19 |
The Decomposition of Human-Written Book Summaries
|
|
|
|
In: Conference on Computational Linguistics and Intelligent Text Processing (CICLing), 2009, Mexico City, Mexico (2009)
|
|
BASE
|
|
Show details
|
|
20 |
Subjectivity Word Sense Disambiguation
|
|
|
|
In: Conference on Empirical Methods in Natural Language Processing (EMNLP), 2009, Singapore (2009)
|
|
BASE
|
|
Show details
|
|
|
|