1 |
Infusing Automatic Question Generation with Natural Language Understanding
BASE
2 |
Automatic Language Identification for Metadata Records: Measuring the Effectiveness of Various Approaches
Abstract:
Automatic language identification has been applied to short texts such as queries in information retrieval, but it has not yet been applied to metadata records. Applying this technology to metadata records, particularly their title elements, would enable creators of metadata records to obtain a value for the language element, which is often left blank due to a lack of linguistic expertise. It would also make it possible to add a language value to existing metadata records that currently lack one. Titles are a challenging case for language identification mainly because they are short, which makes it harder to identify a language accurately. This study applied four proven approaches to language identification, as well as one open-source approach, to a collection of multilingual titles of books and movies. Of the five approaches considered, a reduced N-gram frequency profile and distance measure approach outperformed all others, accurately identifying over 83% of all titles in the collection. Future plans are to offer this technology to curators of digital collections.
Keywords:
Computational linguistics; digital collections; Language and languages -- Identification; language identification; Machine-readable bibliographic data; metadata
URL: https://digital.library.unt.edu/ark:/67531/metadc801895/
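The abstract in this record credits the best result to a reduced N-gram frequency profile combined with a distance measure, but the record gives no implementation details. The Python sketch below only illustrates the general idea, using a rank-order ("out-of-place") distance in the style of Cavnar and Trenkle. The function names, the `top_k` cutoff, and the toy training sentences are hypothetical and are not taken from the study.

```python
from collections import Counter


def ngram_profile(text, n_max=3, top_k=300):
    """Map the top_k most frequent character n-grams (n = 1..n_max) to their rank."""
    counts = Counter()
    for token in text.lower().split():
        padded = f"_{token}_"  # underscores mark word boundaries
        for n in range(1, n_max + 1):
            for i in range(len(padded) - n + 1):
                counts[padded[i:i + n]] += 1
    # Sort by descending frequency, then alphabetically, so ranks are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return {gram: rank for rank, (gram, _) in enumerate(ranked[:top_k])}


def out_of_place_distance(doc_profile, lang_profile):
    """Sum of rank differences; n-grams absent from lang_profile get a fixed penalty."""
    max_penalty = len(lang_profile)
    return sum(
        abs(rank - lang_profile[gram]) if gram in lang_profile else max_penalty
        for gram, rank in doc_profile.items()
    )


def identify_language(title, lang_profiles):
    """Return the language whose profile is closest to the title's profile."""
    doc_profile = ngram_profile(title)
    return min(
        lang_profiles,
        key=lambda lang: out_of_place_distance(doc_profile, lang_profiles[lang]),
    )


# Toy training text (an assumption for illustration; real profiles need far larger corpora).
TRAINING = {
    "en": "the quick brown fox jumps over the lazy dog and the cat sat on the mat with the other animals",
    "de": "der schnelle braune fuchs springt über den faulen hund und die katze sitzt auf der matte",
    "fr": "le renard brun rapide saute par dessus le chien paresseux et le chat est assis sur le tapis",
}
PROFILES = {lang: ngram_profile(text) for lang, text in TRAINING.items()}

print(identify_language("The Cat and the Dog", PROFILES))
```

Reducing the profile here simply means keeping only the `top_k` highest-ranked n-grams. With titles as short as those described in the abstract, most n-grams occur once, so the penalty for unseen n-grams tends to dominate the distance.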
BASE
3 |
Co-Training for Topic Classification of Scholarly Data
In: 2015 Conference on Empirical Methods in Natural Language Processing, September 17-21, 2015. Lisbon, Portugal. (2015)
BASE
4 |
Exploration of Visual, Acoustic, and Physiological Modalities to Complement Linguistic Representations for Sentiment Analysis
BASE
8 |
Finding Meaning in Context Using Graph Algorithms in Mono- and Cross-lingual Settings
BASE
9 |
Sentence Similarity Analysis with Applications in Automatic Short Answer Grading
BASE
10 |
Measuring Semantic Relatedness Using Salient Encyclopedic Concepts
BASE
11 |
Topic Modeling on Historical Newspapers
In: Association for Computational Linguistics (ACL) Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LATECH), 2011, Portland, Oregon, United States (2011)
BASE
12 |
Multilingual Subjectivity: Are More Languages Better?
In: International Conference on Computational Linguistics (COLING), 2010, Beijing, China (2010)
BASE
13 |
SemEval-2010 Task 2: Cross-Lingual Lexical Substitution
In: Association for Computational Linguistics (ACL) Workshop on Semantic Evaluations (SemEval), 2010, Uppsala, Sweden (2010)
BASE
14 |
Annotating and Identifying Emotions in Text
In: Intelligent Information Access, 2010. Berlin: Springer-Verlag, v. 301/2010, pp. 21-38. (2010)
BASE
15 |
Text Mining for Automatic Image Tagging
In: Twenty-third Annual International Conference on Computational Linguistics (COLING), 2010, Beijing, China (2010)
BASE
16 |
Amazon Mechanical Turk for Subjectivity Word Sense Disambiguation
In: North American Chapter of the Association for Computational Linguistics Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, 2010, Los Angeles, California, United States (2010)
BASE
17 |
Linguistic Ethnography: Identifying Dominant Word Classes in Text
In: Conference on Computational Linguistics and Intelligent Text Processing (CICLing), 2009, Mexico City, Mexico (2009)
BASE
18 |
Combining Lexical Resources for Contextual Synonym Expansion
In: International Conference in Recent Advances in Natural Language Processing (RANLP), 2009, Borovets, Bulgaria (2009)
BASE
19 |
The Decomposition of Human-Written Book Summaries
In: Conference on Computational Linguistics and Intelligent Text Processing (CICLing), 2009, Mexico City, Mexico (2009)
BASE
20 |
Subjectivity Word Sense Disambiguation
In: Conference on Empirical Methods in Natural Language Processing (EMNLP), 2009, Singapore (2009)
BASE