DE eng

Search in the Catalogues and Directories

Hits 1 – 19 of 19

1
An Information-extraction system for Urdu—a resource-poor language
In: http://www.cedar.buffalo.edu/~rohini/Papers/ACM-TALIP.pdf (2010)
Abstract: There has been an increase in the amount of multilingual text on the Internet due to the proliferation of news sources and blogs. The Urdu language, in particular, has experienced explosive growth on the Web. Text mining for information discovery, which includes tasks such as identifying topics, relationships and events, and sentiment analysis, requires sophisticated natural language processing (NLP). NLP systems begin with modules such as word segmentation, part-of-speech tagging, and morphological analysis and progress to modules such as shallow parsing and named entity tagging. While there have been considerable advances in developing such comprehensive NLP systems for English, the work for Urdu is still in its infancy. The tasks of interest in Urdu NLP includes analyzing data sources such as blogs and comments to news articles to provide insight into social and human behavior. All of this requires a robust NLP system. The objective of this work is to develop an NLP infrastructure for Urdu that is customizable and capable of providing basic analysis on which more advanced information extraction tools can be built. This system assimilates resources from various online sources to facilitate improved named entity tagging and Urdu-to-English transliteration. The annotated data required to train the learning models used here is acquired by standardizing the currently limited resources available for Urdu. Techniques
Keyword: Algorithms; bootstrap learning; Categories and Subject Descriptors; Experimentation; General General Terms; H.0 [Information Systems; Languages; named entity tagging; part of speech tagging; Performance Additional Key Words and Phrases; shallow parsing; text mining; transliterations; Urdu natural language processing
URL: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.364.7220
http://www.cedar.buffalo.edu/~rohini/Papers/ACM-TALIP.pdf
BASE
Hide details
2
Automatic Scoring of Short Handwritten Essays in Reading Comprehension Tests
In: http://www.cedar.buffalo.edu/~srihari/papers/TR-01-07.pdf (2007)
BASE
Show details
3
Information extraction for multi-participant, task-oriented, synchronous, computer-mediated communication: A corpus study of chat data
In: http://research.ihost.com/and2007/cd/Proceedings_files/p131.pdf (2007)
BASE
Show details
4
datasets for Using Verbs and Adjectives to Automatically Classify Blog Sentiment
Chesley, Paula; Xu, Li; Rohini, Srihari. - : dataset self-published online, 2006
BASE
Show details
5
An Expert Lexicon Approach to Identifying English Phrasal Verbs
In: http://acl.ldc.upenn.edu/acl2003/main/pdfs/Li.pdf (2003)
BASE
Show details
6
An Expert Lexicon Approach to Identifying English Phrasal Verbs
In: http://acl.ldc.upenn.edu/P/P03/P03-1065.pdf (2003)
BASE
Show details
7
Use of Multimedia Input in Automated Image Annotation and Content-Based Retrieval
In: http://www.cedar.buffalo.edu/Staff/Rohini/Postscript/spie95.ps.Z (1995)
BASE
Show details
8
An On-Line Cursive Word Recognition System
In: http://www.cedar.buffalo.edu/Linguistics/papers/ieee.ps (1994)
BASE
Show details
9
Visual Semantics: Extracting Visual Information from Text Accompanying Pictures
In: http://www.cedar.buffalo.edu/Piction/papers/vis_sem.ps (1994)
BASE
Show details
10
Document Understanding: Research Directions
In: http://www.cedar.buffalo.edu/Publications/Postscript/Survey.ps (1992)
BASE
Show details
11
TREC 2008 at the University at Buffalo: Legal and Blog Track
In: http://trec.nist.gov/pubs/trec17/papers/suny-buffalo.legal.blog.rev.pdf
BASE
Show details
12
An On-Line Cursive Word Recognition System
In: http://www.cedar.buffalo.edu/handwriting/papers/ieee.pdf
BASE
Show details
13
Automated Scoring of Handwritten Essays based on Latent Semantic Analysis
In: http://www.cedar.buffalo.edu/~srihari/papers/DAS-2006.pdf
BASE
Show details
14
Proceedings of the ACL 2003 Workshop on Multilingual Summarization and Question Answering, pp. 84-93. Question Answering on a Case Insensitive Corpus
In: http://acl.ldc.upenn.edu/acl2003/mlsum/pdfs/Li.pdf
BASE
Show details
15
Learning to Summarize using Coherence
In: http://www.cedar.buffalo.edu/%7Erohini/Papers/pdasNIPS09Wkshp.pdf
BASE
Show details
16
Utterance Topic Model for Generating Coherent Summaries
In: http://www.cedar.buffalo.edu/%7Erohini/Papers/UBSummarizer-TAC2009.pdf
BASE
Show details
17
Utterance Topic Model for Generating Coherent Summaries
In: http://www.nist.gov/tac/publications/2009/participant.papers/UBSummarizer.proceedings.pdf
BASE
Show details
18
A Question Answering System Supported by Information Extraction
BASE
Show details
19
A Hybrid Approach for Named Entity and Sub-Type Tagging
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
19
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern