1 |
Mixed Attention Transformer for Leveraging Word-Level Knowledge to Neural Cross-Lingual Information Retrieval ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Frontiers, Challenges, and Opportunities for Information Retrieval – Report from SWIRL 2012, The Second Strategic Workshop on Information Retrieval in Lorne
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Improving college professors' communication skills inside the classroom [electronic resource] : an exploratory study
|
|
|
|
BASE
|
|
Show details
|
|
4 |
OCRonym: Entity Extraction and Retrieval for Scanned Books ...
|
|
|
|
Abstract:
In the past five years, massive book-scanning projects have produced an explosion in the number of sources for the humanities, available on-line to the broadest possible audiences. Transcribing page images by optical character recognition makes many searching and browsing tasks practical for scholars. But even low OCR error rates compound into high probability of error in a given sentence, and the error rate is even higher for names. We propose to build a prototype system for information extraction and retrieval of noisy OCR. In particular, we will optimize the extraction and retrieval of names, which are highly informative features for detecting topics and events in documents. We will build statistical models of characters and words from scanned books to improve lexical coverage, and we will improve name categorization and disambiguation by linking document contexts to external sources such as Wikipedia. Our testbed comes from over one million scanned books from the Internet Archive. ...
|
|
Keyword:
Library and information science
|
|
URL: https://hcommons.org/deposits/item/hc:11967/ https://dx.doi.org/10.17613/m6wp9h
|
|
BASE
|
|
Hide details
|
|
5 |
Simultaneous Multilingual Search for Translingual Information Retrieval
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Simultaneous Multilingual Search for Translingual Information Retrieval ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Unsupervised Non-topical Classification of Documents
|
|
|
|
In: DTIC (2006)
|
|
BASE
|
|
Show details
|
|
11 |
Cross-Document Coreference on a Large Scale Corpus
|
|
|
|
In: DTIC (2004)
|
|
BASE
|
|
Show details
|
|
16 |
UMass at TREC 2002: Cross Language and Novelty Tracks
|
|
|
|
In: DTIC (2002)
|
|
BASE
|
|
Show details
|
|
17 |
First international Conference on Human Language Technology Research : proceedings of HLT 2001 ; March 18 - 21, 2001, San Diego, California
|
|
|
|
UB Frankfurt Linguistik
|
|
Show details
|
|
|
|