1 |
C3: Continued Pretraining with Contrastive Weak Supervision for Cross Language Ad-Hoc Retrieval ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Transfer Learning Approaches for Building Cross-Language Dense Retrieval Models ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
The Multilingual TEDx Corpus for Speech Recognition and Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Finding Old Answers to New Math Questions: The ARQMath Lab at CLEF 2020
|
|
|
|
BASE
|
|
Show details
|
|
5 |
USC-SFI MALACH Interviews and Transcripts English – Speech Recognition Edition
|
|
|
|
BASE
|
|
Show details
|
|
6 |
USC-SFI MALACH Interviews and Transcripts English – Speech Recognition Edition ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Avocado Research Email Collection
|
|
|
|
Abstract:
*Introduction* Avocado Research Email Collection consists of emails and attachments taken from 279 accounts of a defunct information technology company referred to as "Avocado". Most of the accounts are those of Avocado employees; the remainder represent shared accounts such as "Leads", or system accounts such as "Conference Room Upper Canada". The collection consists of the processed personal folders of these accounts with metadata describing folder structure, email characteristics and contacts, among others. It is expected to be useful for social network analysis, e-discovery and related fields. *Data* The source data for the collection consisted of Personal Storage Table (PST) files for 282 accounts. A PST file is used by MS Outlook to store emails, calendar entries, contact details, and related information. Data was extracted from the PST files using libpst version 0.6.54. Three files produced no output and and are not included in the collection. Each account is referred to as a "custodian" although some of the accounts do not correspond to humans. The collection is divided into metadata and text. The metadata is represented in XML, with a single top-level XML file listing the custodians, and then one XML file per custodian listing all items extracted from that custodian's PST files. The full XML tree can be read by loading the top-level file with an XML parser that handles directives. All XML metadata files are encoded in UTF-8. The text contains the extracted text of the items in the custodians' folders, with the extracted text for each item being held in a separate file. The text files are then zipped into a zip file per custodian. *Licensing* Users are required to sign two license agreements in order to access this corpus, the Avocado Collection Organizational License Agreement and the Avocado Collection End User Agreement. Those agreements can be viewed in the License field of this catalog entry. *Updates* None at this time.
|
|
URL: https://catalog.ldc.upenn.edu/LDC2015T03
|
|
BASE
|
|
Hide details
|
|
9 |
An Information Retrieval Test Collection for English SMS Conversations
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Microblogging Temporal Summarization: Filtering Important Twitter Updates for Breaking News
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Frontiers, Challenges, and Opportunities for Information Retrieval – Report from SWIRL 2012, The Second Strategic Workshop on Information Retrieval in Lorne
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Formative Evaluation for Multilingual Multimedia Search and Sense-Making
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Advances in Multilingual and Multimodal Information Retrieval : 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers
|
|
|
|
UB Frankfurt Linguistik
|
|
Show details
|
|
18 |
Combining Evidence from Unconstrained Spoken Term Frequency Estimation for Improved Speech Retrieval
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Classifying Attitude by Topic Aspect for English and Chinese Document Collections
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Overview of the CLEF-2006 cross-language speech retrieval track
|
|
|
|
In: Oard, Douglas W., Wang, Jianqiang, Jones, Gareth J.F. orcid:0000-0003-2923-8365 , White, Ryen W., Pecina, Pavel, Soergel, Dagobert, Huang, Xiaoli and Shafran, Izhak (2007) Overview of the CLEF-2006 cross-language speech retrieval track. In: CLEF 2006: Workshop on Cross-Language Information Retrieval and Evaluation, 20-22 Sept. 2006, Alicante, Spain. (2007)
|
|
BASE
|
|
Show details
|
|
|
|