DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5 6
Hits 1 – 20 of 119

1
Second DIHARD Challenge Development - Eleven Sources
Ryant, Neville; Liberman, Mark; Fiumara, James; Cieri, Christopher. - : Linguistic Data Consortium, 2021. : https://www.ldc.upenn.edu, 2021
Abstract: *Introduction* Second DIHARD Challenge Development - Eleven Sources was developed by LDC and contains approximately 22 hours of English and Chinese speech data along with corresponding annotations used in support of the Second DIHARD Challenge. The DIHARD Challenges are a set of shared tasks on diarization focusing on "hard" diarization; that is, speech diarization for challenging corpora where there was an expectation that existing state-of-the-art systems would fare poorly. As with the first challenge, the second development and evaluation sets were drawn from a diverse sampling of sources including monologues, map task dialogues, broadcast interviews, sociolinguistic interviews, meeting speech, speech in restaurants, clinical recordings, extended child language acquisition recordings, and YouTube videos. *Data* This release, when combined with Second DIHARD Challenge Development - SEEDLingS (LDC2021S11), contains the development set audio data and annotation, except for CHiME-5 audio files, which must be obtained from the University of Sheffield. Data sources used in this release are as follows (all sources are in English unless otherwise indicated): * Autism Diagnosis Observation Schedule (ADOS) interviews * CHiME-5 dinner party recordings (annotations only in this release) * Conversations in Restaurants * DCIEM/HCRC map task (LDC96S38) * Audiobook recordings from LibriVox * Meeting speech from 2004 Spring NIST Rich Transcription (RT-04S) Development (LDC2007S11) and Evaluation (LDC2007S12) releases * 2001 U.S. Supreme Court oral arguments * Sociolinguistic interviews from SLX Corpus of Classic Sociolinguistic Interviews (LDC2003T15) * Mixer 6 Speech (LDC2013S03) * English and Chinese video collected by LDC as part of the Video Annotation for Speech Technologies (VAST) project * YouthPoint radio interviews All audio is provided in the form of 16 kHz, 16-bit, mono-channel FLAC files. The diarization for each recording is stored as a NIST Rich Transcription Time Marked (RTTM) file. RTTM files are space-separated text files containing one turn per line. Segmentation files are stored as HTK label files. Each of these files contains one speech segment per line. Scoring regions for each recording are specific by un-partitioned evaluation map (UEM) files. All annotation file types are encoded as UTF-8. More information about file formats, data sources and domains is contained in the included documentation. *Samples* Please view these samples: * Audio Sample (FLAC) * Label Sample (TXT) * RTTM Sample (TXT) *Updates* None at this time.
URL: https://catalog.ldc.upenn.edu/LDC2021S10
BASE
Hide details
2
Second DIHARD Challenge Development - SEEDLingS
Ryant, Neville; Liberman, Mark; Fiumara, James. - : Linguistic Data Consortium, 2021. : https://www.ldc.upenn.edu, 2021
BASE
Show details
3
First DIHARD Challenge -- System Submissions and Scores ...
BASE
Show details
4
First DIHARD Challenge -- System Submissions and Scores ...
BASE
Show details
5
Second DIHARD Challenge Development - SEEDLingS ...
Liberman, Mark; Fiumara, James; Cieri, Christopher. - : Linguistic Data Consortium, 2021
BASE
Show details
6
Coding categories relevant to interaction
Ogden, Richard; Cantarutti, Marina. - : Oxford University Press, 2021
BASE
Show details
7
Second DIHARD Challenge Development - Eleven Sources ...
Liberman, Mark; Fiumara, James; Cieri, Christopher. - : Linguistic Data Consortium, 2021
BASE
Show details
8
Automated Analysis of Digitized Letter Fluency Data
In: Front Psychol (2021)
BASE
Show details
9
Beyond Citations: Corpus-based Methods for Detecting the Impact of Research Outcomes on Society
Rezapour, Rezvaneh [Verfasser]; Bopp, Jutta [Verfasser]; Fiedler, Norman [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
Show details
10
A Corpus Linguistic Perspective on Contemporary German Pop Lyrics with the Multi-Layer Annotated "Songkorpus"
Schneider, Roman [Verfasser]; Calzolari, Nicoletta [Herausgeber]; Béchet, Frédéric [Herausgeber]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
Show details
11
Using Automatic Speech Recognition in Spoken Corpus Curation
Gorisch, Jan [Verfasser]; Gref, Michael [Verfasser]; Schmidt, Thomas [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
Show details
12
Improving Sentence Boundary Detection for Spoken Language Transcripts
Rehbein, Ines [Verfasser]; Ruppenhofer, Josef [Verfasser]; Schmidt, Thomas [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
Show details
13
Interoperability in an Infrastructure Enabling Multidisciplinary Research: The case of CLARIN
de Jong, Franciska [Verfasser]; Maegaard, Bente [Verfasser]; Fišer, Darja [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
Show details
14
Treebanking User-Generated Content: A Proposal for a Unified Representation in Universal Dependencies
Sanguinetti, Manuela [Verfasser]; Bosco, Cristina [Verfasser]; Cassidy, Lauren [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
Show details
15
Privacy by Design and Language Resources
Kamocki, Paweł [Verfasser]; Witt, Andreas [Verfasser]; Calzolari, Nicoletta [Herausgeber]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
Show details
16
RKorAPClient: An R Package for Accessing the German Reference Corpus DeReKo via KorAP
Kupietz, Marc [Verfasser]; Diewald, Nils [Verfasser]; Margaretha, Eliza [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
Show details
17
Fine-grained Named Entity Annotations for German Biographic Interviews
Ruppenhofer, Josef [Verfasser]; Rehbein, Ines [Verfasser]; Flinz, Carolina [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
Show details
18
Corpus Query Lingua Franca part II: Ontology
Evert, Stefan [Verfasser]; Harlamov, Oleg [Verfasser]; Heinrich, Philipp [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
Show details
19
Doctor Who? Framing Through Names and Titles in German
van den Berg, Esther [Verfasser]; Korfhage, Katharina [Verfasser]; Ruppenhofer, Josef [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
Show details
20
Corpus REDEWIEDERGABE
Brunner, Annelen [Verfasser]; Engelberg, Stefan [Verfasser]; Jannidis, Fotis [Verfasser]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
DNB Subject Category Language
Show details

Page: 1 2 3 4 5 6

Catalogues
0
0
0
0
22
0
0
Bibliographies
0
0
0
0
0
0
1
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
96
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern