Hits 1 – 20 of 119

1	Second DIHARD Challenge Development - Eleven Sources
	Ryant, Neville; Liberman, Mark; Fiumara, James; Cieri, Christopher. - : Linguistic Data Consortium, 2021. : https://www.ldc.upenn.edu, 2021
	Abstract:
	Introduction
	Second DIHARD Challenge Development - Eleven Sources was developed by LDC and contains approximately 22 hours of English and Chinese speech data along with corresponding annotations used in support of the Second DIHARD Challenge. The DIHARD Challenges are a set of shared tasks on "hard" diarization; that is, speech diarization for challenging corpora where existing state-of-the-art systems were expected to fare poorly. As with the first challenge, the second development and evaluation sets were drawn from a diverse sampling of sources including monologues, map task dialogues, broadcast interviews, sociolinguistic interviews, meeting speech, speech in restaurants, clinical recordings, extended child language acquisition recordings, and YouTube videos.
	Data
	This release, when combined with Second DIHARD Challenge Development - SEEDLingS (LDC2021S11), contains the development set audio data and annotation, except for CHiME-5 audio files, which must be obtained from the University of Sheffield. Data sources used in this release are as follows (all sources are in English unless otherwise indicated):
	* Autism Diagnosis Observation Schedule (ADOS) interviews
	* CHiME-5 dinner party recordings (annotations only in this release)
	* Conversations in Restaurants
	* DCIEM/HCRC map task (LDC96S38)
	* Audiobook recordings from LibriVox
	* Meeting speech from 2004 Spring NIST Rich Transcription (RT-04S) Development (LDC2007S11) and Evaluation (LDC2007S12) releases
	* 2001 U.S. Supreme Court oral arguments
	* Sociolinguistic interviews from SLX Corpus of Classic Sociolinguistic Interviews (LDC2003T15)
	* Mixer 6 Speech (LDC2013S03)
	* English and Chinese video collected by LDC as part of the Video Annotation for Speech Technologies (VAST) project
	* YouthPoint radio interviews
	All audio is provided in the form of 16 kHz, 16-bit, mono-channel FLAC files.
	The diarization for each recording is stored as a NIST Rich Transcription Time Marked (RTTM) file. RTTM files are space-separated text files containing one turn per line. Segmentation files are stored as HTK label files, each containing one speech segment per line. Scoring regions for each recording are specified by un-partitioned evaluation map (UEM) files. All annotation file types are encoded as UTF-8. More information about file formats, data sources and domains is contained in the included documentation.
	Samples
	* Audio Sample (FLAC)
	* Label Sample (TXT)
	* RTTM Sample (TXT)
	Updates
	None at this time.
	URL: https://catalog.ldc.upenn.edu/LDC2021S10
	BASE
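
The abstract of entry 1 describes RTTM files as space-separated text with one speaker turn per line. The sketch below shows one way such a file might be read and summarised. The ten-field layout it assumes (type, file ID, channel, onset, duration, and speaker name in the eighth field) follows the commonly used RTTM convention and is not stated in the abstract itself; the sample lines, recording ID, and speaker labels are invented for illustration, so the documentation shipped with the corpus should be treated as authoritative.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    file_id: str     # recording identifier (field 2)
    onset: float     # turn start in seconds (field 4)
    duration: float  # turn length in seconds (field 5)
    speaker: str     # speaker label (field 8)


def parse_rttm(text):
    """Parse RTTM text into Turn objects, one per SPEAKER line.

    Assumes the conventional ten-field layout; lines of any other
    type and blank lines are skipped.
    """
    turns = []
    for line in text.splitlines():
        fields = line.split()
        if not fields or fields[0] != "SPEAKER":
            continue
        turns.append(
            Turn(fields[1], float(fields[3]), float(fields[4]), fields[7])
        )
    return turns


def speech_per_speaker(turns):
    """Sum turn durations (in seconds) for each speaker label."""
    totals = {}
    for turn in turns:
        totals[turn.speaker] = totals.get(turn.speaker, 0.0) + turn.duration
    return totals


# Hypothetical sample data, not taken from the corpus.
SAMPLE = """SPEAKER rec01 1 0.00 2.50 <NA> <NA> spk_a <NA> <NA>
SPEAKER rec01 1 2.50 1.25 <NA> <NA> spk_b <NA> <NA>
SPEAKER rec01 1 4.00 1.50 <NA> <NA> spk_a <NA> <NA>"""

if __name__ == "__main__":
    for speaker, seconds in speech_per_speaker(parse_rttm(SAMPLE)).items():
        print(speaker, seconds)
```

Since the annotation files are UTF-8 encoded, a real file would be read with `open(path, encoding="utf-8")` before being passed to `parse_rttm`.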

2	Second DIHARD Challenge Development - SEEDLingS
	Ryant, Neville; Liberman, Mark; Fiumara, James. - : Linguistic Data Consortium, 2021. : https://www.ldc.upenn.edu, 2021
	BASE

3	First DIHARD Challenge -- System Submissions and Scores ...
	Ryant, Neville; Church, Kenneth; Cieri, Christopher. - : Zenodo, 2021
	BASE

4	First DIHARD Challenge -- System Submissions and Scores ...
	Ryant, Neville; Church, Kenneth; Cieri, Christopher. - : Zenodo, 2021
	BASE

5	Second DIHARD Challenge Development - SEEDLingS ...
	Liberman, Mark; Fiumara, James; Cieri, Christopher. - : Linguistic Data Consortium, 2021
	BASE

6	Coding categories relevant to interaction
	Ogden, Richard; Cantarutti, Marina. - : Oxford University Press, 2021
	BASE

7	Second DIHARD Challenge Development - Eleven Sources ...
	Liberman, Mark; Fiumara, James; Cieri, Christopher. - : Linguistic Data Consortium, 2021
	BASE

8	Automated Analysis of Digitized Letter Fluency Data
	Cho, Sunghye; Nevler, Naomi; Parjane, Natalia...
	In: Front Psychol (2021)
	BASE

9	Beyond Citations: Corpus-based Methods for Detecting the Impact of Research Outcomes on Society
	Rezapour, Rezvaneh [author]; Bopp, Jutta [author]; Fiedler, Norman [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
	DNB Subject Category Language

10	A Corpus Linguistic Perspective on Contemporary German Pop Lyrics with the Multi-Layer Annotated "Songkorpus"
	Schneider, Roman [author]; Calzolari, Nicoletta [editor]; Béchet, Frédéric [editor]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
	DNB Subject Category Language

11	Using Automatic Speech Recognition in Spoken Corpus Curation
	Gorisch, Jan [author]; Gref, Michael [author]; Schmidt, Thomas [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
	DNB Subject Category Language

12	Improving Sentence Boundary Detection for Spoken Language Transcripts
	Rehbein, Ines [author]; Ruppenhofer, Josef [author]; Schmidt, Thomas [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
	DNB Subject Category Language

13	Interoperability in an Infrastructure Enabling Multidisciplinary Research: The case of CLARIN
	de Jong, Franciska [author]; Maegaard, Bente [author]; Fišer, Darja [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
	DNB Subject Category Language

14	Treebanking User-Generated Content: A Proposal for a Unified Representation in Universal Dependencies
	Sanguinetti, Manuela [author]; Bosco, Cristina [author]; Cassidy, Lauren [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
	DNB Subject Category Language

15	Privacy by Design and Language Resources
	Kamocki, Paweł [author]; Witt, Andreas [author]; Calzolari, Nicoletta [editor]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
	DNB Subject Category Language

16	RKorAPClient: An R Package for Accessing the German Reference Corpus DeReKo via KorAP
	Kupietz, Marc [author]; Diewald, Nils [author]; Margaretha, Eliza [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
	DNB Subject Category Language

17	Fine-grained Named Entity Annotations for German Biographic Interviews
	Ruppenhofer, Josef [author]; Rehbein, Ines [author]; Flinz, Carolina [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
	DNB Subject Category Language

18	Corpus Query Lingua Franca part II: Ontology
	Evert, Stefan [author]; Harlamov, Oleg [author]; Heinrich, Philipp [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
	DNB Subject Category Language

19	Doctor Who? Framing Through Names and Titles in German
	van den Berg, Esther [author]; Korfhage, Katharina [author]; Ruppenhofer, Josef [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
	DNB Subject Category Language

20	Corpus REDEWIEDERGABE
	Brunner, Annelen [author]; Engelberg, Stefan [author]; Jannidis, Fotis [author]. - Mannheim : Leibniz-Institut für Deutsche Sprache (IDS), Bibliothek, 2020
	DNB Subject Category Language

