DE eng

Search in the Catalogues and Directories

Hits 1 – 10 of 10

1
Spoken corpus Gos VideoLectures 4.2 (transcription)
Verdonik, Darinka; Potočnik, Tomaž; Sepesy Maučec, Mirjam. - : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2021
BASE
Show details
2
Spoken corpus Gos VideoLectures 4.1 (transcription)
Verdonik, Darinka; Potočnik, Tomaž; Sepesy Maučec, Mirjam. - : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2021
BASE
Show details
3
Spoken corpus Gos VideoLectures 4.0 (transcription)
Verdonik, Darinka; Potočnik, Tomaž; Sepesy Maučec, Mirjam; Erjavec, Tomaž; Majhenič, Simona; Žgank, Andrej. - : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2019
Abstract: Gos VideoLectures is an add-on to the Gos reference corpus of spoken Slovene (http://hdl.handle.net/11356/1040), and covers public academic speech. The Gos VideoLectures corpus contains a selection of public lectures available through the web portal Videolectures.net provided by the Jožef Stefan Institute, and covers 55 lectures and 22 hours of speech. This resource contains only annotated transcriptions of the corpus – audio recordings are available at http://hdl.handle.net/11356/1222. The transcriptions for Gos VideoLectures were done manually and carefully checked. The main guidelines for transcription were those of the Gos corpus (http://www.korpus-gos.net/Support/About). The transcription tool Transcriber 1.5.1 (http://trans.sourceforge.net/en/presentation.php) was used for making transcriptions. It can be also used for reading or exporting transcriptions (.trs files) to different formats. The transcriptions comprise the TRS files with tabular metadata, their conversion to TEI and to vertical file format (as used e.g. by Sketch Engine). Each recording has two TRS files, one with pronunciation-based and the other with the standardised/normalised transcription. The TEI and CWB encodings join these two transcriptions at the token level, with the normalised words being also automatically PoS tagged and lemmatised. The TRS pack also contains files with automatically produced word and phone-level alignment with the speech signal. The corpus can be used for training continuous speech recognition for Slovene language, for phonetic research or any other research of Slovene academic speech.
Keyword: academic speech; speech database; speech recognition; speech transcription; spoken corpus; TEI
URL: http://hdl.handle.net/11356/1223
BASE
Hide details
4
Spoken corpus Gos VideoLectures 3.0 (transcription)
Verdonik, Darinka; Potočnik, Tomaž; Sepesy Maučec, Mirjam. - : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2018
BASE
Show details
5
A speech corpus as a source of lexical information
In: International Journal of Lexicography 30 (2017) 2, 143-166
IDS OBELEX meta
Show details
6
Spoken corpus Gos VideoLectures 2.0 (transcription)
Verdonik, Darinka; Potočnik, Tomaž; Sepesy Maučec, Mirjam. - : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2017
BASE
Show details
7
Spoken corpus Gos VideoLectures 1.0 (transcription)
Verdonik, Darinka; Potočnik, Tomaž; Sepesy Maučec, Mirjam. - : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2016
BASE
Show details
8
SNABI database for continuous speech recognition 1.2
Kačič, Zdravko; Horvat, Bogomir; Zögling Markuš, Aleksandra. - : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2016
BASE
Show details
9
Large vocabulary continuous speech recognition of an inflected language using stems and endings
In: Speech communication. - Amsterdam [u.a.] : Elsevier 49 (2007) 6, 437-452
BLLDB
OLC Linguistik
Show details
10
Modelling highly inflected languages
In: Information sciences. - New York, NY : Elsevier Science Inc. 166 (2004) 1-4, 249-269
BLLDB
Show details

Catalogues
0
0
1
0
0
0
0
Bibliographies
2
0
0
0
0
0
1
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
7
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern