Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 12 of 12

1	Spoken corpus Gos VideoLectures 4.2 (transcription)
	Verdonik, Darinka; Potočnik, Tomaž; Sepesy Maučec, Mirjam. - : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2021
	BASE
	Show details

2	Spoken corpus Gos VideoLectures 4.1 (transcription)
	Verdonik, Darinka; Potočnik, Tomaž; Sepesy Maučec, Mirjam; Erjavec, Tomaž; Majhenič, Simona; Žgank, Andrej. - : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2021
	Abstract: Gos VideoLectures is an add-on to the Gos reference corpus of spoken Slovene (http://hdl.handle.net/11356/1040), and covers public academic speech. It can be used for training continuous speech recognition for Slovene language, for phonetic research or any other research of Slovene academic speech. The corpus contains a selection of public lectures available through the web portal Videolectures.net provided by the Jožef Stefan Institute, and covers 55 lectures and 22 hours of speech. This resource contains only annotated transcriptions of the corpus, while the audio recordings are available at http://hdl.handle.net/11356/1222. The transcriptions for Gos VideoLectures were done manually and carefully checked. The main guidelines for transcription were those of the Gos corpus (http://www.korpus-gos.net/Support/About). The transcription tool Transcriber 1.5.1 (http://trans.sourceforge.net/en/presentation.php) was used for making transcriptions. It can be also used for reading or exporting transcriptions (.trs files) to different formats. The transcriptions comprise the TRS files with tabular metadata, their conversion to TEI and to vertical file format (as used e.g. by Sketch Engine). Each recording has two TRS files, one with pronunciation-based and the other with the standardised/normalised transcription. The TRS zip also contains files with automatically produced word and phone-level alignment with the speech signal, as well as the annotation guidelines (in Slovenian). The TEI and vertical encodings join the two transcriptions at the token level, with the normalised words also linguistically annotated. The annotiations comprise the word lemma, the MULTEXT-East MSDs and the Universal dependencies morphological features. As opposed to version 4.0, this version uses the CLASSLA tool (https://github.com/clarinsi/classla) for linguistic annotation and changes the TEI encoding of the normalised words.
	Keyword: academic speech; speech database; speech recognition; speech transcription; spoken corpus; TEI
	URL: http://hdl.handle.net/11356/1439
	BASE
	Hide details

3	Spoken corpus Gos VideoLectures 4.0 (transcription)
	Verdonik, Darinka; Potočnik, Tomaž; Sepesy Maučec, Mirjam. - : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2019
	BASE
	Show details

4	SNABI database for continuous speech recognition 1.2
	Kačič, Zdravko; Horvat, Bogomir; Zögling Markuš, Aleksandra. - : Faculty of Electrical Engineering and Computer Science, University of Maribor, 2016
	BASE
	Show details

5	Factors Influencing Quality of Experience
	Reiter, Ulrich; Brunnström, Kjell; De Moor, Katrien...
	In: Quality of Experience ; https://hal.archives-ouvertes.fr/hal-01159290 ; Sebastian Möller; Alexander Raake. Quality of Experience, Springer International Publishing, pp.55-72, 2014, T-Labs Series in Telecommunication Services, 978-3-319-02680-0. ⟨10.1007/978-3-319-02681-7_4⟩ (2014)
	BASE
	Show details

6	Qualinet White Paper on Definitions of Quality of Experience
	Brunnström, Kjell; Beker, Sergio Ariel; De Moor, Katrien...
	In: https://hal.archives-ouvertes.fr/hal-00977812 ; 2013 (2013)
	BASE
	Show details

7	The impact of context on discourse marker use in two conversational genres
	Verdonik, Darinka; Žgank, Andrej; Pisanski Peterlin, Agnes
	In: Discourse studies. - London [u.a.] : Sage 10 (2008) 6, 759-775
	BLLDB
	Show details

8	The impact of context on discourse marker use in two conversational genres
	Verdonik, Darinka; Zgank, Andrej; Pisanski Peterlin, Agnes
	In: Discourse studies. - London [u.a.] : Sage 10 (2008) 6, 759-776
	OLC Linguistik
	Show details

9	Data-driven generation of phonetic broad classes, based on phoneme confusion matrix similarity
	Horvat, Bogomir; Žgank, Andrej; Kačič, Zdravko
	In: Speech communication. - Amsterdam [u.a.] : Elsevier 47 (2005) 3, 379-393
	BLLDB
	OLC Linguistik
	Show details

10	"LentInfo" Information-Providing System for the Festival Lent Programme
	Zgank, Andrej; Rojc, Matej
	In: International journal of speech technology. - Boston, Mass. [u.a.] : Kluwer Acad. Publ. 6 (2003) 3, 233-244
	OLC Linguistik
	Show details

11	Clustering of triphones using phoneme similarity estimation for the definition of a multilingual set of triphones
	Imperl, Bojan; Kacic, Zdravko; Horvat, Bogomir...
	In: Speech communication. - Amsterdam [u.a.] : Elsevier 39 (2003) 3, 353-366
	OLC Linguistik
	Show details

12	Clustering of triphones using phoneme similarity estimation for the definition of a multilingual set of triphones
	Imperl, Bojan; Kačič, Zdravko; Horvat, Bogomir...
	In: Speech communication. - Amsterdam [u.a.] : Elsevier 39 (2003) 3-4, 353-366
	BLLDB
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern