2 |
Prague Dependency Treebank of Spoken Czech 2.0 (PDTSC 2.0)
|
|
Mikulová, Marie; Bémová, Alevtina; Hajič, Jan; Hajičová, Eva; Ircing, Pavel; Kolářová, Veronika; Lopatková, Markéta; Mareček, David; Mírovský, Jiří; Nedoluzhko, Anna; Pajas, Petr; Panevová, Jarmila; Peterek, Nino; Romportl, Jan; Sgall, Petr; Ševčíková, Magda; Štěpánek, Jan; Urešová, Zdeňka; Žabokrtský, Zdeněk. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
|
|
Abstract:
The Prague Dependency Treebank of Spoken Czech 2.0 (PDTSC 2.0) is a corpus of spoken language, consisting of 742,316 tokens and 73,835 sentences, representing 7,324 minutes (over 120 hours) of spontaneous dialogs. The dialogs have been recorded, transcribed and edited in several interlinked layers: audio recordings, automatic and manual transcripts and manually reconstructed text. These layers were part of the first version of the corpus (PDTSC 1.0). Version 2.0 is extended by an automatic dependency parser at the analytical and by the manual annotation of “deep” syntax at the tectogrammatical layer, which contains semantic roles and relations as well as annotation of coreference.
|
|
Keyword:
audio; coreference; semantics; speech recognition; speech reconstruction; spoken corpus; syntax
|
|
URL: http://hdl.handle.net/11234/1-3189
|
|
BASE
|
|
Hide details
|
|
5 |
Enriching VALLEX with Light Verbs: From Theory to Data and Back Again
|
|
|
|
In: Prague Bulletin of Mathematical Linguistics , Vol 111, Iss 1, Pp 29-56 (2018) (2018)
|
|
BASE
|
|
Show details
|
|
7 |
Reflexive Verbs in a Valency Lexicon: The Case of Czech Reflexive Morphemes
|
|
|
|
In: Proceedings of the 16th EURALEX International Congress: The User in Focus, Bolzano/Bozen, Italien 15 - 19 July 2014 (2014), 1007-1023
|
|
IDS OBELEX meta
|
|
Show details
|
|
14 |
Valencní slovník ceských sloves
|
|
|
|
MPI-SHH Linguistik
|
|
Show details
|
|
15 |
Studies in formal slavic linguistics : contributions from Formal Description of Slavic Languages 6.5, held at the University of Nova Gorica, December 1-3, 2006
|
|
|
|
BLLDB
|
|
UB Frankfurt Linguistik
|
|
Show details
|
|
|
|