Search in the Catalogues and Directories

Hits 1 – 3 of 3

1
ParCzech 3.0
Kopp, Matyáš; Stankov, Vladislav; Bojar, Ondřej; Hladká, Barbora; Straňák, Pavel. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
Abstract: The ParCzech 3.0 corpus is the third version of ParCzech. It consists of stenographic protocols that record the Chamber of Deputies’ meetings held in the 7th term (2013-2017) and the current 8th term (2017-Mar 2021). The protocols are provided in their original HTML format, in Parla-CLARIN TEI format, and in a format suitable for Automatic Speech Recognition. The corpus is automatically enriched with morphological, syntactic, and named-entity annotations produced by UDPipe 2 and NameTag 2. The audio files are aligned with the texts in the annotated TEI files.
Keywords: Chamber of Deputies; Parliament of the Czech Republic; speech corpus; stenographic protocols; TEI encoding
URL: http://hdl.handle.net/11234/1-3631
BASE
2
ParCzech PS7 2.0
Hladká, Barbora; Kopp, Matyáš; Straňák, Pavel. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2020
BASE
3
ParCzech PS7 1.0
Hladká, Barbora; Kopp, Matyáš; Straňák, Pavel. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2020
BASE

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Change privacy settings