DE eng

Search in the Catalogues and Directories

Hits 1 – 16 of 16

1
FAUST 0.5
Hajič, Jan; Mareček, David; Fučíková, Eva; Cinková, Silvie; Štěpánek, Jan; Mikulová, Marie. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
Abstract: Syntactic (including deep-syntactic - tectogrammatical) annotation of user-generated noisy sentences. The annotation was made on Czech-English and English-Czech Faust Dev/Test sets. The English data includes manual annotations of English reference translations of Czech source texts. This texts were translated independently by two translators. After some necessary cleanings, 1000 segments were randomly selected for manual annotation. Both the reference translations were annotated, which means 2000 annotated segments in total. The Czech data includes manual annotations of Czech reference translations of English source texts. This texts were translated independently by three translators. After some necessary cleanings, 1000 segments were randomly selected for manual annotation. All three reference translations were annotated, which means 3000 annotated segments in total. Faust is part of PDT-C 1.0 (http://hdl.handle.net/11234/1-3185).
Keyword: noisy texts; parallel corpus; tectogrammatics; treebank
URL: http://hdl.handle.net/11234/1-3308
BASE
Hide details
2
Usage-based linguistics and the magic number four
In: Cognitive linguistics. - Berlin ; Boston, Mass. : de Gruyter Mouton 28 (2017) 2, 209-237
BLLDB
Show details
3
ПРОЕКТ СОЗДАНИЯ КИТАЙСКО-РУССКОГО ПАРАЛЛЕЛЬНОГО КОРПУСА ОФИЦИАЛЬНО-ДЕЛОВЫХ ТЕКСТОВ С ДИСКУРСИВНО-СТРУКТУРНОЙ РАЗМЕТКОЙ
МУХИН МИХАИЛ ЮРЬЕВИЧ; ЯН И. - : Государственное образовательное учреждение высшего профессионального образования «Южно-Уральский государственный университет», 2016
BASE
Show details
4
The grammatical annotation of speech corpora : techniques and perspectives
In: Spoken corpora and linguistic studies. - Amsterdam [u.a.] : Benjamins (2014), 105-128
BLLDB
Show details
5
Copenhagen Dependency Treebanks versions 1-3
Buch-Kromann, Matthias. - : Copenhagen Business School, 2014
BASE
Show details
6
Czech-English Parallel Corpus 1.0 (CzEng 1.0)
Bojar, Ondřej; Žabokrtský, Zdeněk; Dušek, Ondřej. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2014
BASE
Show details
7
Approximate inference: a sampling based modeling technique to capture complex dependencies in a language model
In: Speech communication. - Amsterdam [u.a.] : Elsevier 55 (2013) 1, 162-177
BLLDB
Show details
8
Prague Czech-English Dependency Treebank 2.0
Hajič, Jan; Hajičová, Eva; Panevová, Jarmila. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2013
BASE
Show details
9
Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/277_Paper.pdf (2012)
BASE
Show details
10
Cross-domain effects on parse selection for precision grammars
In: Research on language and computation. - London : King's College 8 (2011) 4, 299-340
BLLDB
OLC Linguistik
Show details
11
A O(|G|n6) time extension of inversion transduction grammars
In: Machine translation. - Dordrecht [u.a.] : Springer Science + Business Media 25 (2011) 4, 291-315
BLLDB
OLC Linguistik
Show details
12
Automatically generated parallel treebanks and their exploitability in machine translation
In: Machine translation. - Dordrecht [u.a.] : Springer Science + Business Media 23 (2010) 1, 1-22
BLLDB
OLC Linguistik
Show details
13
Announcing Prague Czech-English Dependency Treebank 2.0
In: http://ufal.mff.cuni.cz/~bojar/publications/2012-FILE-pcedt20_lrec2012-2012-lrec-pcedt.pdf
BASE
Show details
14
Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures
In: http://papers.ldc.upenn.edu/LREC2012/Li-Parallel_Aligned_Treebanks.pdf
BASE
Show details
15
The Joy of Parallelism with CzEng 1.0
In: http://ufal.mff.cuni.cz/~bojar/publications/2012-FILE-czeng10_lrec2012-2012-lrec-czeng.pdf
BASE
Show details
16
A tree is a Baum is an árbol is a sach’a: Creating a trilingual treebank
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/350_Paper.pdf
BASE
Show details

Catalogues
0
0
3
0
0
0
0
Bibliographies
6
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
10
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern