Home
Catalogue search
Refine your search:
Keyword:
Czech-English parallel corpus (1)
Universal Dependencies (1)
automatic parallel treebank (1)
dependency syntax (1)
evaluation (1)
parsing (1)
training data for machine translation (1)
Creator / Publisher:
Martin Popel (5)
The Pennsylvania State University CiteSeerX Archives (5)
David Mareček (4)
Václav Novák (2)
Alcalde, Hector Fernandez (1)
Attia, Mohammed (1)
Badmaeva, Elena (1)
Banerjee, Esha (1)
Burchardt, Aljoscha (1)
Cinkova, Silvie (1)
more
Year
Medium:
Online (6)
Type:
Article (6)
BLLDB-Access
Search in the Catalogues and Directories
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
Sort by
creator [A → Z]
'
creator [Z → A]
'
publishing year ↑ (asc)
'
publishing year ↓ (desc)
'
title [A → Z]
'
title [Z → A]
'
Simple Search
Hits 1 – 6 of 6
1
CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies
Çöltekin, Çağrı
;
Kayadelen, Tolga
;
Droganova, Kira
. - : Association for Computational Linguistics, 2017. : country:USA, 2017. : place:Stroudsburg, PA, 2017
BASE
Show details
2
HamleDT 2.0: Thirty dependency treebanks Stanfordized.
Rudolf Rosa
;
Jan Mašek
;
David Mareček
...
In: http://www.lrec-conf.org/proceedings/lrec2014/pdf/915_Paper.pdf (2014)
BASE
Show details
3
The Joy of Parallelism with CzEng 1.0
David Mareček
;
Michal Novák
;
Martin Popel
In: http://ufal.mff.cuni.cz/~bojar/publications/2012-FILE-czeng10_lrec2012-2012-lrec-czeng.pdf
Abstract:
CzEng 1.0 is an updated release of our Czech-English parallel corpus, freely available for non-commercial research or educational purposes. In this release, we approximately doubled the corpus size, reaching 15 million sentence pairs (about 200 million tokens per language). More importantly, we carefully filtered the data to reduce the amount of non-matching sentence pairs. CzEng 1.0 is automatically aligned at the level of sentences as well as words. We provide not only the plain text representation, but also automatic morphological tags, surface syntactic as well as deep syntactic dependency parse trees and automatic co-reference links in both English and Czech. This paper describes key properties of the released resource including the distribution of text domains, the corpus data formats, and a toolkit to handle the provided rich annotation. We also summarize the procedure of the rich annotation (incl. co-reference resolution) and of the automatic filtering. Finally, we provide some suggestions on exploiting such an automatically annotated sentence-parallel corpus.
Keyword:
automatic parallel treebank
;
Czech-English parallel corpus
;
training data for machine translation
URL:
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.407.1255
http://ufal.mff.cuni.cz/~bojar/publications/2012-FILE-czeng10_lrec2012-2012-lrec-czeng.pdf
BASE
Hide details
4
English-Czech MT in 2008 ∗
David Mareček
;
Václav Novák
;
Martin Popel
In: http://ufal.mff.cuni.cz/~bojar/publications/2009-FILE-bojar_etal_2009_WMT-2009-wmt.pdf
BASE
Show details
5
Hidden Markov Tree Model in Dependency-based Machine Translation ∗
Martin Popel
In: http://aclweb.org/anthology-new/P/P09/P09-2037.pdf
BASE
Show details
6
English-Czech MT in 2008 ∗
David Mareček
;
Václav Novák
;
Martin Popel
In: http://aclweb.org/anthology-new/W/W09/W09-0422.pdf
BASE
Show details
Mobile view
All
Catalogues
UB Frankfurt Linguistik
0
IDS Mannheim
0
OLC Linguistik
0
UB Frankfurt Retrokatalog
0
DNB Subject Category Language
0
Institut für Empirische Sprachwissenschaft
0
Leibniz-Centre General Linguistics (ZAS)
0
Bibliographies
BLLDB
0
BDSL
0
IDS Bibliografie zur deutschen Grammatik
0
IDS Bibliografie zur Gesprächsforschung
0
IDS Konnektoren im Deutschen
0
IDS Präpositionen im Deutschen
0
IDS OBELEX meta
0
MPI-SHH Linguistics Collection
0
MPI for Psycholinguistics
0
Linked Open Data catalogues
Annohub
0
Online resources
Link directory
0
Journal directory
0
Database directory
0
Dictionary directory
0
Open access documents
BASE
6
Linguistik-Repository
0
IDS Publikationsserver
0
Online dissertations
0
Language Description Heritage
0
© 2013 - 2024 Lin|gu|is|tik
|
Imprint
|
Privacy Policy
|
Datenschutzeinstellungen ändern