
Search in the Catalogues and Directories

Hits 1 – 6 of 6

1
English-Urdu Religious Parallel Corpus
Jawaid, Bushra; Zeman, Daniel. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2018
BASE
2
WMT16 Tuning Shared Task Models (English-to-Czech)
Kamran, Amir; Jawaid, Bushra; Bojar, Ondřej; Stanojevic, Milos. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2016. : University of Amsterdam, ILLC, 2016
Abstract: This item contains the models to be tuned in the WMT16 Tuning shared task for English-to-Czech. The translation models are trained on the CzEng 1.6pre corpus (http://ufal.mff.cuni.cz/czeng/czeng16pre). Before training, the data is tokenized (using the Moses tokenizer) and lowercased, and sentences longer than 60 words or shorter than 4 words are removed. Alignment is done using fast_align (https://github.com/clab/fast_align), and the standard Moses pipeline is used for training. Two 5-gram language models are trained using KenLM: one on the CzEng Czech data only, and the other on all Czech monolingual data available for WMT except Common Crawl. Also included are two lexicalized bidirectional reordering models, word-based and hierarchical, with msd orientation conditioned on both the source and target sides of the processed CzEng data.
Keyword: baseline models; machine translation; shared task; tuning; WMT16
URL: http://hdl.handle.net/11372/LRT-1672
BASE
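The preprocessing described in the abstract above (lowercasing and length filtering) can be illustrated with a minimal sketch. It is not the authors' pipeline: plain whitespace splitting stands in for the Moses tokenizer, the function names `keep_sentence` and `preprocess_pairs` are hypothetical, and dropping a sentence pair when either side falls outside the 4 to 60 word range is an assumption, since the abstract does not say how the filter applies to the two sides.

```python
from typing import Iterable, Iterator, List, Tuple

MIN_WORDS = 4   # from the abstract: sentences shorter than 4 words are removed
MAX_WORDS = 60  # from the abstract: sentences longer than 60 words are removed


def keep_sentence(tokens: List[str]) -> bool:
    """Return True if the token count lies within the allowed range."""
    return MIN_WORDS <= len(tokens) <= MAX_WORDS


def preprocess_pairs(pairs: Iterable[Tuple[str, str]]) -> Iterator[Tuple[str, str]]:
    """Lowercase each sentence pair and drop pairs with an out-of-range side.

    Assumption: whitespace splitting approximates tokenization here; the
    original work used the Moses tokenizer instead.
    """
    for src, tgt in pairs:
        src_tokens = src.lower().split()
        tgt_tokens = tgt.lower().split()
        # Assumption: a pair is kept only if BOTH sides pass the length filter.
        if keep_sentence(src_tokens) and keep_sentence(tgt_tokens):
            yield " ".join(src_tokens), " ".join(tgt_tokens)
```

Calling `list(preprocess_pairs(pairs))` on an iterable of (English, Czech) string pairs returns the lowercased pairs that survive the filter. A real setup would more likely run the Moses tokenizer and corpus-cleaning scripts before alignment.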
3
WMT16 Tuning Shared Task Models (Czech-to-English)
Kamran, Amir; Jawaid, Bushra; Bojar, Ondřej. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2016. : University of Amsterdam, ILLC, 2016
BASE
4
Urdu Monolingual Corpus
Jawaid, Bushra; Kamran, Amir; Bojar, Ondřej. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2014
BASE
5
Linguistic digital repository based on DSpace
Pajas, Petr; Vandas, Karel; Mišutka, Jozef. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2014
BASE
6
Word-order issues in English-to-Urdu statistical machine translation
In: The Prague bulletin of mathematical linguistics. - Praha : Univ. (2011) 95, 87-106
BLLDB
OLC Linguistik

Catalogues: 1
Bibliographies: 1
Linked Open Data catalogues: 0
Online resources: 0
Open access documents: 5
© 2013 - 2024 Lin|gu|is|tik