DE eng

Search in the Catalogues and Directories

Hits 1 – 9 of 9

1
Swedish Turkish Parallel Treebank
Abstract: In this paper, we describe our work on building a parallel treebank for a less studied and typologically dissimilar language pair, namely Swedish and Turkish. The treebank is a balanced syntactically annotated corpus containing both fiction and technical documents. In total, it consists of approximately 160,000 tokens in Swedish and 145,000 in Turkish. The texts are linguistically annotated using different layers from part of speech tags and morphological features to dependency annotation. Each layer is automatically processed by using basic language resources for the involved languages. The sentences and words are aligned, and partly manually corrected. We create the treebank by reusing and adjusting existing tools for the automatic annotation, alignment, and their correction and visualization. The treebank was developed within the project Supporting research environment for minor languages aiming at to create representative language resources for language pairs dissimilar in language structure. Therefore, efforts are put on developing a general method for formatting and annotation procedure, as well as using tools that can be applied to other language pairs easily. 1.
URL: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.149.1040
BASE
Hide details
2
Swedish-Turkish Parallel Treebank
In: http://www.lrec-conf.org/proceedings/lrec2008/pdf/121_paper.pdf (2008)
BASE
Show details
3
Single Malt or Blended? A study in Multilingual Parser Optimization
In: http://acl.ldc.upenn.edu/D/D07/D07-1097.pdf (2007)
BASE
Show details
4
Supporting Research Environment for Less Explored Languages: A Case Study of Swedish and Turkish
In: http://stp.lingfil.uu.se/~nivre/docs/saagvall2.pdf
BASE
Show details
5
A Basic Language Resource Kit for Persian
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/338_Paper.pdf
BASE
Show details
6
The English-Swedish-Turkish Parallel Treebank
In: http://www.lrec-conf.org/proceedings/lrec2010/pdf/116_Paper.pdf
BASE
Show details
7
A Multilingual Evaluation of Three Spelling Normalisation Methods for Historical Text
In: http://www.aclweb.org/anthology/W/W14/W14-0605.pdf
BASE
Show details
8
Parsing the Past – Identification of Verb Constructions in Historical Text
In: http://aclweb.org/anthology-new/W/W12/W12-1010.pdf
BASE
Show details
9
Normalisation of Historical Text Using Context-Sensitive Weighted Levenshtein Distance and Compound Splitting
In: http://emmtee.net/oe/nodalida13/conference/8.pdf
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
9
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern