Home
Catalogue search
Refine your search:
Keyword
Creator / Publisher:
Anna Widiger (4)
Bruno Pouliquen (4)
Camelia Ignat (4)
Dan Tufiş (4)
Ralf Steinberger (4)
The Pennsylvania State University CiteSeerX Archives (4)
Tomaž Erjavec (4)
Dániel Varga (3)
Year:
2006 (4)
Medium
Type
BLLDB-Access
Search in the Catalogues and Directories
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
Sort by
creator [A → Z]
'
creator [Z → A]
'
publishing year ↑ (asc)
'
publishing year ↓ (desc)
'
title [A → Z]
'
title [Z → A]
'
Simple Search
Hits 1 – 4 of 4
1
The JRC-Acquis: A multilingual aligned parallel corpus with 20+ languages
Ralf Steinberger
;
Bruno Pouliquen
;
Anna Widiger
...
In: http://langtech.jrc.it/Documents/0605_LREC_JRC-Acquis_Steinberger-et-al.pdf (2006)
BASE
Show details
2
The JRC-Acquis: A multilingual aligned parallel corpus with 20+ languages
Ralf Steinberger
;
Bruno Pouliquen
;
Anna Widiger
;
Camelia Ignat
;
Tomaž Erjavec
;
Dan Tufiş
In: http://www.racai.ro/~tufis/papers/Steinberger-PWIETV-LREC2006.pdf (2006)
Abstract:
We are presenting a new and unique parallel corpus available in all 2 official European Union (EU) languages, with additional documents available for some EU candidate countries. The average size is about 10 Million words per language. The UTF-8-encoded corpus has been manually classified according to EUROVOC (1994) subject domains and is available in XML format. Pair-wise paragraph alignment information is available for all 190+ language pair combinations. The corpus is accompanied by a tool to produce a bilingual paragraph-aligned parallel corpus for all possible language pair combinations. Motivation to compile the parallel corpus Parallel corpora are extremely useful to train and evaluate automatic text analysis systems and to generate new linguistic resources such as subject-specific monolingual and multilingual terminology lists, and more. In the full paper, we will elaborate the many uses of parallel corpora, will summarise the specific interest of the Joint Research Centre in this corpus, and will stress the importance of having such a corpus especially for languages for which less linguistic resources are available. What is the JRC Collection of the ‘Acquis Communautaire ’ (short: the JRC-Acquis)
URL:
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.145.5767
http://www.racai.ro/~tufis/papers/Steinberger-PWIETV-LREC2006.pdf
BASE
Hide details
3
The JRC-Acquis: A multilingual aligned parallel corpus with 20+ languages
Ralf Steinberger
;
Bruno Pouliquen
;
Anna Widiger
...
In: http://nl.ijs.si/et/Bib/LREC06/lrec06-jrc-340.pdf (2006)
BASE
Show details
4
The JRC-Acquis: A multilingual aligned parallel corpus with 20+ languages
Ralf Steinberger
;
Bruno Pouliquen
;
Anna Widiger
...
In: http://www.mt-archive.info/LREC-2006-Steinberger.pdf (2006)
BASE
Show details
Mobile view
All
Catalogues
UB Frankfurt Linguistik
0
IDS Mannheim
0
OLC Linguistik
0
UB Frankfurt Retrokatalog
0
DNB Subject Category Language
0
Institut für Empirische Sprachwissenschaft
0
Leibniz-Centre General Linguistics (ZAS)
0
Bibliographies
BLLDB
0
BDSL
0
IDS Bibliografie zur deutschen Grammatik
0
IDS Bibliografie zur Gesprächsforschung
0
IDS Konnektoren im Deutschen
0
IDS Präpositionen im Deutschen
0
IDS OBELEX meta
0
MPI-SHH Linguistics Collection
0
MPI for Psycholinguistics
0
Linked Open Data catalogues
Annohub
0
Online resources
Link directory
0
Journal directory
0
Database directory
0
Dictionary directory
0
Open access documents
BASE
4
Linguistik-Repository
0
IDS Publikationsserver
0
Online dissertations
0
Language Description Heritage
0
© 2013 - 2024 Lin|gu|is|tik
|
Imprint
|
Privacy Policy
|
Datenschutzeinstellungen ändern