1 |
Available online at www.sciencedirect.com
|
|
|
|
In: http://www.lextutor.ca/cv/crossley_cobb_mcn_13.pdf (2013)
|
|
BASE
|
|
Show details
|
|
2 |
Informatica 37 (2013) 193–201 193 Vector Disambiguation for Translation Extraction from Comparable
|
|
|
|
In: http://www.informatica.si/PDF/37-2/14_Apidianaki - Vector Disambiguation for Translation Ex.pdf (2013)
|
|
BASE
|
|
Show details
|
|
3 |
Identifying word translations from comparable documents without a seed lexicon
|
|
|
|
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/888_Paper.pdf (2012)
|
|
BASE
|
|
Show details
|
|
4 |
Development and application of a cross-language document comparability metric
|
|
|
|
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/804_Paper.pdf (2012)
|
|
BASE
|
|
Show details
|
|
5 |
Web-Corpora from Top-Level Domains Represent National Varieties of English
|
|
|
|
In: http://lexicometrica.univ-paris3.fr/jadt/jadt2012/Communications/Cook,+Paul+et+al.+-+Do+web+Corpora+from+Top-Level+Domains.pdf (2012)
|
|
BASE
|
|
Show details
|
|
6 |
PaCo2: A Fully Automated tool for gathering Parallel Corpora from the Web
|
|
|
|
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/231_Paper.pdf (2012)
|
|
Abstract:
The importance of parallel corpora in the NLP field is well known. This paper presents a tool that can build parallel corpora given just a seed word list and a pair of languages. Our approach is similar to others proposed in the literature, but introduces a new phase to the process. While most of the systems leave the task of finding websites containing parallel content up to the user, PaCo2 (Parallel Corpora Collector) takes care of that as well. The tool is language independent as far as possible, and adapting the system to work with new languages is fairly straightforward. Evaluation of the different modules has been carried out for Basque-Spanish, Spanish-English and Portuguese-English language pairs. Even though there is still room for improvement, results are positive. Results show that the corpora created have very good quality translations units, and the quality is maintained for the various language pairs. Details of the corpora created up until now are also provided.
|
|
Keyword:
Corpus Building; Multilingual Resources; Parallel Corpora
|
|
URL: http://www.lrec-conf.org/proceedings/lrec2012/pdf/231_Paper.pdf http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.684.1320
|
|
BASE
|
|
Hide details
|
|
7 |
Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies
|
|
|
|
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/188_Paper.pdf (2012)
|
|
BASE
|
|
Show details
|
|
8 |
The Twins corpus of museum visitor questions
|
|
|
|
In: http://people.ict.usc.edu/~traum/Papers/twins-corpus.pdf (2012)
|
|
BASE
|
|
Show details
|
|
9 |
Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies
|
|
|
|
In: http://publications.idiap.ch/downloads/papers/2012/Cartoni_LREC_2012.pdf (2012)
|
|
BASE
|
|
Show details
|
|
10 |
Evaluating DBMS-based access strategies to very large multi-layer corpora
|
|
|
|
In: http://www1.ids-mannheim.de/fileadmin/gra/texte/LREC2012_final.pdf (2012)
|
|
BASE
|
|
Show details
|
|
11 |
Reference Lists for the Evaluation of Term Extraction Tools
|
|
|
|
In: http://hal.inria.fr/docs/00/81/65/66/PDF/tke2012-submission-36.pdf (2012)
|
|
BASE
|
|
Show details
|
|
12 |
A new web interface to facilitate access to corpora: development of the ASLLRP data access interface
|
|
|
|
In: http://www.bu.edu/linguistics/UG/LREC2012/LREC-dai-final.pdf (2012)
|
|
BASE
|
|
Show details
|
|
13 |
TimeBankPT: A TimeML annotated corpus of Portuguese
|
|
|
|
In: http://www.di.fc.ul.pt/~ahb/CostaBrancoLREC2012.pdf (2012)
|
|
BASE
|
|
Show details
|
|
14 |
A generic formalism to represent linguistic corpora in RDF and OWL/DL
|
|
|
|
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/915_Paper.pdf (2012)
|
|
BASE
|
|
Show details
|
|
15 |
Building a learner corpus
|
|
|
|
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/992_Paper.pdf (2012)
|
|
BASE
|
|
Show details
|
|
16 |
Pexacc: A parallel sentence mining algorithm from comparable corpora
|
|
|
|
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/382_Paper.pdf (2012)
|
|
BASE
|
|
Show details
|
|
17 |
Text Simplification Tools for Spanish
|
|
|
|
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/762_Paper.pdf (2012)
|
|
BASE
|
|
Show details
|
|
18 |
Native language detection with ’cheap’ learner corpora
|
|
|
|
In: http://ftp.cs.toronto.edu/pub/gh/Brooke+Hirst-LRCbook-2013.pdf (2011)
|
|
BASE
|
|
Show details
|
|
19 |
Native language detection with ’cheap’ learner corpora
|
|
|
|
In: http://ftp.cs.toronto.edu/pub/gh/Brooke+Hirst-LCR-2012-OLD.pdf (2011)
|
|
BASE
|
|
Show details
|
|
20 |
Collecting spatial information for locations in a text-to-scene conversion system
|
|
|
|
In: http://ceur-ws.org/Vol-759/paper03.pdf (2011)
|
|
BASE
|
|
Show details
|
|
|
|