DE eng

Search in the Catalogues and Directories

Hits 1 – 15 of 15

1
Tourism English-Croatian Parallel Corpus 2.0
Toral, Antonio; Esplà-Gomis, Miquel; Klubička, Filip. - : Abu-MaTran project, 2016
BASE
Show details
2
Domain adaptation of statistical machine translation with domain-focused web crawling [<Journal>]
Pecina, Pavel [Verfasser]; Toral, Antonio [Verfasser]; Papavassiliou, Vassilis [Verfasser].
DNB Subject Category Language
Show details
3
Mining and exploiting domain-specific corpora in the PANACEA platform
Bel Rafecas, Núria; Prokopidis, Prokopis; Toral, Antonio. - : ELRA (European Language Resources Association)
BASE
Show details
4
Task-based Evaluation of the PANACEA Production Chain
BASE
Show details
5
Criteria for evaluation of resources, technology and integration
BASE
Show details
6
Final Report on the Corpus Acquisition & Annotation subsystem and its components
BASE
Show details
7
English-French and English-Greek parallel corpus for the Environment and Labour Legislation domains
BASE
Show details
8
Monolingual corpus acquired in five languages and two domains
BASE
Show details
9
Second evaluation report. Evaluation of PANACEA v2 and produced resources
BASE
Show details
10
Technologies and tools for corpus creation, normalization and annotation
Abstract: The objectives of the Corpus Acquisition and Annotation (CAA) subsystem are the acquisition and processing of monolingual and bilingual language resources (LRs) required in the PANACEA context. Therefore, the CAA subsystem includes: i) a Corpus Acquisition Component (CAC) for extracting monolingual and bilingual data from the web, ii) a component for cleanup and normalization (CNC) of these data and iii) a text processing component (TPC) which consists of NLP tools including modules for sentence splitting, POS tagging, lemmatization, parsing and named entity recognition. This report presents the terminology used in this document in Section 2. The report explaines state-of-the-art and existing tools for corpus acquisition, corpus normalization, and text processing in Sections 3, 4 and 5 respectively. The resources to be produced in the context of WP4 are discussed in Section/n6. In Section 7 it presents the solution path we aim to explore for generating these resources.
Keyword: automatic acquisition of lexicon; natural language processing; Panacea Project
URL: http://hdl.handle.net/10230/22510
BASE
Hide details
11
Tool-based Evaluation of the PANACEA Production Chain
BASE
Show details
12
Report on the revised Corpus Acquisition & Annotation subsystem and its components
BASE
Show details
13
First evaluation report. Evaluation of PANACEA v1 and produced resources
BASE
Show details
14
Initial functional prototype and documentation describing the initial CAA subsystem and its components
BASE
Show details
15
Second version (v2) of the integrated platform and documentation
BASE
Show details

Catalogues
0
0
0
0
1
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
14
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern