DE eng

Search in the Catalogues and Directories

Hits 1 – 8 of 8

1
Third evaluation report. Evaluation of PANACEA v3 and produced resources
BASE
Show details
2
Final Report on the Corpus Acquisition & Annotation subsystem and its components
BASE
Show details
3
Integrated Final Version of the Components for Lexical Acquisition
BASE
Show details
4
Travelling Object definition for multilevel lexicon in PANACEA platform
BASE
Show details
5
Technologies and tools for corpus creation, normalization and annotation
BASE
Show details
6
Report on the revised Corpus Acquisition & Annotation subsystem and its components
Abstract: PANACEA WP4 targets the creation of a Corpus Acquisition and Annotation (CAA) subsystem for the acquisition and processing of monolingual and bilingual language resources (LRs). The CAA subsystem consists of tools that have been integrated as web services in the PANACEA platform of LR production. D4.2 Initial functional prototype and documentation in T13 provided documentation on the initial functional prototype of this subsystem, while this deliverable presents updates in the revised subsystem during the second development cycle of the project. The deliverable is structured as follows. A revised version of the Focused Monolingual Crawler (FMC), that has been implemented according to the results of the first evaluation cycle and the reviewers’ comments in the first annual review report, is described in section 2. New and revised versions of tools for corpus normalization (cleaning and deduplication) are detailed in section 3. Section 4 provides documentation on the NLP tools introduced for the first time in the subsystem. These tools focus mainly on sentence splitting/tokenization and POS tagging/lemmatization for English (EN), French (FR), Spanish (ES), German (DE), Italian (IT) and Greek (EL).
Keyword: automatic acquisition of lexicon; natural language processing; Panacea Project
URL: http://hdl.handle.net/10230/22513
BASE
Hide details
7
Merged dictionaries
BASE
Show details
8
Monolingual lexica for English, Spanish and Italian tuned for a particular domain (LAB and ENV)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
8
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern