1 |
First version (v1) of the integrated platform and documentation
|
|
|
|
BASE
|
|
|
|
2 |
Criteria for evaluation of resources, technology and integration
|
|
|
|
BASE
|
|
|
|
4 |
Third evaluation report. Evaluation of PANACEA v3 and produced resources
|
|
|
|
BASE
|
|
|
|
6 |
Integrated Final Version of the Components for Lexical Acquisition
|
|
|
|
BASE
|
|
|
|
8 |
User’s Workshop: Technology transfer, papers produced and dissemination materials
|
|
|
|
BASE
|
|
|
|
9 |
Travelling Object definition for multilevel lexicon in PANACEA platform
|
|
|
|
BASE
|
|
|
|
10 |
PANACEA, Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies
|
|
|
|
BASE
|
|
|
|
11 |
Monolingual corpus acquired in five languages and two domains
|
|
|
|
BASE
|
|
|
|
12 |
Platform Software, Project Tools + Resources Licensing Policy and Exploitation Plan
|
|
|
|
BASE
|
|
|
|
13 |
Technologies and tools for corpus creation, normalization and annotation
|
|
Prokopidis, Prokopis; Papavassiliou, Vassilis; Pecina, Pavel; Rimell, Laura; Poibeau, Thierry; Bartolini, Roberto; Caselli, Tommaso; Frontini, Francesca; Aleksic, Vera; Thurmair, Gregor; Poch, Marc; Bel Rafecas, Núria; Hamon, Olivier
|
|
Abstract:
The objective of the Corpus Acquisition and Annotation (CAA) subsystem is the acquisition and processing of the monolingual and bilingual language resources (LRs) required in the PANACEA context. The CAA subsystem therefore includes: i) a Corpus Acquisition Component (CAC) for extracting monolingual and bilingual data from the web, ii) a Cleanup and Normalization Component (CNC) for these data, and iii) a Text Processing Component (TPC), which consists of NLP tools including modules for sentence splitting, POS tagging, lemmatization, parsing and named entity recognition. Section 2 presents the terminology used in this report. Sections 3, 4 and 5 review the state of the art and existing tools for corpus acquisition, corpus normalization and text processing, respectively. Section 6 discusses the resources to be produced in the context of WP4. Section 7 presents the solution path we aim to explore for generating these resources.
|
|
Keywords:
automatic acquisition of lexicon; natural language processing; Panacea Project
|
|
URL: http://hdl.handle.net/10230/22510
|
|
BASE
|
|
|
|
15 |
PANACEA: Language Resource Factory. Data Available for free!
|
|
|
|
BASE
|
|
|
|
16 |
A Web Service-Based Platform for the Automatic Production of Language Resources
|
|
|
|
BASE
|
|
|
|
17 |
Third version (v4) of the integrated platform and documentation
|
|
|
|
BASE
|
|
|
|
19 |
[What is it?] PANACEA: A Web Service-Based Platform for the Automatic Production of Language Resources
|
|
|
|
BASE
|
|
|
|
|
|