3 |
4-Couv, a Backcover Treebank
|
|
|
|
In: Treebanks and Linguistic Theories 14, Dec 2015, Warsaw, Poland, pp. 249-257 (2015) ; https://hal.archives-ouvertes.fr/hal-01498941
|
|
Abstract:
We introduce 4-Couv, a treebanking project that aims to develop a large multipurpose treebank for the French language. The main characteristic of this project is that it provides adequate material for both linguistic and psycholinguistic research. The treebank is made up of short, self-contained texts selected from a corpus of back covers from different publishers and different genres. Such material makes classical linguistic research possible (especially in syntax and discourse), but also offers new perspectives in experimental linguistics: because the texts are short and semantically coherent, they fit the requirements of eye-tracking or electroencephalographic recordings. At this stage, 4-Couv contains 3,500 trees that were automatically tagged and parsed, then manually corrected. Its format is compatible with other standard French treebanks. In this paper we present the treebank, its annotation, and the treebanking tools developed for the different stages of its elaboration: text selection, tagging, parsing, and manual correction.
|
|
Keywords:
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; Annotation scheme; Annotation tools; Automatic parsing; Automatic tagging; Experimental linguistics; Parsing; Stochastic parsing; Tagging; Treebank
|
|
URL: https://hal.archives-ouvertes.fr/hal-01498941/document https://hal.archives-ouvertes.fr/hal-01498941 https://hal.archives-ouvertes.fr/hal-01498941/file/TLT2015-short-Final.join-u8_PB.pdf
|
|
BASE
|
|
4 |
Création d'un nouveau treebank à partir de quatrièmes de couverture [Creating a new treebank from back covers]
|
|
|
|
In: Traitement Automatique des Langues Naturelles 22, Jun 2015, Caen, France, pp. 480-486 (2015) ; https://hal.archives-ouvertes.fr/hal-01498946
|
|
BASE
|
|
5 |
Chinese computational linguistics and natural language processing based on naturally annotated big data : 13th China national conference, CCL 2014 and second international symposium, NLP-NABD 2014, Wuhan, China, October 18 - 19, 2014 : proceedings
|
|
|
|
BLLDB
|
|
UB Frankfurt Linguistik
|
|
13 |
A treebank-based study on the influence of Italian word order on parsing performance
|
|
|
|
BASE
|
|
14 |
Harmonization and Merging of two Italian Dependency Treebanks
|
|
|
|
BASE
|
|