1 |
MarsaTag, a tagger for French written texts and speech transcriptions
|
|
|
|
In: Second Asian Pacific Corpus linguistics Conference ; https://hal.archives-ouvertes.fr/hal-01500736 ; Second Asian Pacific Corpus linguistics Conference, Mar 2014, Hong Kong, China. pp.220-220 (2014)
|
|
BASE
|
|
Show details
|
|
2 |
Phrase extraction and rescoring in statistical machine translation
|
|
Srivastava, Ankit Kumar. - : Dublin City University. Centre for Next Generation Localisation (CNGL), 2014. : Dublin City University. School of Computing, 2014
|
|
In: Srivastava, Ankit Kumar (2014) Phrase extraction and rescoring in statistical machine translation. PhD thesis, Dublin City University. (2014)
|
|
BASE
|
|
Show details
|
|
3 |
Deep Syntax Annotation of the Sequoia French Treebank
|
|
|
|
In: International Conference on Language Resources and Evaluation (LREC) ; https://hal.inria.fr/hal-00969191 ; International Conference on Language Resources and Evaluation (LREC), May 2014, Reykjavik, Iceland (2014)
|
|
BASE
|
|
Show details
|
|
4 |
Rhapsodie: a Prosodic-Syntactic Treebank for Spoken French
|
|
|
|
In: Language Resources and Evaluation Conference ; https://hal.sorbonne-universite.fr/hal-00968959 ; Language Resources and Evaluation Conference, May 2014, Reykjavik, Iceland (2014)
|
|
Abstract:
International audience ; The main objective of the Rhapsodie project (ANR Rhapsodie 07 Corp-030-01) was to define rich, explicit, and reproducible schemes for the annotation of prosody and syntax in different genres (± spontaneous, ± planned, face-to-face interviews vs. broadcast, etc.), in order to study the prosody/syntax/discourse interface in spoken French, and their roles in the segmentation of speech into discourse units (Lacheret, Kahane, & Pietrandrea forthcoming). We here describe the deliverable, a syntactic and prosodic treebank of spoken French, composed of 57 short samples of spoken French (5 minutes long on average, amounting to 3 hours of speech and 33000 words), orthographically and phonetically transcribed. The transcriptions and the annotations are all aligned on the speech signal: phonemes, syllables, words, speakers, overlaps. This resource is freely available at www.projet-rhapsodie.fr. The sound samples (wav/mp3), the acoustic analysis (original F0 curve manually corrected and automatic stylized F0, pitch format), the orthographic transcriptions (txt), the microsyntactic annotations (tabular format), the macrosyntactic annotations (txt, tabular format), the prosodic annotations (xml, textgrid, tabular format), and the metadata (xml and html) can be freely downloaded under the terms of the Creative Commons licence Attribution - Noncommercial - Share Alike 3.0 France. The metadata are encoded in the IMDI-CMFI format and can be parsed on line.
|
|
Keyword:
[INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing; prosodic annotation; spoken French Treebank; syntactic annotation
|
|
URL: https://hal.sorbonne-universite.fr/hal-00968959/file/LREC2014_AL.pdf https://hal.sorbonne-universite.fr/hal-00968959/document https://hal.sorbonne-universite.fr/hal-00968959
|
|
BASE
|
|
Hide details
|
|
5 |
Correcting and Validating Syntactic Dependency in the Spoken French Treebank Rhapsodie
|
|
|
|
In: Proceedings of the 9th Language Resources and Evaluation Conference (LREC) ; https://halshs.archives-ouvertes.fr/halshs-01011059 ; Proceedings of the 9th Language Resources and Evaluation Conference (LREC), 2014, Iceland. pp.1-6 (2014)
|
|
BASE
|
|
Show details
|
|
12 |
Building Computational Resources : The URDU.KON-TB Treebank and the Urdu Parser
|
|
|
|
BASE
|
|
Show details
|
|
13 |
From Syntax to Semantics. First Steps Towards Tectogrammatical Annotation of Latin
|
|
Passarotti, Marco Carlo (orcid:0000-0002-9806-7187). - : The Association for Computational Linguistics, 2014. : country:SWE, 2014. : place:Gothenburg, 2014
|
|
BASE
|
|
Show details
|
|
14 |
Reflexões sobre anotação sintática e ferramentas de busca - Uso da linguagem XML para anotação sintática no corpus digital DOViC
|
|
|
|
In: Letras & Letras; v. 30, n. 2 (2014): Linguística de Corpus: abordagem e metodologia em pesquisas linguísticas de base empírica; 82-103 ; 1981-5239 (2014)
|
|
BASE
|
|
Show details
|
|
15 |
Challenges in Enhancing the Index Thomisticus Treebank with Semantic and Pragmatic Annotation
|
|
|
|
BASE
|
|
Show details
|
|
|
|