81 |
Paris and Stanford at EPE 2017: Downstream Evaluation of Graph-based Dependency Representations
|
|
|
|
In: EPE 2017 - The First Shared Task on Extrinsic Parser Evaluation ; https://hal.inria.fr/hal-01592051 ; EPE 2017 - The First Shared Task on Extrinsic Parser Evaluation, Sep 2017, Pisa, Italy. pp.47-59 ; http://epe.nlpl.eu (2017)
|
|
BASE
|
|
Show details
|
|
82 |
Computational methods for descriptive and theoretical morphology: a brief introduction
|
|
|
|
In: ISSN: 1871-5621 ; EISSN: 1871-5656 ; Morphology ; https://hal.inria.fr/hal-01628253 ; Morphology, Springer Verlag, 2017, Computational methods for descriptive and theoretical morphology, 27 (4), pp.1-7. ⟨10.1017/CBO9781139248860⟩ (2017)
|
|
BASE
|
|
Show details
|
|
83 |
Annotating omission in statement pairs
|
|
|
|
In: 11th Linguistic Annotation Workshop ; https://hal.inria.fr/hal-01584035 ; 11th Linguistic Annotation Workshop, Apr 2017, Valencia, Spain. pp.41-45 (2017)
|
|
BASE
|
|
Show details
|
|
84 |
Speeding up corpus development for linguistic research: language documentation and acquisition in Romansh Tuatschin
|
|
|
|
In: Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature ; https://hal.inria.fr/hal-01570614 ; Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, Aug 2017, Vancouver, Canada. pp.89 - 94, ⟨10.18653/v1/W17-2212⟩ ; https://sighum.wordpress.com/events/latech-clfl-2017/ (2017)
|
|
Abstract:
International audience ; In this paper, we present ongoing work for developing language resources and basic NLP tools for an undocumented variety of Romansh, in the context of a language documentation and language acquisition project. Our tools are designed to improve the speed and reliability of corpus annotations for noisy data involving large amounts of code-switching, occurrences of child speech and orthographic noise. Being able to increase the efficiency of language resource development for language documentation and acquisition research also constitutes a step towards solving the data sparsity issues with which researchers have been struggling.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; Corpus annotation and tagging; Language documentation methodology; Natural Language Processing; Romansh Tuatschin
|
|
URL: https://hal.inria.fr/hal-01570614/document https://hal.inria.fr/hal-01570614 https://doi.org/10.18653/v1/W17-2212 https://hal.inria.fr/hal-01570614/file/speeding-corpus-development-10.pdf
|
|
BASE
|
|
Hide details
|
|
85 |
Milk and the Indo-Europeans
|
|
|
|
In: Language Dispersal Beyond Farming ; https://hal.inria.fr/hal-01667476 ; Martine Robeets; Alexander Savalyev Language Dispersal Beyond Farming, John Benjamins Publishing Company, pp.291-311, 2017, 978 90 272 1255 9. ⟨10.1075/z.215.13gar⟩ (2017)
|
|
BASE
|
|
Show details
|
|
88 |
Milk and the Indo-Europeans
|
|
|
|
In: Language Dispersal Beyond Farming ; https://hal.inria.fr/hal-01667476 ; Martine Robeets; Alexander Savalyev Language Dispersal Beyond Farming, John Benjamins Publishing Company, pp.291-311, 2017, 978 90 272 1255 9. ⟨10.1075/z.215.13gar⟩ (2017)
|
|
BASE
|
|
Show details
|
|
89 |
From Noisy Questions to Minecraft Texts: Annotation Challenges in Extreme Syntax Scenarios
|
|
|
|
In: 2nd Workshop on Noisy User-generated Text (W-NUT) at CoLing 2016 ; https://hal.inria.fr/hal-01584054 ; 2nd Workshop on Noisy User-generated Text (W-NUT) at CoLing 2016, Dec 2016, Osaka, Japan (2016)
|
|
BASE
|
|
Show details
|
|
90 |
External Lexical Information for Multilingual Part-of-Speech Tagging ...
|
|
|
|
BASE
|
|
Show details
|
|
91 |
Constructing a poor man’s wordnet in a resource-rich world
|
|
|
|
In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.inria.fr/hal-01174492 ; Language Resources and Evaluation, Springer Verlag, 2015, 49 (3), pp.601-635. ⟨10.1007/s10579-015-9295-6⟩ (2015)
|
|
BASE
|
|
Show details
|
|
92 |
Could Greek and Italic share a same Indo-European substratum?
|
|
|
|
In: 22nd International Conference on Historical Linguistics ; https://hal.inria.fr/hal-01256310 ; 22nd International Conference on Historical Linguistics, Jul 2015, Naples, Italy ; http://www.ichl22.unina.it (2015)
|
|
BASE
|
|
Show details
|
|
93 |
Developing a French FrameNet: Methodology and First results
|
|
|
|
In: LREC - The 9th edition of the Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-01022385 ; LREC - The 9th edition of the Language Resources and Evaluation Conference, May 2014, Reykjavik, Iceland (2014)
|
|
BASE
|
|
Show details
|
|
94 |
A language-independent and fully unsupervised approach to lexicon induction and part-of-speech tagging for closely related languages
|
|
|
|
In: Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-01022298 ; Language Resources and Evaluation Conference, European Language Resources Association, May 2014, Reykjavik, Iceland (2014)
|
|
BASE
|
|
Show details
|
|
95 |
Data-driven Synset Induction and Disambiguation for Wordnet Development
|
|
|
|
In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.inria.fr/hal-01088000 ; Language Resources and Evaluation, Springer Verlag, 2014, 48 (4), pp.655-677. ⟨10.1007/s10579-014-9291-2⟩ (2014)
|
|
BASE
|
|
Show details
|
|
96 |
Crowdsourcing for Language Resource Development: Criticisms About Amazon Mechanical Turk Overpowering Use
|
|
|
|
In: Human Language Technology Challenges for Computer Science and Linguistics ; https://hal.inria.fr/hal-01053047 ; Vetulani, Zygmunt and Mariani, Joseph. Human Language Technology Challenges for Computer Science and Linguistics, 8387, Springer International Publishing, pp.303-314, 2014, Lecture Notes in Computer Science, 978-3-319-08957-7. ⟨10.1007/978-3-319-08958-4_25⟩ (2014)
|
|
BASE
|
|
Show details
|
|
97 |
The CoMeRe corpus for French: structuring and annotating heterogeneous CMC genres
|
|
|
|
In: ISSN: 0175-1336 ; Journal for language technology and computational linguistics ; https://halshs.archives-ouvertes.fr/halshs-00953507 ; Journal for language technology and computational linguistics, GSCL (Gesellschaft für Sprachtechnologie und Computerlinguistik) 2014, 29 (2), pp.1-30 ; http://www.jlcl.org/2014_Heft2/Heft2-2014.pdf (2014)
|
|
BASE
|
|
Show details
|
|
98 |
The Opacity-Compactness Tradeoff: Morphomic Features for an Economical Account of Khaling Verbal Inflection
|
|
|
|
In: 16th International Morphology Meeting (IMM 16) ; https://hal.inria.fr/hal-01114854 ; 16th International Morphology Meeting (IMM 16), May 2014, Budapest, Hungary (2014)
|
|
BASE
|
|
Show details
|
|
99 |
The Opacity-Compactness Tradeoff: Morphomic Features for an Economical Account of Khaling Verbal Inflection
|
|
|
|
In: 16th International Morphology Meeting (IMM 16) ; https://hal.inria.fr/hal-01114854 ; 16th International Morphology Meeting (IMM 16), May 2014, Budapest, Hungary (2014)
|
|
BASE
|
|
Show details
|
|
100 |
A language-independent and fully unsupervised approach to lexicon induction and part-of-speech tagging for closely related languages
|
|
|
|
In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014) (2014)
|
|
BASE
|
|
Show details
|
|
|
|