41 |
TEI Lex-0: A Target Format for TEI-Encoded Dictionaries and Lexical Resources ...
BASE
42 |
TEI and the Mixtepec-Mixtec corpus: data integration, annotation and normalization of heterogeneous data for an under-resourced language
BASE
43 |
TEI and the Mixtepec-Mixtec corpus: data integration, annotation and normalization of heterogeneous data for an under-resourced language
BASE
48 |
Bridging the Gaps between Digital Humanities, Lexicography, and Linguistics: A TEI Dictionary for the Documentation of Mixtepec-Mixtec
In: ISSN: 2160-5076 ; Dictionaries: Journal of the Dictionary Society of North America ; https://hal.inria.fr/hal-01968871 ; Dictionaries: Journal of the Dictionary Society of North America, Dictionary Society of North America, 2018, 39 (2), pp.79-106 (2018)
Abstract:
This paper discusses the digital dictionary component of an ongoing language documentation project for the Mixtepec-Mixtec language (ISO 639-3: mix). Mixtepec-Mixtec (Sa'an Savi 'rain language') is an Oto-Manguean language spoken by roughly 9,000-10,000 people in the Juxtlahuaca district of Oaxaca, Mexico. Creating a digital dictionary for an under-resourced language entails a number of challenges that require unique and nuanced encoding solutions, which must strike a delicate balance between linguistic content, data structure, potential linked resources, and editorial metadata. Herein we demonstrate how we use TEI to create a reusable, extensible, and machine-readable language resource. We emphasize how our solutions, which combine novel and established TEI dictionary structures, address our specific needs for Mixtepec-Mixtec and also provide a relevant roadmap for similar under-resourced language projects.
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-DS]Computer Science [cs]/Data Structures and Algorithms [cs.DS]; [SCCO.LING]Cognitive science/Linguistics; Dictionary encoding; Digital humanities; Language documentation; Mixtec; TEI
URL: https://hal.inria.fr/hal-01968871/document https://hal.inria.fr/hal-01968871/file/04_RWiP_Bowers-Romary-edited-Authors-copy.pdf https://hal.inria.fr/hal-01968871
BASE
49 |
TEI Lex-0: A Target Format for TEI-Encoded Dictionaries and Lexical Resources
In: TEI Conference and Members' Meeting ; https://hal.inria.fr/hal-02265312 ; TEI Conference and Members' Meeting, Sep 2018, Tokyo, Japan (2018)
BASE
50 |
Enhancing Usability for Automatically Structuring Digitised Dictionaries
In: GLOBALEX workshop at LREC 2018 ; https://hal.archives-ouvertes.fr/hal-01708137 ; GLOBALEX workshop at LREC 2018, May 2018, Miyazaki, Japan (2018)
BASE
51 |
Retro-digitizing and Automatically Structuring a Large Bibliography Collection
In: European Association for Digital Humanities (EADH) Conference ; https://hal.archives-ouvertes.fr/hal-01941534 ; European Association for Digital Humanities (EADH) Conference, EADH, Dec 2018, Galway, Ireland (2018)
BASE
52 |
A stand-off XML-TEI representation of reference annotation
In: DGfS 2018: 40. Jahrestagung der Deutschen Gesellschaft für Sprachwissenschaft ; https://hal.inria.fr/hal-01876327 ; DGfS 2018: 40. Jahrestagung der Deutschen Gesellschaft für Sprachwissenschaft, Mar 2018, Stuttgart, Germany. 2017 (2018)
BASE
53 |
A Diachronic Digital Edition of the Petit Larousse illustré
In: Journée d'étude CORLI : Traitements et standardisation des corpus multimodaux et web 2.0. ; https://hal.archives-ouvertes.fr/hal-01873805 ; Journée d'étude CORLI : Traitements et standardisation des corpus multimodaux et web 2.0., May 2018, Paris, France (2018)
BASE
54 |
Automatically Encoding Encyclopedic-like Resources in TEI
In: The annual TEI Conference and Members Meeting ; https://hal.inria.fr/hal-01819505 ; The annual TEI Conference and Members Meeting, Sep 2018, Tokyo, Japan ; https://tei2018.dhii.asia/ (2018)
BASE
55 |
TEI-Lex0 Etym - towards terse(r) recommendations for the encoding of etymological information
In: TEI Conference and Members' Meeting ; https://hal.inria.fr/hal-02075506 ; TEI Conference and Members' Meeting, Sep 2018, Tokyo, Japan (2018)
BASE
56 |
Encoding Mixtepec-Mixtec Etymology in TEI
In: TEI Conference and Members' Meeting ; https://hal.inria.fr/hal-02003975 ; TEI Conference and Members' Meeting, Sep 2018, Tokyo, Japan (2018)
BASE
57 |
Presenting the Nénufar Project: a Diachronic Digital Edition of the Petit Larousse Illustré
In: GLOBALEX 2018 - Globalex workshop at LREC2018 ; https://hal.archives-ouvertes.fr/hal-01728328 ; GLOBALEX 2018 - Globalex workshop at LREC2018, May 2018, Miyazaki, Japan. pp.1-6 ; https://globalex.link/globalex2018/ (2018)
BASE
59 |
TBX in ODD: Schema-agnostic specification and documentation for TermBase eXchange
In: LOTKS 2017- Workshop on Language, Ontology, Terminology and Knowledge Structures ; https://hal.inria.fr/hal-01581440 ; LOTKS 2017- Workshop on Language, Ontology, Terminology and Knowledge Structures, Sep 2017, Montpellier, France ; https://langandonto.github.io/LangOnto-TermiKS-2017/ (2017)
BASE
60 |
Automatic Extraction of TEI Structures in Digitized Lexical Resources using Conditional Random Fields
In: electronic lexicography, eLex 2017 ; https://hal.archives-ouvertes.fr/hal-01508868 ; electronic lexicography, eLex 2017, Sep 2017, Leiden, Netherlands (2017)
BASE