1 |
TEI Encoding of a Classical Mixtec Dictionary Using GROBID- Dictionaries
|
|
|
|
In: ELEX 2019: Smart Lexicography ; https://hal.inria.fr/hal-02264033 ; ELEX 2019: Smart Lexicography, Oct 2019, Sintra, Portugal ; https://elex.link/elex2019/ (2019)
|
|
Abstract:
International audience ; This paper presents the application of GROBID-Dictionaries (Khemakhem et al. 2017, Khemakhem et al. 2018a, Khemakhem et al. 2018b, Khemakhem et al. 2018c), an open source machine learning system for automatically structuring print dictionaries in digital format into TEI (Text Encoding Initiative) to a historical lexical resource of Colonial Mixtec 'Voces del Dzaha Dzahui' published by the Dominican fray Francisco Alvarado in the year 1593. The GROBID-Dictionaries application was applied to a reorganized and modernized version of the historical resource published by Jansen and Perez Jiménez (2009). The TEI dictionary produced will be integrated into a language documentation project dealing with Mixtepec-Mixtec (ISO 639-3: mix) (Bowers & Romary, 2017, 2018a, 2018b) an under-resourced indigenous language native to the Juxtlahuaca district of Oaxaca Mexico.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-DL]Computer Science [cs]/Digital Libraries [cs.DL]; GROBID-Dictionaries; Mixtec; TEI
|
|
URL: https://hal.inria.fr/hal-02264033/file/eLex_2019_abstract_111.pdf https://hal.inria.fr/hal-02264033 https://hal.inria.fr/hal-02264033/document
|
|
BASE
|
|
Hide details
|
|
2 |
Enhancing Usability for Automatically Structuring Digitised Dictionaries
|
|
|
|
In: GLOBALEX workshop at LREC 2018 ; https://hal.archives-ouvertes.fr/hal-01708137 ; GLOBALEX workshop at LREC 2018, May 2018, Miyazaki, Japan (2018)
|
|
BASE
|
|
Show details
|
|
3 |
Automatically Encoding Encyclopedic-like Resources in TEI
|
|
|
|
In: The annual TEI Conference and Members Meeting ; https://hal.inria.fr/hal-01819505 ; The annual TEI Conference and Members Meeting, Sep 2018, Tokyo, Japan ; https://tei2018.dhii.asia/ (2018)
|
|
BASE
|
|
Show details
|
|
4 |
Presenting the Nénufar Project: a Diachronic Digital Edition of the Petit Larousse Illustré
|
|
|
|
In: GLOBALEX 2018 - Globalex workshop at LREC2018 ; https://hal.archives-ouvertes.fr/hal-01728328 ; GLOBALEX 2018 - Globalex workshop at LREC2018, May 2018, Miyazaki, Japan. pp.1-6 ; https://globalex.link/globalex2018/ (2018)
|
|
BASE
|
|
Show details
|
|
5 |
Automatic Extraction of TEI Structures in Digitized Lexical Resources using Conditional Random Fields
|
|
|
|
In: electronic lexicography, eLex 2017 ; https://hal.archives-ouvertes.fr/hal-01508868 ; electronic lexicography, eLex 2017, Sep 2017, Leiden, Netherlands (2017)
|
|
BASE
|
|
Show details
|
|
6 |
Deep encoding of etymological information in TEI
|
|
|
|
In: ISSN: 2162-5603 ; EISSN: 2162-5603 ; Journal of the Text Encoding Initiative ; https://hal.inria.fr/hal-01296498 ; Journal of the Text Encoding Initiative, TEI Consortium, 2017, ⟨10.4000/jtei.1643⟩ ; https://jtei.revues.org/1643 (2017)
|
|
BASE
|
|
Show details
|
|
7 |
TEI across corpora, languages and genres: Towards a standard for the representation of social media and computer-mediated communication
|
|
|
|
In: Text Encoding Initiative: connect, animate, innovate. 2015 Annual Conference and Members’ Meeting of the TEI Consortium ; https://halshs.archives-ouvertes.fr/halshs-01222982 ; Text Encoding Initiative: connect, animate, innovate. 2015 Annual Conference and Members’ Meeting of the TEI Consortium, TEI Consortium, Oct 2015, Lyon, France ; http://tei2015.huma-num.fr (2015)
|
|
BASE
|
|
Show details
|
|
8 |
Computer-mediated communication in TEI: What lies ahead
|
|
|
|
In: The Linked TEI: Text Encoding in the Web. 2013 Annual Conference and Members' Meeting of the TEI Consortium ; https://halshs.archives-ouvertes.fr/halshs-00878833 ; The Linked TEI: Text Encoding in the Web. 2013 Annual Conference and Members' Meeting of the TEI Consortium, Oct 2013, Rome, Italy (2013)
|
|
BASE
|
|
Show details
|
|
|
|