41 |
TEI Lex-0: A Target Format for TEI-Encoded Dictionaries and Lexical Resources ...
|
|
|
|
BASE
|
|
Show details
|
|
42 |
TEI and the Mixtepec-Mixtec corpus: data integration, annotation and normalization of heterogeneous data for an under-resourced language
|
|
|
|
BASE
|
|
Show details
|
|
43 |
TEI and the Mixtepec-Mixtec corpus: data integration, annotation and normalization of heterogeneous data for an under-resourced language
|
|
|
|
BASE
|
|
Show details
|
|
48 |
Bridging the Gaps between Digital Humanities, Lexicography, and Linguistics: A TEI Dictionary for the Documentation of Mixtepec-Mixtec
|
|
|
|
In: ISSN: 2160-5076 ; Dictionaries: Journal of the Dictionary Society of North America ; https://hal.inria.fr/hal-01968871 ; Dictionaries: Journal of the Dictionary Society of North America, Dictionary Society of North America, 2018, 39 (2), pp.79-106 (2018)
|
|
BASE
|
|
Show details
|
|
49 |
TEI Lex-0: A Target Format for TEI-Encoded Dictionaries and Lexical Resources
|
|
|
|
In: TEI Conference and Members' Meeting ; https://hal.inria.fr/hal-02265312 ; TEI Conference and Members' Meeting, Sep 2018, Tokyo, Japan (2018)
|
|
Abstract:
International audience ; Achieving consistent encoding within a given community of practice has been a recurrent issue for the TEI Guidelines. The topic is of particular importance for lexical data if we think of the potential wealth of content we could gain from pooling together the information available in the variety of highly structured, historical and contemporary lexical resources. Still, the encoding possibilities offered by the Dictionaries Chapter in the Guidelines are too numerous and too flexible to guarantee sufficient interoperability and a coherent model for searching, visualising or enriching multiple lexical resources.Following the spirit of TEI Analytics [Zillig, 2009], developed in the context of the MONK project, TEI Lex-0 aims at establishing a target format to facilitate the interoperability of heterogeneously encoded lexical resources. This is important both in the context of building lexical infrastructures as such [Ermolaev and Tasovac, 2012] and in the context of developing generic TEI-aware tools such as dictionary viewers and profilers. The format itself should not necessarily be one which is used for editing or managing individual resources, but one to which they can be univocally transformed to be queried, visualised, or mined in a uniform way. We are also aiming to stay as aligned as possible with the TEI subset developed in conjunction with the revision of the ISO LMF (Lexical Markup Framework) standard so that coherent design guidelines can be provided to the community (cf. [Romary, 2015]).The paper will provide an overview of the various domains covered by TEI Lex- 0 and the main decisions that were taken over the last 18 months: constraining the general structure of a lexical entry; offering mechanisms to overcome the limits of when used in retro-digitized dictionaries (by allowing, for instance, and as children of ); systematizing the representation of morpho-syntactic information [Bański et al., 2017]; providing a strict -based encoding of sense-related information; deprecating ; dealing with internal and external references in dictionary entries, providing more advanced encodings of etymology (see submission by Bowers, Herold and Romary); as well as defining technical constraints on the systematic use of @xml:id at different levels of the dictionary microstructure. The activity of the group has already lead to changes in the Guidelines in response to specific GitHub tickets.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [SCCO.LING]Cognitive science/Linguistics
|
|
URL: https://hal.inria.fr/hal-02265312 https://hal.inria.fr/hal-02265312/document https://hal.inria.fr/hal-02265312/file/TEI%20Lex%200%20pres%20-%20Tasovac_rev.pdf
|
|
BASE
|
|
Hide details
|
|
50 |
Enhancing Usability for Automatically Structuring Digitised Dictionaries
|
|
|
|
In: GLOBALEX workshop at LREC 2018 ; https://hal.archives-ouvertes.fr/hal-01708137 ; GLOBALEX workshop at LREC 2018, May 2018, Miyazaki, Japan (2018)
|
|
BASE
|
|
Show details
|
|
51 |
Retro-digitizing and Automatically Structuring a Large Bibliography Collection
|
|
|
|
In: European Association for Digital Humanities (EADH) Conference ; https://hal.archives-ouvertes.fr/hal-01941534 ; European Association for Digital Humanities (EADH) Conference, EADH, Dec 2018, Galway, Ireland (2018)
|
|
BASE
|
|
Show details
|
|
52 |
A stand-off XML-TEI representation of reference annotation
|
|
|
|
In: DGfS 2018: 40. Jahrestagung der Deutschen Gesellschaft für Sprachwissenschaft ; https://hal.inria.fr/hal-01876327 ; DGfS 2018: 40. Jahrestagung der Deutschen Gesellschaft für Sprachwissenschaft, Mar 2018, Stuttgart, Germany. 2017 (2018)
|
|
BASE
|
|
Show details
|
|
53 |
A Diachronic Digital Edition of the Petit Larousse illustré
|
|
|
|
In: Journée d'étude CORLI : Traitements et standardisation des corpus multimodaux et web 2.0. ; https://hal.archives-ouvertes.fr/hal-01873805 ; Journée d'étude CORLI : Traitements et standardisation des corpus multimodaux et web 2.0., May 2018, Paris, France (2018)
|
|
BASE
|
|
Show details
|
|
54 |
Automatically Encoding Encyclopedic-like Resources in TEI
|
|
|
|
In: The annual TEI Conference and Members Meeting ; https://hal.inria.fr/hal-01819505 ; The annual TEI Conference and Members Meeting, Sep 2018, Tokyo, Japan ; https://tei2018.dhii.asia/ (2018)
|
|
BASE
|
|
Show details
|
|
55 |
TEI-Lex0 Etym -towards terse(r) recommendations for the encoding of etymological information
|
|
|
|
In: TEI Conference and Members' Meeting ; https://hal.inria.fr/hal-02075506 ; TEI Conference and Members' Meeting, Sep 2018, Tokyo, Japan (2018)
|
|
BASE
|
|
Show details
|
|
56 |
Encoding Mixtepec-Mixtec Etymology in TEI
|
|
|
|
In: TEI Conference and Members' Meeting ; https://hal.inria.fr/hal-02003975 ; TEI Conference and Members' Meeting, Sep 2018, Tokyo, Japan (2018)
|
|
BASE
|
|
Show details
|
|
57 |
Presenting the Nénufar Project: a Diachronic Digital Edition of the Petit Larousse Illustré
|
|
|
|
In: GLOBALEX 2018 - Globalex workshop at LREC2018 ; https://hal.archives-ouvertes.fr/hal-01728328 ; GLOBALEX 2018 - Globalex workshop at LREC2018, May 2018, Miyazaki, Japan. pp.1-6 ; https://globalex.link/globalex2018/ (2018)
|
|
BASE
|
|
Show details
|
|
59 |
TBX in ODD: Schema-agnostic specification and documentation for TermBase eXchange
|
|
|
|
In: LOTKS 2017- Workshop on Language, Ontology, Terminology and Knowledge Structures ; https://hal.inria.fr/hal-01581440 ; LOTKS 2017- Workshop on Language, Ontology, Terminology and Knowledge Structures, Sep 2017, Montpellier, France ; https://langandonto.github.io/LangOnto-TermiKS-2017/ (2017)
|
|
BASE
|
|
Show details
|
|
60 |
Automatic Extraction of TEI Structures in Digitized Lexical Resources using Conditional Random Fields
|
|
|
|
In: electronic lexicography, eLex 2017 ; https://hal.archives-ouvertes.fr/hal-01508868 ; electronic lexicography, eLex 2017, Sep 2017, Leiden, Netherlands (2017)
|
|
BASE
|
|
Show details
|
|
|
|