DE eng

Search in the Catalogues and Directories

Hits 1 – 1 of 1

1
The CoMeRe French CMC corpora and their modeling in TEI
In: ird-cmc-rennes: Social Media and CMC Corpora for the eHumanities. ; https://halshs.archives-ouvertes.fr/halshs-01222979 ; ird-cmc-rennes: Social Media and CMC Corpora for the eHumanities., Oct 2015, Rennes, France ; http://ird-cmc-rennes.sciencesconf.org/ (2015)
Abstract: International audience ; CoMeRe (acronym which in French stands for network mediated communication) is a national project involving researchers from 8 different research units to develop a repos-itory of CMC all modeled within the same extension of the TEI (Chanier et al. 2014). The project was carried out from 2013 to 2015 with the support of Corpus-Ecrits (http://corpusecrits.huma-num.fr/, a national research consortium on written corpora) and Ortolang (http://www.ortolang.fr, a national infrastructure for tools and corpora on French language),.Three key principles underlie CoMeRe: variety, openness and standards. “Variety” is one of our keywords since we have assembled interactions stemming from networks such as the Internet or telecommunications (mobile phones), as well as mono- and multimodal, and synchronous and asynchronous communications. The genres covered within CoMeRe include text or oral chats, email, discussion forums, blogs, tweets, audio-graphic conferencing systems (conference systems with text, audio, and iconic signs for communication), or even collaborative working/learning environments with verbal and nonverbal communication. “Openness” is our second keyword. The first set of 11 corpora has been released (http://hdl.handle.net/11403/comere) as open data on Ortolang. Our wish to release CoMeRe corpora as open data stems from the fact that, although studies on new CMC communication genres draw much attention, there is cur-rently no existing dataset with significant coverage to form the basis for systematic re-search."Standards" refers to two different aspects. Firstly, corpora have been structured and referred to in a uniform way. The TEI-IS is the model developed as an extension of the TEI in order to encompass the Interaction Space (IS) of CMC multimodal discourse. “Standards” also refers to the uniform basic level of automatic annotations, related to segmentation and part of speech (POS) tagging which is underway.
Keyword: [SHS.LANGUE]Humanities and Social Sciences/Linguistics; CMC; CoMeRe; Computer-Mediated Communication; corpora
URL: https://halshs.archives-ouvertes.fr/halshs-01222979
https://halshs.archives-ouvertes.fr/halshs-01222979/file/teicmc4rennes.pdf
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
1
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern