41 |
DravidianCodeMix: Sentiment Analysis and Offensive Language Identification Dataset for Dravidian Languages in Code-Mixed Text ...
|
|
|
|
BASE
|
|
Show details
|
|
42 |
Factors contributing to prefixation of biaspectual verbs in Croatian Dataset ...
|
|
|
|
BASE
|
|
Show details
|
|
43 |
Factors contributing to prefixation of biaspectual verbs in Croatian Dataset ...
|
|
|
|
BASE
|
|
Show details
|
|
44 |
Analyse diachronique du processus de déterminologisation. Une réflexion en diachronie courte en physique des particules ...
|
|
|
|
BASE
|
|
Show details
|
|
48 |
Loose and tight languages: A typology based on associations between constructions and lexemes ...
|
|
|
|
BASE
|
|
Show details
|
|
50 |
Loose and tight languages: A typology based on associations between constructions and lexemes ...
|
|
|
|
BASE
|
|
Show details
|
|
51 |
Yongning Na for Natural Language Processing: a single-speaker audio corpus with transcriptions ...
|
|
|
|
BASE
|
|
Show details
|
|
52 |
Yongning Na for Natural Language Processing: a single-speaker audio corpus with transcriptions ...
|
|
|
|
Abstract:
(français ci-dessous) This archive contains a dataset (audio files and transcriptions) of a minority language, Yongning Na (iso 639-3 code: nru). The archive contains a subset of the Na corpus of the Pangloss Collection: it is a single-speaker corpus, consisting of all the audio resources transcribed, for the main speaker of this corpus (Ms. LATAMI Dashilame). The corpus is versioned, so that the experiments carried out on these resources (for linguistic research or for Natural Language Processing) are fully reproducible. All relevant information is contained in YAML files (.yml extension; one in French, one in English). The data sub-folder contains the converted and demultiplexed audio files, as well as the annotations associated with each channel of the audio files. The summary files contain, among other things, the list of graphemes used in the language (complex graphemes are particularly important), as well as information on the various resources (audio and annotations), such as their identifiers (DOIs) ...
|
|
Keyword:
audio corpora; endangered languages; interdisciplinary research; language conservation; language documentation; multimedia linguistic resources; Naish languages; Sino-Tibetan languages
|
|
URL: https://zenodo.org/record/5336698 https://dx.doi.org/10.5281/zenodo.5336698
|
|
BASE
|
|
Hide details
|
|
59 |
Assessing Writing in French-as-a-Foreign-Language: Teacher Practices and Learner Uptake
|
|
|
|
In: Languages; Volume 6; Issue 4; Pages: 210 (2021)
|
|
BASE
|
|
Show details
|
|
60 |
ACTER (Annotated Corpora for Term Extraction Research) v1.4
|
|
Rigouts Terryn, Ayla. - : Ghent University, 2021. : LT3 Language and Translation Technology Team, 2021
|
|
BASE
|
|
Show details
|
|
|
|