3 |
DaCToR: A data collection tool for the RELATER project
|
|
|
|
Abstract:
Collecting domain-specific data for under-resourced languages, e.g., dialects of languages, can be very expensive, potentially financially prohibitive, and time-consuming. Moreover, in the case of rarely written languages, normalizing non-canonical transcriptions may be another time-consuming but necessary task. To collect domain-specific data under such circumstances in a time- and cost-efficient way, collecting read data from pre-prepared texts is often a viable option. To collect data in the domain of psychiatric diagnosis in Arabic dialects for the RELATER project, we have prepared the data collection tool DaCToR for collecting texts read by speakers in the respective countries and districts in which the dialects are spoken. In this paper we describe our tool, its purpose within the RELATER project, and the dialects which we have started to collect with the tool.
|
|
Keyword:
DATA processing & computer science; ddc:004; info:eu-repo/classification/ddc/004
|
|
URL: https://doi.org/10.5445/IR/1000127261 https://publikationen.bibliothek.kit.edu/1000127261/95368882 https://publikationen.bibliothek.kit.edu/1000127261
|
|
BASE
|
|
|
|
4 |
A Very Low Resource Language Speech Corpus for Computational Language Documentation Experiments
|
|
|
|
In: Language Resources and Evaluation Conference (LREC) ; https://hal.archives-ouvertes.fr/hal-01807093 ; Language Resources and Evaluation Conference (LREC), Nicoletta Calzolari (Conference chair) and Khalid Choukri and Christopher Cieri and Thierry Declerck and Sara Goggi and Koiti Hasida and Hitoshi Isahara and Bente Maegaard and Joseph Mariani and Hélène Mazo and Asuncion Moreno and Jan Odijk and Stelios Pi, May 2018, Miyazaki, Japan (2018)
|
|
BASE
|
|
|
|
5 |
Neural language codes for multilingual acoustic models
|
|
|
|
In: ISSN: 2308-457X (2018)
|
|
BASE
|
|
|
|
6 |
A Very Low Resource Language Speech Corpus for Computational Language Documentation Experiments ...
|
|
|
|
BASE
|
|
|
|
7 |
Modified polyphone decision tree specialization for porting multilingual grapheme based ASR systems to new languages
|
|
|
|
BASE
|
|
|
|
|
|