Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2

Hits 1 – 20 of 29

1	Evaluating and Improving Child-Directed Automatic Speech Recognition
	Booth, Eric; Carns, Jake; Kennington, Casey...
	In: Computer Science Faculty Publications and Presentations (2020)
	BASE
	Show details

2	Questions, biases and ‘negation’: evidence from Scots varieties
	Jamieson, Elyse Anne. - : The University of Edinburgh, 2019
	BASE
	Show details

3	Collecting, Transcribing, Analyzing : Machine-Assisted Linguistic Fieldwork ; Collecter, Transcrire, Analyser : quand la machine assiste le linguiste dans son travail de terrain
	Gauthier, Elodie. - : HAL CCSD, 2018
	In: https://tel.archives-ouvertes.fr/tel-01893309 ; Informatique et langage [cs.CL]. Université Grenoble Alpes, 2018. Français. ⟨NNT : 2018GREAM011⟩ (2018)
	Abstract: In the last few decades, many scientists were concerned with the fast extinction of languages. Faced with this alarming decline of the world's linguistic heritage, action is urgently needed to enable fieldwork linguists, at least, to document languages by providing them innovative collection tools and to enable them to describe these languages. Machine assistance might be interesting to help them in such a task.This is what we propose in this work, focusing on three pillars of the linguistic fieldwork: collection, transcription and analysis.Recordings are essential, since they are the source material, the starting point of the descriptive work. Speech recording is also a valuable object for the documentation of the language.The growing proliferation of smartphones and other interactive voice mobile devices offer new opportunities for fieldwork linguists and researchers in language documentation. Field recordings should also include ethnolinguistic material which is particularly valuable to document traditions and way of living. However, large data collections require well organized repositories to access the content, with efficient file naming and metadata conventions.Thus, we have developed LIG-AIKUMA, a free Android app running on various mobile phones and tablets. The app aims to record speech for language documentation, over an innovative way.It includes a smart generation and handling of speaker metadata as well as respeaking and parallel audio data mapping.LIG-AIKUMA proposes a range of different speech collection modes (recording, respeaking, translation and elicitation) and offers the possibility to share recordings between users. Through these modes, parallel corpora are built such as "under-resourced speech - well-resourced speech", "speech - image", "speech - video", which are also of a great interest for speech technologies, especially for unsupervised learning.After the data collection step, the fieldwork linguist transcribes these data. Nonetheless, it can not be done -currently- on the whole collection, since the task is tedious and time-consuming.We propose to use automatic techniques to help the fieldwork linguist to take advantage of all his speech collection. Along these lines, automatic speech recognition (ASR) is a way to produce transcripts of the recordings, with a decent quality.Once the transcripts are obtained (and corrected), the linguist can analyze his data. In order to analyze the whole collection collected, we consider the use of forced alignment methods. We demonstrate that such techniques can lead to fine evaluation of linguistic features. In return, we show that modeling specific features may lead to improvements of the ASR systems. ; Depuis quelques décennies, de nombreux scientifiques alertent au sujet de la disparition des langues qui ne cesse de s'accélérer.Face au déclin alarmant du patrimoine linguistique mondial, il est urgent d'agir afin de permettre aux linguistes de terrain, a minima, de documenter les langues en leur fournissant des outils de collecte innovants et, si possible, de leur permettre de décrire ces langues grâce au traitement des données assisté par ordinateur.C'est ce que propose ce travail, en se concentrant sur trois axes majeurs du métier de linguiste de terrain : la collecte, la transcription et l'analyse.Les enregistrements audio sont primordiaux, puisqu'ils constituent le matériau source, le point de départ du travail de description. De plus, tel un instantané, ils représentent un objet précieux pour la documentation de la langue. Cependant, les outils actuels d'enregistrement n'offrent pas au linguiste la possibilité d'être efficace dans son travail et l'ensemble des appareils qu'il doit utiliser (enregistreur, ordinateur, microphone, etc.) peut devenir encombrant.Ainsi, nous avons développé LIG-AIKUMA, une application mobile de collecte de parole innovante, qui permet d'effectuer des enregistrements directement exploitables par les moteurs de reconnaissance automatique de la parole (RAP). Les fonctionnalités implémentées permettent d'enregistrer différents types de discours (parole spontanée, parole élicitée, parole lue) et de partager les enregistrements avec les locuteurs. L'application permet, en outre, la construction de corpus alignés << parole source (peu dotée)-parole cible (bien dotée) >>, << parole-image >>, << parole-vidéo >> qui présentent un intérêt fort pour les technologies de la parole, notamment pour l'apprentissage non supervisé.Bien que la collecte ait été menée de façon efficace, l'exploitation (de la transcription jusqu'à la glose, en passant par la traduction) de la totalité de ces enregistrements est impossible, tant la tâche est fastidieuse et chronophage.Afin de compléter l'aide apportée aux linguistes, nous proposons d'utiliser des techniques de traitement automatique de la langue pour lui permettre de tirer partie de la totalité de ses données collectées. Parmi celles-ci, la RAP peut être utilisée pour produire des transcriptions, d'une qualité satisfaisante, de ses enregistrements.Une fois les transcriptions obtenues, le linguiste peut s'adonner à l'analyse de ses données. Afin qu'il puisse procéder à l'étude de l'ensemble de ses corpus, nous considérons l'usage des méthodes d'alignement forcé. Nous démontrons que de telles techniques peuvent conduire à des analyses linguistiques fines. En retour, nous montrons que la modélisation de ces observations peut mener à des améliorations des systèmes de RAP.
	Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; Automatic speech processing; Automatic speech recognition; Collecte de données; Data collection; Documentation et description des langues; Études phonétiques; Language documentation and description; Large scale phonetics; Linguistique de terrain assisté par la machine; Machine-Assisted linguistic fieldwork; Reconnaissance automatique de la parole; Traitement automatique de la parole
	URL: https://tel.archives-ouvertes.fr/tel-01893309 https://tel.archives-ouvertes.fr/tel-01893309/file/GAUTHIER_2018_archivage.pdf https://tel.archives-ouvertes.fr/tel-01893309/document
	BASE
	Hide details

4	Challenges of building an authentic emotional speech corpus of spontaneous Japanese dialog
	Yoshiko Arimoto. - : Center for Corpus Development, National Institute for Japanese Language and Linguistics, 2018
	BASE
	Show details

5	LIG-AIKUMA: a Mobile App to Collect Parallel Speech for Under-Resourced Language Studies
	Gauthier, Elodie; Blachon, David; Besacier, Laurent...
	In: Interspeech 2016 proceedings ; Interspeech 2016 (short demo paper) ; https://hal.archives-ouvertes.fr/hal-01350062 ; Interspeech 2016 (short demo paper), Sep 2016, San-Francisco, France (2016)
	BASE
	Show details

6	Languages, Cultures, Media
	Osborne, John; Lewandowska-Tomaszczyk, Barbara; Kopytowska, Monika. - : HAL CCSD, 2016. : Université Savoie Mont-Blanc, 2016
	In: https://hal.archives-ouvertes.fr/hal-01412764 ; France. Langages (18), Université Savoie Mont-Blanc, pp.361, 2016, 978-2-919732-75-3 (2016)
	BASE
	Show details

7	Improving Data Selection for Low Resource STT and KWS
	Fraga-Silva, Thiago; Laurent,Antoine; Gauvain,Jean-Luc. - 2016
	BASE
	Show details

8	Machine Translation Based Data Augmentation for Cantonese Keyword Spotting (Author's Manuscript)
	Huang, Guangpu; Gorin,Arseniy; Gauvain,Jean-Luc. - 2016
	BASE
	Show details

9	Investigation of Back-off Based Interpolation Between Recurrent Neural Network and N-gram Language Models (Author's Manuscript)
	Chen, X; Liu,X; Gales,M J F. - 2016
	BASE
	Show details

10	Investigating Techniques for Low Resource Conversational Speech Recognition
	Laurent, Antoine; Fraga-Silva,Thiago; Lamel,Lori. - 2016
	BASE
	Show details

11	Code-switched English Pronunciation Modeling for Swahili Spoken Term Detection (Pub Version, Open Access)
	Kleynhans,Neil; Hartman,William; van Niekerk,Daniel. - 2016
	BASE
	Show details

12	A Multi-Sensor Helmet to Capture Rare Singing, an Intangible Cultural Heritage Study
	Al Kork, Samer K.; Jaumard-Hakoun, Aurore; Adda-Decker, Martine...
	In: 10th International Seminar on Speech Production (ISSP) ; https://hal.archives-ouvertes.fr/hal-01188296 ; 10th International Seminar on Speech Production (ISSP), May 2014, Cologne, Germany. pp.5-8 ; www.issp2014.uni-koeln.de (2014)
	BASE
	Show details

13	A Multi-Sensor Helmet To Capture Rare Singing, An Intangible Cultural Heritage Study ...
	Al Kork, Samer. - : Zenodo, 2014
	BASE
	Show details

14	Building a Database of Political Speech Does Culture Matter in Charisma Annotations?
	Hines, Andrew; Cullen, Ailbhe; Harte, Naomi
	In: Conference papers (2014)
	BASE
	Show details

15	Open-Source Multi-Language Audio Database for Spoken Language Processing Applications
	Zahorian, Stephen
	In: DTIC (2012)
	BASE
	Show details

16	Dummheit als Methode: eine dramatologische Textinterpretation
	Hitzler, Ronald
	In: Qualitativ-empirische Sozialforschung : Konzepte, Methoden, Analysen ; 295-318 (2012)
	BASE
	Show details

17	Identität in Erzählung und im Erzählen: Versuch einer Bestimmung der Besonderheit des narrativen Diskurses für die sprachliche Verfassung von ldentität
	Bamberg, Michael
	In: Journal für Psychologie ; 7 ; 1 ; 43-55 (2012)
	BASE
	Show details

18	The Duplicity of Testimonial Interviews—Unfolding and Utilising Multiple Temporalisation in Compound Procedures and Projects
	Scheffer, Thomas
	In: Forum Qualitative Sozialforschung / Forum: Qualitative Social Research ; 8 ; 1 ; 11 (2012)
	BASE
	Show details

19	Ein Kategoriensystem zur Wahrnehmung und Kodierung sprachlicher Diskriminierung
	Galliker, Mark; Wagner, Franc
	In: Journal für Psychologie ; 3 ; 33-43 (2012)
	BASE
	Show details

20	The impact of sound-field systems on learning and attention in elementary school classrooms.
	Dockrell, JE; Shield, B
	In: J Speech Lang Hear Res , 55 (4) pp. 1163-1176. (2012) (2012)
	BASE
	Show details

Page: 1 2

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern