1 |
MAGIC DUST FOR CROSS-LINGUAL ADAPTATION OF MONOLINGUAL WAV2VEC-2.0
|
|
|
|
In: ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03544515 ; ICASSP 2022, May 2022, Singapour, Singapore (2022)
|
|
BASE
|
|
Show details
|
|
2 |
End-to-end speaker segmentation for overlap-aware resegmentation
|
|
|
|
In: Interspeech 2021 ; https://hal-univ-lemans.archives-ouvertes.fr/hal-03257524 ; Interspeech 2021, Aug 2021, Brno, Czech Republic ; https://www.interspeech2021.org/ (2021)
|
|
BASE
|
|
Show details
|
|
3 |
Transdisciplinary Analysis of a Corpus of French Newsreels: The ANTRACT Project
|
|
|
|
In: ISSN: 1938-4122 ; Digital Humanities Quarterly ; https://hal.archives-ouvertes.fr/hal-03166755 ; Digital Humanities Quarterly, Alliance of Digital Humanities, 2021, Special Issue on AudioVisual Data in DH, 15 (1) ; http://digitalhumanities.org/dhq/ (2021)
|
|
BASE
|
|
Show details
|
|
4 |
Where are we in Named Entity Recognition from Speech?
|
|
|
|
In: 12th International Conference on Language Resources and Evaluation (LREC) ; https://hal.archives-ouvertes.fr/hal-02475026 ; 12th International Conference on Language Resources and Evaluation (LREC), May 2020, Marseille, France ; https://aclanthology.org/2020.lrec-1.556/ (2020)
|
|
BASE
|
|
Show details
|
|
5 |
A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning
|
|
|
|
In: Interspeech 2020 ; https://hal.archives-ouvertes.fr/hal-02912029 ; Interspeech 2020, Oct 2020, Shanghai, China (2020)
|
|
BASE
|
|
Show details
|
|
6 |
Collective memory shapes the organization of individual memories in the medial prefrontal cortex
|
|
|
|
In: EISSN: 2397-3374 ; Nature Human Behaviour ; https://halshs.archives-ouvertes.fr/halshs-02416130 ; Nature Human Behaviour, Nature Research 2019, ⟨10.1038/s41562-019-0779-z⟩ (2019)
|
|
BASE
|
|
Show details
|
|
7 |
Effective keyword search for low-resourced conversational speech
|
|
|
|
In: icassp 2017 ; https://hal.archives-ouvertes.fr/hal-01744176 ; icassp 2017, IEEE, Mar 2017, La Nouvelle Orléans, United States (2017)
|
|
BASE
|
|
Show details
|
|
8 |
Language Recognition for Dialects and Closely Related Languages
|
|
|
|
In: Odyssey 2016 ; https://hal.archives-ouvertes.fr/hal-01744188 ; Odyssey 2016, Jun 2016, Bilbao, Spain (2016)
|
|
BASE
|
|
Show details
|
|
9 |
Boosting bonsai trees for efficient features combination : application to speaker role identification
|
|
|
|
In: Interspeech ; https://hal.inria.fr/hal-01025171 ; Interspeech, Sep 2014, Singapour, Singapore (2014)
|
|
BASE
|
|
Show details
|
|
10 |
Improving recognition of proper nouns (in ASR) through generation and filtering of phonetic transcriptions
|
|
|
|
In: ISSN: 0885-2308 ; EISSN: 1095-8363 ; Computer Speech and Language ; https://hal.archives-ouvertes.fr/hal-01433238 ; Computer Speech and Language, Elsevier, 2014, 28 (4), pp.979-996. ⟨10.1016/j.csl.2014.02.006⟩ (2014)
|
|
Abstract:
International audience ; Accurate phonetic transcription of proper nouns can be an important resource for commercial applications that embed speech technologies, such as audio indexing and vocal phone directory lookup. However, an accurate phonetic transcription is more difficult to obtain for proper nouns than for regular words. Indeed, phonetic transcription of a proper noun depends on both the origin of the speaker pronouncing it and the origin of the proper noun itself.This work proposes a method that allows the extraction of phonetic transcriptions of proper nouns using actual utterances of those proper nouns, thus yielding transcriptions based on practical use instead of mere pronunciation rules.The proposed method consists in a process that first extracts phonetic transcriptions, and then iteratively filters them. In order to initialize the process, an alignment dictionary is used to detect word boundaries. A rule-based grapheme-to-phoneme generator (LIA_PHON), a knowledge-based approach (JSM), and a Statistical Machine Translation based system were evaluated for this alignment. As a result, compared to our reference dictionary (BDLEX supplemented by LIA_PHON for missing words) on the ESTER 1 French broadcast news corpus, we were able to significantly decrease the Word Error Rate (WER) on segments of speech with proper nouns, without negatively affecting the WER on the rest of the corpus.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; G2P; Moses; Phonetic transcription; Proper nouns; SMT; Speech recognition
|
|
URL: https://hal.archives-ouvertes.fr/hal-01433238/file/CSL_antoine.pdf https://hal.archives-ouvertes.fr/hal-01433238/document https://hal.archives-ouvertes.fr/hal-01433238 https://doi.org/10.1016/j.csl.2014.02.006
|
|
BASE
|
|
Hide details
|
|
11 |
Acoustics-Based Phonetic Transcription Method for Proper Nouns
|
|
|
|
In: International Conference on Spoken Language Processing (ISCA, Interspeech 2010) ; https://hal.archives-ouvertes.fr/hal-01433899 ; International Conference on Spoken Language Processing (ISCA, Interspeech 2010), 2010, Japon (Makuhari), Unknown Region (2010)
|
|
BASE
|
|
Show details
|
|
12 |
Iterative filtering of phonetic transcriptions of proper nouns
|
|
|
|
In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2009) ; https://hal.archives-ouvertes.fr/hal-01433945 ; IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2009), 2009, Taipei, Taiwan. pp.4265--4268 (2009)
|
|
BASE
|
|
Show details
|
|
13 |
Grapheme to phoneme conversion using an SMT system
|
|
|
|
In: INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION. ANNUAL CONFERENCE. 10TH 2009. (INTERSPEECH 2009) ; 10th Annual Conference of the International Speech Communication Association 2009 (INTERSPEECH 2009) ; https://hal.archives-ouvertes.fr/hal-01451534 ; 10th Annual Conference of the International Speech Communication Association 2009 (INTERSPEECH 2009) , Sep 2009, Brighton, United Kingdom. pp.716-719 (2009)
|
|
BASE
|
|
Show details
|
|
14 |
Combinaison de systèmes pour la phonétisation automatique de noms propres
|
|
|
|
In: XXVIIe Journées d'étude sur la parole (JEP 2008) ; https://hal.archives-ouvertes.fr/hal-01450912 ; XXVIIe Journées d'étude sur la parole (JEP 2008), Jun 2008, Avignon, France. pp.4 (2008)
|
|
BASE
|
|
Show details
|
|
15 |
Combined systems for automatic phonetic transcription of proper nouns
|
|
|
|
In: LREC 2008 Proceedings ; 6th Language Evaluation and Resources Conference (LREC 2008) ; https://hal.archives-ouvertes.fr/hal-01433960 ; 6th Language Evaluation and Resources Conference (LREC 2008), May 2008, Marrakech, Morocco. pp.1791-1795 ; http://www.lrec-conf.org/lrec2008/ (2008)
|
|
BASE
|
|
Show details
|
|
|
|