
Search in the Catalogues and Directories

Page: 1 2 3 4 5...41
Hits 1 – 20 of 801

1
Integrating a Phrase Structure Corpus Grammar and a Lexical-Semantic Network: the HOLINET Knowledge Graph
In: Proceedings of LREC 2022 ; https://hal-amu.archives-ouvertes.fr/hal-03655636 ; Proceedings of LREC 2022, Jun 2022, Marseille, France (2022)
BASE
2
Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding
In: ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.archives-ouvertes.fr/hal-03578503 ; ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapore, Singapore (2022)
BASE
3
What Do Cognitive Networks Do? Simulations of Spoken Word Recognition Using the Cognitive Network Science Approach
BASE
4
Sequential and network analyses to describe multiple signal use in captive mangabeys
In: ISSN: 0003-3472 ; EISSN: 1095-8282 ; Animal Behaviour ; https://hal.archives-ouvertes.fr/hal-03480471 ; Animal Behaviour, Elsevier Masson, 2021, 182, pp.203-226. ⟨10.1016/j.anbehav.2021.09.005⟩ (2021)
BASE
5
Linking an Abstract Corpus Grammar to a Lexical Semantic Network
In: https://hal.archives-ouvertes.fr/hal-03552630 ; [Research Report] Laboratoire Parole et Langage – Université d’Aix-Marseille. 2021 (2021)
BASE
6
Coupler syntaxe et sémantique dans une même base de connaissances linguistiques [Coupling syntax and semantics in a single linguistic knowledge base]
In: https://hal.archives-ouvertes.fr/hal-03552622 ; [Research Report] Laboratoire Parole et Langage – Université d’Aix-Marseille. 2021 (2021)
BASE
7
Database of word-level statistics for Mandarin Chinese (DoWLS-MAN)
In: ISSN: 1554-351X ; EISSN: 1554-3528 ; Behavior Research Methods ; https://hal.archives-ouvertes.fr/hal-03328510 ; Behavior Research Methods, Psychonomic Society, Inc, In press, ⟨10.3758/s13428-021-01620-7⟩ (2021)
BASE
8
What does the Canary Say? Low-Dimensional GAN Applied to Birdsong
In: https://hal.inria.fr/hal-03244723 ; 2021 (2021)
BASE
9
Modeling the neural network responsible for song learning ; Modélisation du réseau neuronal responsable de l'apprentissage du chant chez l'oiseau chanteur
Pagliarini, Silvia. - : HAL CCSD, 2021
In: https://tel.archives-ouvertes.fr/tel-03217834 ; Modeling and Simulation. Université de Bordeaux, 2021. English. ⟨NNT : 2021BORD0107⟩ (2021)
BASE
10
Cortical basis of vocalization in behaving freely moving minipigs ; Bases corticales de la vocalisation chez le miniporc en comportement
Palma, Marie. - : HAL CCSD, 2021
In: https://tel.archives-ouvertes.fr/tel-03353386 ; Neuroscience. Université Grenoble Alpes [2020-.], 2021. English. ⟨NNT : 2021GRALS013⟩ (2021)
BASE
11
Automatic Classification of Phonation Types in Spontaneous Speech: Towards a New Workflow for the Characterization of Speakers’ Voice Quality
In: Interspeech 2021 ; https://hal.archives-ouvertes.fr/hal-03334492 ; Interspeech 2021, Aug 2021, Brno, Czech Republic. pp.1015-1018, ⟨10.21437/Interspeech.2021-1765⟩ (2021)
BASE
12
Resting functional connectivity in the semantic appraisal network predicts accuracy of emotion identification.
Yang, Winson FZ; Toller, Gianina; Shdo, Suzanne. - : eScholarship, University of California, 2021
BASE
13
Development of thalamus mediates paternal age effect on offspring reading: A preliminary investigation.
In: Human brain mapping, vol 42, iss 14 (2021)
BASE
14
Dataset of coronavirus content from Instagram with an exploratory analysis
In: ISSN: 2169-3536 ; EISSN: 2169-3536 ; IEEE Access ; https://hal.archives-ouvertes.fr/hal-03559489 ; IEEE Access, IEEE, 2021, 9, pp.157192-157202. ⟨10.1109/ACCESS.2021.3126552⟩ (2021)
BASE
15
Phoneme-to-Audio Alignment with Recurrent Neural Networks for Speaking and Singing Voice
In: Proceedings of Interspeech 2021 ; https://hal.archives-ouvertes.fr/hal-03552964 ; Proceedings of Interspeech 2021, International Speech Communication Association, Aug 2021, Brno, Czech Republic. pp.61-65, ⟨10.21437/interspeech.2021-1676⟩ ; https://www.interspeech2021.org/ (2021)
Abstract: Phoneme-to-audio alignment is the task of synchronizing voice recordings and their related phonetic transcripts. In this work, we introduce a new system for forced phonetic alignment with Recurrent Neural Networks (RNN). With the Connectionist Temporal Classification (CTC) loss as training objective, and an additional reconstruction cost, we learn to infer relevant per-frame phoneme probabilities from which alignment is derived. The core of the neural architecture is a context-aware attention mechanism between mel-spectrograms and side information. We investigate two contexts given by either phoneme sequences (model PHATT) or spectrograms themselves (model SPATT). Evaluations show that these models produce precise alignments for both speaking and singing voice. Best results are obtained with the model PHATT, which outperforms the baseline reference with an average imprecision of 16.3 ms and 29.8 ms on speech and singing, respectively. The model SPATT also appears as an interesting alternative, capable of aligning longer audio files without requiring phoneme sequences on small audio segments.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; Connectionist Temporal Classification; phoneme-to-audio alignment; recurrent neural network; voice analysis
URL: https://hal.archives-ouvertes.fr/hal-03552964
https://hal.archives-ouvertes.fr/hal-03552964/document
https://doi.org/10.21437/interspeech.2021-1676
https://hal.archives-ouvertes.fr/hal-03552964/file/1676anav.pdf
BASE
16
Improving Machine Translation of Arabic Dialects through Multi-Task Learning
In: 20th International Conference Italian Association for Artificial Intelligence:AIxIA 2021 ; https://hal.archives-ouvertes.fr/hal-03435996 ; 20th International Conference Italian Association for Artificial Intelligence:AIxIA 2021, Dec 2021, MILAN/Virtual, Italy (2021)
BASE
17
Document Domain Randomization for Deep Learning Document Layout Extraction
In: Proceedings of the 16th International Conference on Document Analysis and Recognition (ICDAR, September 5--10, Lausanne, Switzerland) ; https://hal.inria.fr/hal-03336444 ; Proceedings of the 16th International Conference on Document Analysis and Recognition (ICDAR, September 5--10, Lausanne, Switzerland), Sep 2021, Lausanne, Switzerland. pp.497-513, ⟨10.1007/978-3-030-86549-8_32⟩ (2021)
BASE
18
End-to-End Speech Emotion Recognition: Challenges of Real-Life Emergency Call Centers Data Recordings
In: ISBN: 978-1-6654-0019-0 ; 2021 9th International Conference on Affective Computing and Intelligent Interaction (ACII) ; https://hal.archives-ouvertes.fr/hal-03405970 ; 2021 9th International Conference on Affective Computing and Intelligent Interaction (ACII), Sep 2021, Nara, Japan ; https://www.acii-conf.net/2021/ (2021)
BASE
19
ISIDORE celebrates its 10th anniversary ...
BASE
20
ISIDORE celebrates its 10th anniversary ...
BASE


Catalogues: 3
Bibliographies: 4
Linked Open Data catalogues: 0
Online resources: 0
Open access documents: 797