
Search in the Catalogues and Directories

Hits 1 – 20 of 88

1
Voice Restoration with Silent Speech Interfaces (ReSSInt)
BASE
2
Speech Synthesis from ECoG using Densely Connected 3D Convolutional Neural Networks
In: J Neural Eng (2019)
Abstract: OBJECTIVE: Direct synthesis of speech from neural signals could provide a fast and natural way of communication to people with neurological diseases. Invasively-measured brain activity (electrocorticography; ECoG) supplies the necessary temporal and spatial resolution to decode fast and complex processes such as speech production. A number of impressive advances in speech decoding using neural signals have been achieved in recent years, but the complex dynamics are still not fully understood. However, it is unlikely that simple linear models can capture the relation between neural activity and continuous spoken speech. APPROACH: Here we show that deep neural networks can be used to map ECoG from speech production areas onto an intermediate representation of speech (logMel spectrogram). The proposed method uses a densely connected convolutional neural network topology which is well-suited to work with the small amount of data available from each participant. MAIN RESULTS: In a study with six participants, we achieved correlations up to r = 0.69 between the reconstructed and original logMel spectrograms. We transferred our prediction back into an audible waveform by applying a Wavenet vocoder. The vocoder was conditioned on logMel features that harnessed a much larger, pre-existing data corpus to provide the most natural acoustic output. SIGNIFICANCE: To the best of our knowledge, this is the first time that high-quality speech has been reconstructed from neural recordings during speech production using deep neural networks.
Keyword: Article
URL: https://doi.org/10.1088/1741-2552/ab0c59
http://www.ncbi.nlm.nih.gov/pubmed/30831567
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6822609/
BASE
3
Generating Natural, Intelligible Speech From Brain Activity in Motor, Premotor, and Inferior Frontal Cortices
Herff, Christian; Diener, Lorenz; Angrick, Miguel. - : Frontiers Media S.A., 2019
BASE
4
Automatic Speech Recognition from Neural Signals: A Focused Review
Herff, Christian; Schultz, Tanja. - : Frontiers Media S.A., 2016
BASE
5
Brain-to-text: decoding spoken phrases from phone representations in the brain
Herff, Christian; Heger, Dominic; de Pesters, Adriana. - : Frontiers Media S.A., 2015
BASE
6
Automatic speech recognition for under-resourced languages: A survey
In: Speech communication. - Amsterdam [u.a.] : Elsevier 56 (2014), 85-100
OLC Linguistik
7
Web-based tools and methods for rapid pronunciation dictionary creation
In: Speech communication. - Amsterdam [u.a.] : Elsevier 56 (2014), 101-118
OLC Linguistik
8
Introduction to the special issue on processing under-resourced languages
In: Speech communication. - Amsterdam [u.a.] : Elsevier 56 (2014), 83-84
OLC Linguistik
9
Multilingual Deep Neural Network based Acoustic Modeling For Rapid Language Adaptation
In: http://infoscience.epfl.ch/record/198446 (2014)
BASE
10
Integration of Language Identification into a Recognition System for Spoken Conversations Containing Code-Switches ...
Weiner, Jochen; Vu, Ngoc Thang; Telaar, Dominic. - : Carnegie Mellon University, 2012
BASE
11
Integration of Language Identification into a Recognition System for Spoken Conversations Containing Code-Switches ...
Weiner, Jochen; Vu, Ngoc Thang; Telaar, Dominic. - : Carnegie Mellon University, 2012
BASE
12
An Investigation on Initialization Schemes for Multilayer Perceptron Training Using Multilingual Data and Their Effect on ASR Performance ...
Vu, Ngoc Thang; Wojtek Breiter; Metze, Florian. - : Carnegie Mellon University, 2012
BASE
13
An Investigation on Initialization Schemes for Multilayer Perceptron Training Using Multilingual Data and Their Effect on ASR Performance ...
Vu, Ngoc Thang; Wojtek Breiter; Metze, Florian. - : Carnegie Mellon University, 2012
BASE
14
Multilingual Bottle-Neck Features and its Application for Under-Resourced Languages ...
Vu, Ngoc Thang; Metze, Florian; Schultz, Tanja. - : Carnegie Mellon University, 2012
BASE
15
Multilingual Bottle-Neck Features and its Application for Under-Resourced Languages ...
Vu, Ngoc Thang; Metze, Florian; Schultz, Tanja. - : Carnegie Mellon University, 2012
BASE
16
Modeling Coarticulation in EMG-based Continuous Speech Recognition
In: Speech Communication, 52 (4), 341-353 ; ISSN: 0167-6393 (2012)
BASE
17
Analysis of Dialectal Influence in Pan-Arabic ASR ...
Udhyakumar Nallasamy; Garbus, Michael; Metze, Florian. - : Carnegie Mellon University, 2011
BASE
18
Analysis of Dialectal Influence in Pan-Arabic ASR ...
Udhyakumar Nallasamy; Garbus, Michael; Metze, Florian. - : Carnegie Mellon University, 2011
BASE
19
Modeling coarticulation in EMG-based continuous speech recognition
In: Speech communication. - Amsterdam [u.a.] : Elsevier 52 (2010) 4, 341-353
BLLDB
OLC Linguistik
20
Silent speech interfaces
In: Speech communication. - Amsterdam [u.a.] : Elsevier 52 (2010) 4, 270-287
BLLDB
OLC Linguistik

© 2013 - 2024 Lin|gu|is|tik