1 |
A Swiss German Dictionary: Variation in Speech and Writing ...
4 |
The SUMMA Platform Prototype
In: http://infoscience.epfl.ch/record/233575 (2017)
5 |
Comparative Study on Sentence Boundary Prediction for German and English Broadcast News
In: http://infoscience.epfl.ch/record/229982 (2017)
7 |
Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding ...
8 |
Investigating Cross-lingual Multi-level Adaptive Networks: The Importance of the Correlation of Source and Target Languages
In: http://infoscience.epfl.ch/record/223756 (2016)
9 |
Speech vocoding for laboratory phonology
In: http://infoscience.epfl.ch/record/222447 (2016)
10 |
The SIWIS database: a multilingual speech database with acted emphasis
In: http://infoscience.epfl.ch/record/218855 (2016)
11 |
Incremental Syllable-Context Phonetic Vocoding
In: http://infoscience.epfl.ch/record/206809 (2015)
12 |
Incremental Syllable-Context Phonetic Vocoding
In: http://infoscience.epfl.ch/record/207487 (2015)
Abstract:
Current very low bit rate speech coders are, due to complexity limitations, designed to work off-line. This paper investigates incremental speech coding that operates in real time and incrementally (i.e., encoded speech depends only on already-uttered speech, without the need for future speech information). Since human speech communication is asynchronous (i.e., different information flows are processed simultaneously), we hypothesized that such an incremental speech coder should also operate asynchronously. To accomplish this task, we describe speech coding that reflects human cortical temporal sampling, which packages information into units of different temporal granularity, such as phonemes and syllables, in parallel. More specifically, a phonetic vocoder (cascaded speech recognition and synthesis systems) extended with syllable-based information transmission mechanisms is investigated. Two main aspects are evaluated in this work: synchronous and asynchronous coding. Synchronous coding refers to the case when the phonetic vocoder and speech generation process depend on the syllable boundaries during encoding and decoding, respectively. Asynchronous coding, on the other hand, refers to the case when the phonetic encoding and speech generation processes are done independently of the syllable boundaries. Our experiments confirmed that asynchronous incremental speech coding performs better in terms of intelligibility and overall speech quality, mainly due to better alignment of the segmental and prosodic information. The proposed vocoding operates at an uncompressed bit rate of 213 bits/sec and achieves an average communication delay of 243 ms.
URL: http://www.idiap.ch/paper/3107 http://infoscience.epfl.ch/record/207487 http://infoscience.epfl.ch/record/207487/files/Cernak_TASLP_2015.pdf https://doi.org/10.1109/TASLP.2015.2418577 http://publications.idiap.ch/index.php/publications/showcite/Cernak_Idiap-RR-05-2015
13 |
Speech vocoding for laboratory phonology
In: http://infoscience.epfl.ch/record/207945 (2015)
14 |
Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding
In: http://infoscience.epfl.ch/record/200300 (2014)
15 |
Syllable-based Regional Swiss French Accent Identification using Prosodic Features
In: http://infoscience.epfl.ch/record/199821 (2014)
16 |
Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding
In: http://infoscience.epfl.ch/record/199810 (2014)
17 |
Prosody in Swiss French Accents: Investigation using Analysis by Synthesis
In: http://infoscience.epfl.ch/record/198853 (2014)