3 |
Whispered Speech Conversion Based on the Inversion of Mel Frequency Cepstral Coefficient Features
|
|
|
|
In: Algorithms; Volume 15; Issue 2; Pages: 68 (2022)
|
|
Abstract:
A conversion method based on the inversion of Mel frequency cepstral coefficient (MFCC) features was proposed to convert whispered speech into normal speech. First, the MFCC features of whispered speech and normal speech were extracted and a matching relation between the MFCC feature parameters of whispered speech and normal speech was developed through the Gaussian mixture model (GMM). Then, the MFCC feature parameters of normal speech corresponding to whispered speech were obtained based on the GMM and, finally, whispered speech was converted into normal speech through the inversion of MFCC features. The experimental results showed that the cepstral distortion (CD) of the normal speech converted by the proposed method was 21% less than that of the normal speech converted by the linear predictive coefficient (LPC) features, the mean opinion score (MOS) was 3.56, and a satisfactory outcome in both intelligibility and sound quality was achieved.
|
|
Keyword:
cepstral distortion; Gaussian mixture model; MFCC feature inversion; whispered speech conversion
|
|
URL: https://doi.org/10.3390/a15020068
|
|
BASE
|
|
Hide details
|
|
4 |
"El inglés me hizo sentirme orgulloso de mí mismo" : La evolución de las identidades imaginadas de los estudiantes de Grado de Educación Primaria en inglés
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Some stylistic aspects of the discourse reported at Patrick Modiano ; Quelques aspects stylistiques du discours rapporté chez Patrick Modiano
|
|
|
|
In: ISSN: 2789-3588 ; NTELA, Revue du Centre Universitaire de Recherche sur l’Afrique (CURA) ; https://hal.inrae.fr/hal-03586579 ; NTELA, Revue du Centre Universitaire de Recherche sur l’Afrique (CURA), Revue du Centre Universitaire de Recherche sur l’Afrique (CURA, 2021, 2 (02) (2021)
|
|
BASE
|
|
Show details
|
|
7 |
Language effects in early development of number writing and reading ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Supplementary materials [Review] to: Language effects in early development of number writing and reading ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Temporal Convolution Network Based Joint Optimization of Acoustic-to-Articulatory Inversion
|
|
|
|
In: Applied Sciences ; Volume 11 ; Issue 19 (2021)
|
|
BASE
|
|
Show details
|
|
16 |
Gene Expression Imputation Across Multiple Tissue Types Provides Insight Into the Genetic Architecture of Frontotemporal Dementia and Its Clinical Subtypes
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Subject-auxiliary inversion in academic prose
|
|
|
|
In: Ibérica, Iss 42, Pp 59-84 (2021) (2021)
|
|
BASE
|
|
Show details
|
|
19 |
A robust neural familiar face recognition response in a dynamic (periodic) stream of unfamiliar faces
|
|
|
|
In: ISSN: 0010-9452 ; Cortex ; https://hal.archives-ouvertes.fr/hal-03491617 ; Cortex, Elsevier, 2020, 132, pp.281-295. ⟨10.1016/j.cortex.2020.08.016⟩ (2020)
|
|
BASE
|
|
Show details
|
|
20 |
Subordinate Interrogatives and Subordination in Oral Speech: Syntax and Prosody ; La subordonnée interrogative et la subordination en anglais oral spontané: syntaxe et prosodie
|
|
|
|
In: ISSN: 2118-9692 ; EISSN: 0246-8743 ; Linx ; https://hal.archives-ouvertes.fr/hal-03460479 ; Linx, Presses Universitaires de Paris Nanterre, 2020, 80, ⟨10.4000/linx.6362⟩ (2020)
|
|
BASE
|
|
Show details
|
|
|
|