1 |
Data-driven voice source waveform analysis and synthesis
|
|
|
|
In: http://www.commsp.ee.ic.ac.uk/~mrt102/publications/Gudnason2009b.pdf (2012)
|
|
BASE
|
|
Show details
|
|
2 |
Data-driven voice source waveform modelling
|
|
|
|
In: http://www.commsp.ee.ic.ac.uk/~jg/papers/Thomas2009a.pdf (2009)
|
|
BASE
|
|
Show details
|
|
3 |
Data-driven voice source waveform modelling
|
|
|
|
In: http://www.commsp.ee.ic.ac.uk/~mrt102/publications/Thomas2009.pdf (2009)
|
|
BASE
|
|
Show details
|
|
4 |
Application of the DYPSA Algorithm to Segmented Time-Scale Modification of Speech
|
|
|
|
In: http://www.eurasip.org/Proceedings/Eusipco/Eusipco2008/papers/1569101951.pdf (2008)
|
|
BASE
|
|
Show details
|
|
5 |
Application of the DYPSA Algorithm to Segmented Time-Scale Modification of Speech
|
|
|
|
In: http://www.commsp.ee.ic.ac.uk/~mrt102/publications/Thomas2008b.pdf (2008)
|
|
Abstract:
This paper presents a method for speech time scale modification. Voiced speech is pseudo-periodic, allowing time scale modification by the repetition or removal of cycles as necessary. However, in the case of unvoiced speech and at the boundaries of voiced speech, no such periodicity exists so the speech should not be modified. To address this issue, the proposed approach is novel in its use of the DYPSA algorithm to derive speech periodicity from glottal closure instants (GCIs), followed by a Gaussian Mixture model-based voiced/unvoiced/silence (VUS) classifier. A listening test based on ITU-T P800 has been conducted and has shown that, by employing VUS detection, the average mean opinion score of the perceptual quality of processed speech exceeds that of a method without VUS detection by 0.61 over a range of modification factors. Results are presented as a function of modification factor for normal and fast original talking rate. Reliable time scale modification of high audio quality enables many applications, such as time scale compression for fast scanning of recorded voicemail messages, slowing talking rate for improved intelligibility in forensics and lip synchronization in motion video. 1.
|
|
URL: http://www.commsp.ee.ic.ac.uk/~mrt102/publications/Thomas2008b.pdf http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.413.387
|
|
BASE
|
|
Hide details
|
|
6 |
Application of the DYPSA algorithm to segmented time-scale modification of speech
|
|
|
|
In: http://www.ee.ic.ac.uk/hp/staff/pnaylor/PDFs/Thomas2008.pdf (2008)
|
|
BASE
|
|
Show details
|
|
7 |
Estimation of Glottal Closure Instants in Voiced Speech using the DYPSA Algorithm
|
|
|
|
In: https://spiral.imperial.ac.uk:8443/bitstream/10044/1/678/1/Estimation of glottal closure.pdf
|
|
BASE
|
|
Show details
|
|
8 |
VOICE SOURCE ESTIMATION FOR ARTIFICIAL BANDWIDTH EXTENSION OF TELEPHONE SPEECH
|
|
|
|
In: http://www.commsp.ee.ic.ac.uk/~mrt102/publications/Thomas2010a.pdf
|
|
BASE
|
|
Show details
|
|
9 |
Voice Source Waveform Analysis and Synthesis using Principal Component Analysis and Gaussian Mixture Modelling
|
|
|
|
In: http://labrosa.ee.columbia.edu/~dpwe/pubs/GudTNE09-voicesource.pdf
|
|
BASE
|
|
Show details
|
|
10 |
DATA-DRIVEN VOICE SOURCE WAVEFORM MODELLING
|
|
|
|
In: http://www.ee.ic.ac.uk/hp/staff/pnaylor/PDFs/Thomas2009a.pdf
|
|
BASE
|
|
Show details
|
|
|
|