DE eng

Search in the Catalogues and Directories

Hits 1 – 3 of 3

1
Transcribing Mandarin broadcast speech using multi-layer perceptron acoustic features
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 19 (2011) 8, 2439-2450
BLLDB
OLC Linguistik
Show details
2
Stylization of Pitch with Syllable-Based Linear Segments
Abstract: Fundamental frequency contours for speech, as obtained by common pitch tracking algorithms, contain a great deal of fine detail that is unlikely to hold much perceptual significance for listeners. In our experiments, a radically reduced pitch contour consisting of a single linear segment for each syllable was found to judged as equally natural as the original pitch track by listeners, based on high-quality analysis-synthesis. We describe the algorithms both for segmenting speech into syllables based on fitting Gaussians to the energy envelope, and for approximating the pitch contour by independent linear segments for each syllable. We report our web-based test in which 40 listeners compared the stylized pitch contour resyntheses to equivalent resyntheses based on the original pitch track, and also to pitch tracks stylized by the existing Momel algorithm. Listeners preferred the original pitch contour to the linear approximation in only 60% of cases, where 50% would indicate random guessing. By contrast, the original was preferred over Momel in 74% of cases.
Keyword: Artificial intelligence; Electrical engineering
URL: https://doi.org/10.7916/D80Z7CPV
BASE
Hide details
3
Stylization of Pitch with Syllable-Based Linear Segments ...
Ravuri, Suman; Ellis, Daniel P. W.. - : Columbia University, 2008
BASE
Show details

Catalogues
0
0
1
0
0
0
0
Bibliographies
1
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
2
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern