DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 23

1
Low-latency speaker spotting with online diarization and detection
In: The Speaker and Language Recognition Workshop ; https://hal.archives-ouvertes.fr/hal-01836490 ; The Speaker and Language Recognition Workshop, ISCA, Jun 2018, Les Sables d'Olonne, France (2018)
BASE
Show details
2
Combining Speaker Turn Embedding and Incremental Structure Prediction for Low-Latency Speaker Diarization
In: Interspeech 2017, 18th Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01690162 ; Interspeech 2017, 18th Annual Conference of the International Speech Communication Association, Aug 2017, Stockholm, Sweden. ⟨10.21437/Interspeech.2017-1067⟩ (2017)
BASE
Show details
3
Multimodal Person Discovery in Broadcast TV: lessons learned from MediaEval 2015
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690581 ; Multimedia Tools and Applications, Springer Verlag, 2017, 76 (21), pp.22547 - 22567. ⟨10.1007/s11042-017-4730-x⟩ (2017)
BASE
Show details
4
Benchmarking Multimedia Technologies with the CAMOMILE Platform: the Case of Multimodal Person Discovery at MediaEval 2015
In: LREC 2016 ; https://hal.archives-ouvertes.fr/hal-01690277 ; LREC 2016, May 2016, Portorož, Slovenia (2016)
BASE
Show details
5
The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents
In: Proceedings of LREC 2016 ; LREC 2016 Conference ; https://hal.archives-ouvertes.fr/hal-01350096 ; LREC 2016 Conference, May 2016, Portoroz, Slovenia (2016)
BASE
Show details
6
Lexical speaker identification in TV shows
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690342 ; Multimedia Tools and Applications, Springer Verlag, 2015, 74 (4), pp.1377 - 1396. ⟨10.1007/s11042-014-1940-3⟩ (2015)
BASE
Show details
7
Collaborative Annotation for Person Identification in TV Shows
In: Interspeech 2015 (short demo paper) ; https://hal.archives-ouvertes.fr/hal-01170513 ; Interspeech 2015 (short demo paper), Sep 2015, Dresden, Germany (2015)
BASE
Show details
8
TVD: a reproducible and multiply aligned TV series dataset
In: LREC 2014 ; https://hal.archives-ouvertes.fr/hal-01690279 ; LREC 2014, May 2014, Reykjavik, Iceland (2014)
BASE
Show details
9
Study of vowels and Voice Strength by Discriminant Analysis ; Etude des voyelles et de la force de voix par analyse discriminante
In: ISCA JEP2014 ; 30emes Journees d'Etude sur la Parole ; https://hal.archives-ouvertes.fr/hal-01885618 ; 30emes Journees d'Etude sur la Parole, ISCA AFCP, Jun 2014, Le Mans, France (2014)
Abstract: National audience ; Vocal Effort, represented here by an objective intensity measurement called VoiceStrength, is both a speech variability factor, and a physical quantity used by interlocutorsin order to exchange some types of information in a given situation. The present studydeals with the acoustical features coding this information in the vowel spectrum.Discriminant Analysis is used to identify the vowels despite the variability due to the voicestrength, and to estimate the voice strength despite the variability of the vowels. Each ofthose experiments is performed on two different databases. The results show that voicestrength may be estimated from the vowel spectra, and that knowing the voice strengthimproves the classification of the vowels. ; L'effort vocal, représenté ici par une mesure d'intensité objective appelée force de voix, est à la fois un facteur de variabilité de la parole et une grandeur acoustique utilisée par les interlocuteurs pour échanger diverses informations dans une situation donnée. La présente étude s'intéresse aux indices acoustiques codant ces informations dans le spectre des voyelles. L'Analyse Discriminante est mise en oeuvre pour caractériser les voyelles malgré la variabilité imputable à la force de voix, et pour estimer la force de voix en dépit de la variabilité imputable à la voyelle. Les résultats, établis sur deux bases de données différentes, montrent que la force de voix peut être estimée à partir du spectre des voyelles et que la connaissance préalable de la force de voix permet d'améliorer la classification des voyelles.
Keyword: [SHS.INFO]Humanities and Social Sciences/Library and information sciences; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; acoustic analysis; analyse acoustique; analyse discriminante; communication orale; discriminant analysis; effort vocal; interactions voix-parole; oral communication; parole; speech; vocal effort; Voice; voice-speech interactions; Voix; voyelles
URL: https://hal.archives-ouvertes.fr/hal-01885618
https://hal.archives-ouvertes.fr/hal-01885618/file/JEP14%20LeMans.pdf
https://hal.archives-ouvertes.fr/hal-01885618/document
BASE
Hide details
10
Combination of Cepstral and Phonetically Discriminative Features for Speaker Verification
In: ISSN: 1070-9908 ; IEEE Signal Processing Letters ; https://hal.archives-ouvertes.fr/hal-01690336 ; IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers, 2014, 21 (9), pp.1040 - 1044. ⟨10.1109/LSP.2014.2323432⟩ (2014)
BASE
Show details
11
Impact of overlapping speech detection on speaker diarization for broadcast news and debates
In: IEEE International Conference on Acoustics, Speech, and Signal Processing ; https://hal.archives-ouvertes.fr/hal-01836475 ; IEEE International Conference on Acoustics, Speech, and Signal Processing, Jan 2013, Vancouver, Canada (2013)
BASE
Show details
12
Towards a better integration of written names for unsupervised speakers identification in videos
In: First Workshop on Speech, Language and Audio in Multimedia, SLAM ; https://hal.inria.fr/hal-00953089 ; First Workshop on Speech, Language and Audio in Multimedia, SLAM, 2013, Marseille, France (2013)
BASE
Show details
13
Une étude quantitative des marqueurs discursifs, disfluences et chevauchements de parole dans des interviews politiques
In: ISSN: 2118-870X ; EISSN: 2264-7082 ; Travaux Interdisciplinaires du Laboratoire Parole et Langage d'Aix-en-Provence (TIPA) ; https://hal.archives-ouvertes.fr/hal-01135042 ; Travaux Interdisciplinaires du Laboratoire Parole et Langage d'Aix-en-Provence (TIPA), Laboratoire Parole et Langage, 2013, pp.18. ⟨10.4000/tipa.830⟩ (2013)
BASE
Show details
14
Lattice MLLR based m-vector system for speaker verification
In: IEEE International Conference on Acoustics, Speech, and Signal Processing ; https://hal.archives-ouvertes.fr/hal-01836461 ; IEEE International Conference on Acoustics, Speech, and Signal Processing, Jan 2013, Vancouver, Canada (2013)
BASE
Show details
15
Unsupervised Speaker Identification using Overlaid Texts in TV Broadcast
In: Proceedings of the 13th Annual Conference of the International Speech Communication Association (Interspeech) ; Interspeech 2012 - Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-00767427 ; Interspeech 2012 - Conference of the International Speech Communication Association, Sep 2012, Portland, OR, United States. 4p (2012)
BASE
Show details
16
Comparing Multi-Stage Approaches for Cross-Show Speaker Diarization
In: Interspeech 2011 ; https://hal.archives-ouvertes.fr/hal-01690265 ; Interspeech 2011, Aug 2011, Florence, Italy (2011)
BASE
Show details
17
Time structure and detection of the multivoiced segments in mixed speech
In: International Congress of Phonetic Sciences ; https://hal.archives-ouvertes.fr/hal-01836479 ; International Congress of Phonetic Sciences, Jan 2011, Hong Kong, China (2011)
BASE
Show details
18
Using sets of combs to control pitch estimation errors
In: Proceedings of Meetings on Acoustics ; Acoustics'08 ; https://hal.archives-ouvertes.fr/hal-01836484 ; Acoustics'08, SFA, Jan 2008, Paris, France. pp.060003, ⟨10.1121/1.2998757⟩ (2008)
BASE
Show details
19
Annotation and analysis of overlapping speech in political interviews
In: LREC 2008 ; https://hal.archives-ouvertes.fr/hal-01690328 ; LREC 2008, May 2008, Marrakech, Morocco (2008)
BASE
Show details
20
Comparing Prosodic Models for Speaker Recognition
In: Interspeech 2008 ; https://hal.archives-ouvertes.fr/hal-01690268 ; Interspeech 2008, Sep 2008, Brisbane, Australia (2008)
BASE
Show details

Page: 1 2

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
23
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern