Hits 1 – 13 of 13

1	Using heterogeneity in semi-supervised transcription hypotheses to improve code-switched speech recognition ...
	Slottje, Andrew; Wotherspoon, Shannon; Hartmann, William. - : arXiv, 2021
	BASE

2	Cross-lingual Information Retrieval with BERT ...
	Jiang, Zhuolin; El-Jaroudi, Amro; Hartmann, William. - : arXiv, 2020
	BASE

3	Lexical speaker identification in TV shows
	Roy, Anindya; Bredin, Hervé; Hartmann, William; Le, Viet Bac; Barras, Claude; Gauvain, Jean-Luc
	In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690342 ; Multimedia Tools and Applications, Springer Verlag, 2015, 74 (4), pp.1377 - 1396. ⟨10.1007/s11042-014-1940-3⟩ (2015)
	Abstract: The final publication is available at https://link.springer.com/article/10.1007/s11042-014-1940-3 ; International audience ; Lexical information extracted from speech transcripts can be used for speaker identification (SID), either on its own or, upon fusion, to improve the performance of standard cepstral-based SID systems. This was previously established, typically using isolated speech from single speakers (NIST SRE corpora, parliamentary speeches). In contrast, this work applies lexical approaches for SID to a different type of data: the REPERE corpus, which consists of unsegmented multiparty conversations, mostly debates, discussions and Q&A sessions from TV shows. It is hypothesized that people give out clues to their identity when speaking in such settings, and this work aims to exploit those clues. The impact on SID performance of the diarization front-end required to pre-process the unsegmented data is also measured. Four lexical SID approaches are studied in this work, including TFIDF, BM25 and LDA-based topic modeling. Results are analysed in terms of TV shows and speaker roles. Lexical approaches achieve low error rates for certain speaker roles such as anchors and journalists, sometimes lower than a standard cepstral-based Gaussian Supervector-Support Vector Machine (GSV-SVM) system. Also, in certain cases, the lexical system shows modest improvement over the cepstral-based system performance using score-level sum fusion. To highlight the potential of using lexical information not just to improve upon cepstral-based SID systems but as an independent approach in its own right, initial studies on crossmedia SID are briefly reported. Instead of using speech data, as all cepstral systems require, this approach uses Wikipedia texts to train lexical speaker models, which are then tested on speech transcripts to identify speakers.
	Keyword: [INFO.INFO-MM]Computer Science [cs]/Multimedia [cs.MM]; [INFO]Computer Science [cs]
	URL: https://hal.archives-ouvertes.fr/hal-01690342/file/paper_v0.pdf https://hal.archives-ouvertes.fr/hal-01690342/document https://doi.org/10.1007/s11042-014-1940-3 https://hal.archives-ouvertes.fr/hal-01690342
	BASE

4	Comparing decoding strategies for subword-based keyword spotting in low-resourced languages
	Hartmann, William; Le, Viet Bac; Messaoudi, Abdelkhalek...
	In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01843408 ; Annual Conference of the International Speech Communication Association , ISCA, Sep 2014, Singapore, Singapore (2014)
	BASE

5	Efficient Rule Scoring for Improved Grapheme-Based Lexicons
	Hartmann, William; Lamel, Lori; Gauvain, Jean-Luc
	In: European Signal Processing Conference ; https://hal.archives-ouvertes.fr/hal-01843411 ; European Signal Processing Conference, Jan 2014, Lisbon, Portugal (2014)
	BASE

6	Cross-Word Sub-Word Units for Low-Resource Keyword Spotting
	Hartmann, William; Lamel, Lori; Gauvain, Jean-Luc
	In: International Workshop on Spoken Languages Technologies for Under-resourced languages ; https://hal.archives-ouvertes.fr/hal-01843415 ; International Workshop on Spoken Languages Technologies for Under-resourced languages, May 2014, St. Petersburg, Russia (2014)
	BASE

7	Efficient Rule Scoring For Improved Grapheme-Based Lexicons ...
	Gauvain, Jean-Luc; Hartmann, William; Lamel, Lori. - : Zenodo, 2014
	BASE

8	Acoustic unit discovery and pronunciation generation from a grapheme-based lexicon
	Hartmann, William; Roy, Anindya; Lamel, Lori...
	In: IEEE Automatic Speech Recognition and Understanding Workshop ; https://hal.archives-ouvertes.fr/hal-01843433 ; IEEE Automatic Speech Recognition and Understanding Workshop, Dec 2013, Olomouc, Czech Republic (2013)
	BASE

9	Acoustic signal processing
	Hartmann, William M.
	In: Springer handbook of acoustics (New York [etc.], 2007), p. 503-532
	MPI für Psycholinguistik

10	On the Duifhuis pitch effect
	Lin, Jian-Yu; Hartmann, William Morris
	In: Acoustical Society of America. The journal of the Acoustical Society of America. - Melville, NY : AIP 101 (1997) 2, 1034-1043
	BLLDB

11	On the externalization of sound images
	Hartmann, William Morris; Wittenberg, Andrew
	In: Acoustical Society of America. The journal of the Acoustical Society of America. - Melville, NY : AIP 99 (1996) 6, 3678-3688
	BLLDB

12	The physical description of signals
	Hartmann, William Morris
	In: Hearing. - San Diego [u.a.] : Acad. Press (1995), 1-40
	BLLDB

13	Auditory spectral discrimination and the localization of clicks in the sagittal plane
	Hartmann, William Morris; Rakerd, Brad
	In: Acoustical Society of America. The journal of the Acoustical Society of America. - Melville, NY : AIP 94 (1993) 4, 2083-2092
	BLLDB
