DE eng

Search in the Catalogues and Directories

Hits 1 – 8 of 8

1
Speaker Diarization With Lexical Information ...
Park, Tae Jin; Han, Kyu; Lane, Ian. - : arXiv, 2018
BASE
Show details
2
AudioPairBank: Towards A Large-Scale Tag-Pair-Based Audio Content Analysis ...
Abstract: Recently, sound recognition has been used to identify sounds, such as car and river. However, sounds have nuances that may be better described by adjective-noun pairs such as slow car, and verb-noun pairs such as flying insects, which are under explored. Therefore, in this work we investigate the relation between audio content and both adjective-noun pairs and verb-noun pairs. Due to the lack of datasets with these kinds of annotations, we collected and processed the AudioPairBank corpus consisting of a combined total of 1,123 pairs and over 33,000 audio files. One contribution is the previously unavailable documentation of the challenges and implications of collecting audio recordings with these type of labels. A second contribution is to show the degree of correlation between the audio content and the labels through sound recognition experiments, which yielded results of 70% accuracy, hence also providing a performance benchmark. The results and study in this paper encourage further exploration of the ... : This paper is a revised version of "AudioSentibank: Large-scale Semantic Ontology of Acoustic Concepts for Audio Content Analysis" ...
Keyword: Computation and Language cs.CL; FOS Computer and information sciences; Sound cs.SD
URL: https://arxiv.org/abs/1607.03766
https://dx.doi.org/10.48550/arxiv.1607.03766
BASE
Hide details
3
Recurrent Models for Auditory Attention in Multi-Microphone Distance Speech Recognition ...
Kim, Suyoun; Lane, Ian. - : arXiv, 2015
BASE
Show details
4
Bilingual-LSA Based LM Adaptation for Spoken Language Translation
BASE
Show details
5
Bilingual LSA-based adaptation for statistical machine translation
In: Machine translation. - Dordrecht [u.a.] : Springer Science + Business Media 21 (2007) 4, 187-207
BLLDB
Show details
6
Out-of-domain utterance detection using classification confidences of multiple topics
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 15 (2007) 1, 150-161
BLLDB
Show details
7
Bilingual-LSA Based LM Adaptation for Spoken Language Translation ...
Tam, Yik-Cheung; Lane, Ian; Schultz, Tanja. - : Karlsruhe, 2007
BASE
Show details
8
Bilingual LSA-based adaptation for statistical machine translation
In: Machine translation. - Dordrecht [u.a.] : Springer Science + Business Media 21 (2006) 4, 187-208
OLC Linguistik
Show details

Catalogues
0
0
1
0
0
0
0
Bibliographies
2
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
5
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern