DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5
Hits 1 – 20 of 83

1
Big Data analytics to assess personality based on voice analysis
BASE
Show details
2
Refugees Welcome? Online Hate Speech and Sentiments in Twitter in Spain during the Reception of the Boat Aquarius
BASE
Show details
3
Conversational agent for supporting learners on a MOOC on programming with Java
BASE
Show details
4
An auditory saliency pooling-based LSTM model for speech intelligibility classification
Abstract: This article belongs to the Section Computer and Engineering Science and Symmetry/Asymmetry. ; Speech intelligibility is a crucial element in oral communication that can be influenced by multiple elements, such as noise, channel characteristics, or speech disorders. In this paper, we address the task of speech intelligibility classification (SIC) in this last circumstance. Taking our previous works, a SIC system based on an attentional long short-term memory (LSTM) network, as a starting point, we deal with the problem of the inadequate learning of the attention weights due to training data scarcity. For overcoming this issue, the main contribution of this paper is a novel type of weighted pooling (WP) mechanism, called saliency pooling where the WP weights are not automatically learned during the training process of the network, but are obtained from an external source of information, the Kalinli’s auditory saliency model. In this way, it is intended to take advantage of the apparent symmetry between the human auditory attention mechanism and the attentional models integrated into deep learning networks. The developed systems are assessed on the UA-speech dataset that comprises speech uttered by subjects with several dysarthria levels. Results show that all the systems with saliency pooling significantly outperform a reference support vector machine (SVM)-based system and LSTM-based systems with mean pooling and attention pooling, suggesting that Kalinli’s saliency can be successfully incorporated into the LSTM architecture as an external cue for the estimation of the speech intelligibility level. ; The work leading to these results has been supported by the Spanish Ministry of Economy, Industry and Competitiveness through TEC2017-84395-P (MINECO) and TEC2017-84593-C2-1-R (MINECO) projects (AEI/FEDER, UE), and the Universidad Carlos III de Madrid under Strategic Action 2018/00071/001.
Keyword: Attention; Auditory saliency model; LSTM; Saliency; Speech intelligibility; Telecomunicaciones; Weighted pooling
URL: https://doi.org/10.3390/sym13091728
http://hdl.handle.net/10016/33706
BASE
Hide details
5
Desarrollo de una plataforma para reconocimiento de gestos basada en Tensor Flow Lite sobre el dispositivo IoT Argon de Particle
BASE
Show details
6
Analyzing learners' engagement and behavior in MOOCs on programming with the Codeboard IDE
BASE
Show details
7
Reconocimiento de voz basado en características DNN Bottleneck
BASE
Show details
8
Redesigning a Freshman Engineering Course to Promote Active Learning by Flipping the Classroom through the Reuse of MOOCs
BASE
Show details
9
Análisis y aplicación de técnicas de aprendizaje automático para clasificación de reseñas en redes sociales
BASE
Show details
10
Implementación y evaluación de un sistema QbE-STD (Query-by-Example Spoken Term Detection)
BASE
Show details
11
Assessing EPAP lexical features: A corpus-based study ; Análisis de los rasgos léxicos de IFE: Un estudio de corpus ; Una análisi dels trets lèxics d'AFE: Un estudi de corpus
In: Quaderns de Filologia - Estudis Lingüístics; Vol. 22 (2017): Words, Corpus and Back to Words; 165-186 ; 2444-1449 ; 1135-416X (2018)
BASE
Show details
12
Detección automática de paráfrasis sobre un corpus de preguntas en inglés
BASE
Show details
13
Uncovering flipped-classroom problems at an engineering course on Systems Architecture through data-driven learning design
BASE
Show details
14
Sentiment analysis in MOOCs: a case study
BASE
Show details
15
Deep Neural Network Architectures for Large-scale, Robust and Small-Footprint Speaker and Language Recognition
BASE
Show details
16
Estimación de la normalidad de los lenguados atendiendo a su morfología
BASE
Show details
17
Memoria 2013 - Biblioteca Politécnica
BASE
Show details
18
An end-to-end approach to language identification in short utterances using convolutional neural networks
Lozano-Díez, Alicia; Zazo Candil, Rubén; González Domínguez, Javier. - : International Speech and Communication Association, 2016
BASE
Show details
19
Extrapolación de imágenes mediante procesos glaussianos
BASE
Show details
20
Linguistically-constrained formant-based i-vectors for automatic speaker recognition
BASE
Show details

Page: 1 2 3 4 5

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
83
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern