21 |
A Method of Speech Coding for Speech Recognition Using a Convolutional Neural Network
|
|
In: Symmetry ; Volume 11 ; Issue 9 (2019)
|
|
Abstract:
This work presents a new approach to speech recognition based on a specific coding of the time and frequency characteristics of speech. Convolutional neural networks are proposed because they are known to be highly resistant to cross-spectral distortions and to differences in vocal tract length. Until now, two layers have been used: time convolution and frequency convolution. The novel idea is to weave together three separate convolution layers: the traditional time convolution and two different frequency convolutions (mel-frequency cepstral coefficients (MFCC) convolution and spectrum convolution). This approach takes into account more of the detail contained in the tested signal. The idea is to create patterns for sounds in the form of RGB (Red, Green, Blue) images. The research covered isolated words and continuous speech for the neural network structure. A method for dividing continuous speech into syllables is proposed. The method can also be used for symmetrical stereo sound.
|
|
Keywords:
convolutional neural network; deep learning; speech recognition
|
|
URL: https://doi.org/10.3390/sym11091185
|
|
BASE
22 |
Acoustic event, spoken keyword and emotional outburst detection
|
|
BASE
23 |
Feature Selection for Sentiment Analysis of Swedish News Article Titles ; Val av datarepresentation för sentimentsanalys av svenska nyhetsrubriker
|
|
Dahl, Jonas. KTH, Skolan för elektroteknik och datavetenskap (EECS), 2018
|
|
BASE
24 |
Fault Diagnosis Based on an Approach Combining a Spectrogram and a Convolutional Neural Network with Application to a Wind Turbine System
|
|
In: Energies ; Volume 11 ; Issue 10 (2018)
|
|
BASE
26 |
Speech Emotion Recognition using Convolutional Neural Networks
|
|
In: Computer Science and Engineering: Theses, Dissertations, and Student Research (2018)
|
|
BASE
27 |
Text Recognition in Multimedia Documents: A Study of two Neural-based OCRs Using and Avoiding Character Segmentation
|
|
In: International Journal on Document Analysis and Recognition, Springer Verlag, 2014, 17 (1), pp. 19-31 ; ISSN: 1433-2833 ; EISSN: 1433-2825 ; DOI: 10.1007/s10032-013-0202-7 ; https://hal.archives-ouvertes.fr/hal-00867225
|
|
BASE
28 |
User-Level Psychological Stress Detection from Social Media Using Deep Neural Network
|
|
In: http://hcsi.cs.tsinghua.edu.cn/Paper/paper14/HuijieLin_MM2014.pdf
|
|
BASE