Hits 1 – 6 of 6

1	A Semi-supervised Approach for Sentiment Analysis of Arab(ic+izi) Messages: Application to the Algerian Dialect
	Adeel, Ahsan; Guellil, Imane; Gogate, Mandar. - : Springer, 2021
	BASE

2	A Semi-supervised Approach for Sentiment Analysis of Arab(ic+izi) Messages: Application to the Algerian Dialect
	Guellil, Imane; Adeel, Ahsan; Azouaou, Faical. - 2021
	BASE

3	CochleaNet: A robust language-independent audio-visual model for real-time speech enhancement
	Gogate, Mandar; Dashtipour, Kia; Adeel, Ahsan. - : Elsevier, 2020
	BASE

4	Offline Arabic Handwriting Recognition Using Deep Machine Learning: A Review of Recent Advances
	Ahmed, Rami; Dashtipour, Kia; Gogate, Mandar. - : Springer, 2020
	BASE

5	Lip-reading driven deep learning approach for speech enhancement
	Adeel, Ahsan; Gogate, Mandar; Hussain, Amir; Whitmer, William M
	In: abs/1808.00046 ; 1 ; 10 (2019)
	Abstract: This paper proposes a novel lip-reading driven deep learning framework for speech enhancement. The proposed approach leverages the complementary strengths of both deep learning and analytical acoustic modelling (a filtering-based approach), in contrast to recently published, comparatively simpler benchmark approaches that rely only on deep learning. The proposed audio-visual (AV) speech enhancement framework operates at two levels. In the first level, a novel deep learning-based lip-reading regression model is employed. In the second level, the clean-audio features approximated by lip reading are exploited, using an enhanced, visually-derived Wiener filter (EVWF), to estimate the clean audio power spectrum. Specifically, a stacked long short-term memory (LSTM) based lip-reading regression model is designed to estimate clean audio features using only temporal visual features, considering different numbers of prior visual frames. For clean speech spectrum estimation, a new filterbank-domain EVWF is formulated, which exploits the estimated speech features. The proposed EVWF is compared with conventional Spectral Subtraction and Log-Minimum Mean-Square Error methods using both ideal AV mapping and LSTM-driven AV mapping. The potential of the proposed speech enhancement framework is evaluated under different dynamic, real-world, commercially motivated scenarios (e.g. cafe, public transport, pedestrian area) at different SNR levels (ranging from low to high SNRs) using the benchmark Grid and ChiME3 corpora. For objective testing, perceptual evaluation of speech quality is used to evaluate the quality of the restored speech. For subjective testing, the standard mean-opinion-score method is used with inferential statistics. Comparative simulation results demonstrate significant lip-reading and speech enhancement improvement in terms of both speech quality and speech intelligibility.
	Funding: UK Engineering and Physical Sciences Research Council (EPSRC) Grant No. EP/M026981/1.
	Version: Published version
	Keyword: audio-visual ChiME3 corpus; context-aware audio-visual speech enhancement; enhanced visually-derived Wiener filtering; lip reading; stacked long short-term memory
	URL: https://doi.org/10.1109/tetci.2019.2917039 ; http://hdl.handle.net/2436/622874
	BASE

6	Persian Named Entity Recognition
	Dashtipour, Kia; Gogate, Mandar; Adeel, Ahsan. - : Institute of Electrical and Electronics Engineers Inc, Piscataway, NJ, USA, 2017
	BASE
