1 |
FactDrill: A Data Repository of Fact-checked Social Media Content to Study Fake News Incidents in India ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
FactDrill: A Data Repository of Fact-checked Social Media Content to Study Fake News Incidents in India ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Factorization of Fact-Checks for Low Resource Indian Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
GupShup: An Annotated Corpus for Abstractive Summarization of Open-Domain Code-Switched Conversations ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Automated Speech Scoring System Under The Lens: Evaluating and interpreting the linguistic cues for language proficiency ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Perception Point: Identifying Critical Learning Periods in Speech for Bilingual Networks ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Robust suicide risk assessment on social media via deep adversarial learning
|
|
|
|
In: J Am Med Inform Assoc (2021)
|
|
BASE
|
|
Show details
|
|
8 |
LIFI: Towards Linguistically Informed Frame Interpolation ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
#MeTooMA: Multi-Aspect Annotations of Tweets Related to the MeToo Movement
|
|
|
|
In: Proceedings of the International AAAI Conference on Web and Social Media; Vol. 14 (2020): Fourteenth International AAAI Conference on Web and Social Media; 209-216 ; 2334-0770 ; 2162-3449 (2020)
|
|
BASE
|
|
Show details
|
|
11 |
Multi-modal Automated Speech Scoring using Attention Fusion ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Get It Scored Using AutoSAS -- An Automated System for Scoring Short Answers ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
"Notic My Speech" -- Blending Speech Patterns With Multimedia ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
#MeTooMA: Multi-Aspect Annotations of Tweets Related to the MeToo Movement ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Mind Your Language: Abuse and Offense Detection for Code-Switched Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Harnessing AI for Speech Reconstruction using Multi-view Silent Video Feed ...
|
|
|
|
Abstract:
Speechreading or lipreading is the technique of understanding and getting phonetic features from a speaker's visual features such as movement of lips, face, teeth and tongue. It has a wide range of multimedia applications such as in surveillance, Internet telephony, and as an aid to a person with hearing impairments. However, most of the work in speechreading has been limited to text generation from silent videos. Recently, research has started venturing into generating (audio) speech from silent video sequences but there have been no developments thus far in dealing with divergent views and poses of a speaker. Thus although, we have multiple camera feeds for the speech of a user, but we have failed in using these multiple video feeds for dealing with the different poses. To this end, this paper presents the world's first ever multi-view speech reading and reconstruction system. This work encompasses the boundaries of multimedia research by putting forth a model which leverages silent video feeds from ... : 2018 ACM Multimedia Conference (MM '18), October 22--26, 2018, Seoul, Republic of Korea ...
|
|
Keyword:
Audio and Speech Processing eess.AS; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
|
|
URL: https://dx.doi.org/10.48550/arxiv.1807.00619 https://arxiv.org/abs/1807.00619
|
|
BASE
|
|
Hide details
|
|
|
|