DE eng

Search in the Catalogues and Directories

Hits 1 – 9 of 9

1
Self-Supervised Representation Learning for Speech Using Visual Grounding and Masked Language Modeling ...
Peng, Puyuan; Harwath, David. - : arXiv, 2022
BASE
Show details
2
Cascaded Multilingual Audio-Visual Learning from Videos ...
BASE
Show details
3
Can phones, syllables, and words emerge as side-products of cross-situational audiovisual learning? -- A computational investigation ...
Khorrami, Khazar; Räsänen, Okko. - : arXiv, 2021
BASE
Show details
4
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective ...
BASE
Show details
5
UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation ...
Luo, Huaishao; Ji, Lei; Shi, Botian. - : arXiv, 2020
BASE
Show details
6
UltraSuite: A Repository of Ultrasound and Acoustic Data from Child Speech Therapy Sessions ...
BASE
Show details
7
Speaker-independent classification of phonetic segments from raw ultrasound in child speech ...
BASE
Show details
8
Towards Automatic Speech Identification from Vocal Tract Shape Dynamics in Real-time MRI ...
BASE
Show details
9
Semantic speech retrieval with a visually grounded model of untranscribed speech ...
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
9
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern