Hits 1 – 11 of 11
1
BBC-Oxford British Sign Language Dataset
Albanie, Samuel; Varol, Gül; Momeni, Liliane ...
In: https://hal.archives-ouvertes.fr/hal-03516444 ; 2022 (2022)
BASE
2
Aligning Subtitles in Sign Language Videos
Bull, Hannah; Afouras, Triantafyllos; Varol, Gül; Albanie, Samuel; Momeni, Liliane; Zisserman, Andrew
In: International Conference on Computer Vision (ICCV) ; https://hal.archives-ouvertes.fr/hal-03515983 ; International Conference on Computer Vision (ICCV), Oct 2021, Montreal, Canada (2021)
Abstract:
The goal of this work is to temporally align asynchronous subtitles in sign language videos. In particular, we focus on sign-language interpreted TV broadcast data comprising (i) a video of continuous signing, and (ii) subtitles corresponding to the audio content. Previous work exploiting such weakly-aligned data only considered finding keyword-sign correspondences, whereas we aim to localise a complete subtitle text in continuous signing. We propose a Transformer architecture tailored for this task, which we train on manually annotated alignments covering over 15K subtitles that span 17.7 hours of video. We use BERT subtitle embeddings and CNN video representations learned for sign recognition to encode the two signals, which interact through a series of attention layers. Our model outputs frame-level predictions, i.e., for each video frame, whether it belongs to the queried subtitle or not. Through extensive evaluations, we show substantial improvements over existing alignment baselines that do not make use of subtitle text embeddings for learning. Our automatic alignment model opens up possibilities for advancing machine translation of sign languages via providing continuously synchronized video-text data.
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] ; [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV] ; [INFO]Computer Science [cs]
URL:
https://hal.archives-ouvertes.fr/hal-03515983
BASE
3
Read and Attend: Temporal Localisation in Sign Language Videos
Varol, Gül; Momeni, Liliane; Albanie, Samuel ...
In: 2021 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2021) ; https://hal.archives-ouvertes.fr/hal-03513396 ; 2021 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2021), Jun 2021, Nashville, TN, United States. ⟨10.1109/CVPR46437.2021.01658⟩ (2021)
BASE
4
Read and Attend: Temporal Localisation in Sign Language Videos ...
Varol, Gül; Momeni, Liliane; Albanie, Samuel
arXiv, 2021
BASE
5
Visual Keyword Spotting with Attention ...
Prajwal, K R; Momeni, Liliane; Afouras, Triantafyllos
arXiv, 2021
BASE
6
Aligning Subtitles in Sign Language Videos ...
Bull, Hannah; Afouras, Triantafyllos; Varol, Gül
arXiv, 2021
BASE
7
BBC-Oxford British Sign Language Dataset ...
Albanie, Samuel; Varol, Gül; Momeni, Liliane
arXiv, 2021
BASE
8
BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues
Varol, Gül; Albanie, Samuel; Momeni, Liliane ...
In: European Conference on Computer Vision (ECCV) 2020 ; https://hal.archives-ouvertes.fr/hal-03516489 ; European Conference on Computer Vision (ECCV) 2020, Aug 2020, Glasgow, United Kingdom. ⟨10.1007/978-3-030-58621-8_3⟩ (2020)
BASE
9
Watch, read and lookup: learning to spot signs from multiple supervisors
Varol, Gül; Momeni, Liliane; Albanie, Samuel ...
In: Asian Conference on Computer Vision (ACCV) 2020 ; https://hal.archives-ouvertes.fr/hal-03516457 ; Asian Conference on Computer Vision (ACCV) 2020, Nov 2020, Kyoto, Japan. ⟨10.1007/978-3-030-69544-6_18⟩ (2020)
BASE
10
Watch, read and lookup: learning to spot signs from multiple supervisors ...
Momeni, Liliane; Varol, Gül; Albanie, Samuel
arXiv, 2020
BASE
11
BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues ...
Albanie, Samuel; Varol, Gül; Momeni, Liliane
arXiv, 2020
BASE