Home
Catalogue search
Refine your search:
Keyword:
68T07, 68T45, 68T50 (1)
Computation and Language cs.CL (1)
Computer Vision and Pattern Recognition cs.CV (1)
FOS Computer and information sciences (1)
I.2.7; I.2.10; I.5.1 (1)
Machine Learning cs.LG (1)
Neural and Evolutionary Computing cs.NE (1)
Creator / Publisher:
Kelleher, John D. (1)
Lindh, Annika (1)
Ross, Robert J. (1)
Year
Medium
Type
BLLDB-Access
Search in the Catalogues and Directories
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
Sort by
creator [A → Z]
'
creator [Z → A]
'
publishing year ↑ (asc)
'
publishing year ↓ (desc)
'
title [A → Z]
'
title [Z → A]
'
Simple Search
Hits 1 – 1 of 1
1
Language-Driven Region Pointer Advancement for Controllable Image Captioning ...
Lindh, Annika
;
Ross, Robert J.
;
Kelleher, John D.
. - : arXiv, 2020
Abstract:
Controllable Image Captioning is a recent sub-field in the multi-modal task of Image Captioning wherein constraints are placed on which regions in an image should be described in the generated natural language caption. This puts a stronger focus on producing more detailed descriptions, and opens the door for more end-user control over results. A vital component of the Controllable Image Captioning architecture is the mechanism that decides the timing of attending to each region through the advancement of a region pointer. In this paper, we propose a novel method for predicting the timing of region pointer advancement by treating the advancement step as a natural part of the language structure via a NEXT-token, motivated by a strong correlation to the sentence structure in the training data. We find that our timing agrees with the ground-truth timing in the Flickr30k Entities test data with a precision of 86.55% and a recall of 97.92%. Our model implementing this technique improves the state-of-the-art on ... : Accepted to COLING 2020 ...
Keyword:
68T07, 68T45, 68T50
;
Computation and Language cs.CL
;
Computer Vision and Pattern Recognition cs.CV
;
FOS Computer and information sciences
;
I.2.7; I.2.10; I.5.1
;
Machine Learning cs.LG
;
Neural and Evolutionary Computing cs.NE
URL:
https://arxiv.org/abs/2011.14901
https://dx.doi.org/10.48550/arxiv.2011.14901
BASE
Hide details
Mobile view
All
Catalogues
UB Frankfurt Linguistik
0
IDS Mannheim
0
OLC Linguistik
0
UB Frankfurt Retrokatalog
0
DNB Subject Category Language
0
Institut für Empirische Sprachwissenschaft
0
Leibniz-Centre General Linguistics (ZAS)
0
Bibliographies
BLLDB
0
BDSL
0
IDS Bibliografie zur deutschen Grammatik
0
IDS Bibliografie zur Gesprächsforschung
0
IDS Konnektoren im Deutschen
0
IDS Präpositionen im Deutschen
0
IDS OBELEX meta
0
MPI-SHH Linguistics Collection
0
MPI for Psycholinguistics
0
Linked Open Data catalogues
Annohub
0
Online resources
Link directory
0
Journal directory
0
Database directory
0
Dictionary directory
0
Open access documents
BASE
1
Linguistik-Repository
0
IDS Publikationsserver
0
Online dissertations
0
Language Description Heritage
0
© 2013 - 2024 Lin|gu|is|tik
|
Imprint
|
Privacy Policy
|
Datenschutzeinstellungen ändern