Catalogue search
Hits 1 – 3 of 3
1
Language-Driven Region Pointer Advancement for Controllable Image Captioning
Lindh, Annika; Ross, Robert; Kelleher, John D.
In: Conference papers (2020)
Abstract:
Controllable Image Captioning is a recent sub-field in the multi-modal task of Image Captioning wherein constraints are placed on which regions in an image should be described in the generated natural language caption. This puts a stronger focus on producing more detailed descriptions, and opens the door for more end-user control over results. A vital component of the Controllable Image Captioning architecture is the mechanism that decides the timing of attending to each region through the advancement of a region pointer. In this paper, we propose a novel method for predicting the timing of region pointer advancement by treating the advancement step as a natural part of the language structure via a NEXT-token, motivated by a strong correlation to the sentence structure in the training data. We find that our timing agrees with the ground-truth timing in the Flickr30k Entities test data with a precision of 86.55% and a recall of 97.92%. Our model implementing this technique improves the state-of-the-art on standard captioning metrics while additionally demonstrating a considerably larger effective vocabulary size.
Keyword:
Artificial Intelligence and Robotics; computer vision; controllable image captioning; deep learning; machine learning; natural language generation
URL:
https://arrow.tudublin.ie/scschcomcon/286
https://arrow.tudublin.ie/cgi/viewcontent.cgi?article=1303&context=scschcomcon
BASE
2
Situating Spatial Templates for Human-Robot Interaction
Sloan, Colm; Ross, Robert; Kelleher, John D.; ...
In: Conference papers (2010)
BASE
3
Topology in Composite Spatial Terms
Kelleher, John D.; Ross, Robert
In: Conference papers (2010)
BASE
© 2013 - 2024 Lin|gu|is|tik