Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1411.4555
Cited By
Show and Tell: A Neural Image Caption Generator
17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Show and Tell: A Neural Image Caption Generator"
50 / 2,023 papers shown
Title
Hierarchical Adaptable and Transferable Networks (HATN) for Driving Behavior Prediction
Letian Wang
Yeping Hu
Liting Sun
Wei Zhan
Masayoshi Tomizuka
Changliu Liu
21
16
0
01 Nov 2021
Latent Cognizance: What Machine Really Learns
Pisit Nakjai
J. Ponsawat
Tatpong Katanyukul
BDL
18
3
0
29 Oct 2021
Discovering Non-monotonic Autoregressive Orderings with Variational Inference
Xuanlin Li
Brandon Trabucco
Dongmin Park
Michael Luo
S. Shen
Trevor Darrell
Yang Gao
27
12
0
27 Oct 2021
Recurrence along Depth: Deep Convolutional Neural Networks with Recurrent Layer Aggregation
Jingyu Zhao
Yanwen Fang
Guodong Li
27
23
0
22 Oct 2021
Exploiting Cross-Modal Prediction and Relation Consistency for Semi-Supervised Image Captioning
Yang Yang
Haoran Wei
Hengshu Zhu
Dianhai Yu
Hui Xiong
Jian Yang
SSL
14
33
0
22 Oct 2021
Adaptive Bridge between Training and Inference for Dialogue
Haoran Xu
Hainan Zhang
Yanyan Zou
Hongshen Chen
Zhuoye Ding
Yanyan Lan
CVBM
6
8
0
22 Oct 2021
ASFormer: Transformer for Action Segmentation
Fangqiu Yi
Hongyu Wen
Tingting Jiang
ViT
79
174
0
16 Oct 2021
Self-Annotated Training for Controllable Image Captioning
Zhangzi Zhu
Tianlei Wang
Hong Qu
29
2
0
16 Oct 2021
Guiding Visual Question Generation
Nihir Vedd
Zixu Wang
Marek Rei
Yishu Miao
Lucia Specia
89
23
0
15 Oct 2021
Identification of Attack-Specific Signatures in Adversarial Examples
Hossein Souri
Pirazh Khorramshahi
Chun Pong Lau
Micah Goldblum
Rama Chellappa
AAML
MLAU
48
4
0
13 Oct 2021
Topic Scene Graph Generation by Attention Distillation from Caption
Wenbin Wang
R. Wang
X. Chen
DiffM
30
14
0
12 Oct 2021
Semi-Autoregressive Image Captioning
Xu Yan
Zhengcong Fei
Zekang Li
Shuhui Wang
Qingming Huang
Qi Tian
35
23
0
11 Oct 2021
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
Yangguang Li
Feng Liang
Lichen Zhao
Yufeng Cui
Wanli Ouyang
Jing Shao
F. Yu
Junjie Yan
VLM
CLIP
50
448
0
11 Oct 2021
Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization
S. Gu
Manfred Diaz
Daniel Freeman
Hiroki Furuta
Seyed Kamyar Seyed Ghasemipour
Anton Raichuk
Byron David
Erik Frey
Erwin Coumans
Olivier Bachem
44
14
0
10 Oct 2021
Accessible Visualization via Natural Language Descriptions: A Four-Level Model of Semantic Content
Alan Lundgard
Arvind Satyanarayan
25
128
0
08 Oct 2021
Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning
Ali Furkan Biten
L. G. I. Bigorda
Dimosthenis Karatzas
102
57
0
04 Oct 2021
Learning Structural Representations for Recipe Generation and Food Retrieval
Hao Wang
Guosheng Lin
Guosheng Lin
Chunyan Miao
29
28
0
04 Oct 2021
Transfer Learning Approaches for Knowledge Discovery in Grid-based Geo-Spatiotemporal Data
Aishwarya Sarkar
Jien Zhang
Chaoqun Lu
Ali Jannesari
AI4CE
33
2
0
02 Oct 2021
Geometry Attention Transformer with Position-aware LSTMs for Image Captioning
Chi-Yin Wang
Yulin Shen
Luping Ji
ViT
52
49
0
01 Oct 2021
A Review of Text Style Transfer using Deep Learning
Martina Toshevska
Sonja Gievska
CLIP
48
43
0
30 Sep 2021
Google Neural Network Models for Edge Devices: Analyzing and Mitigating Machine Learning Inference Bottlenecks
Amirali Boroumand
Saugata Ghose
Berkin Akin
Ravi Narayanaswami
Geraldo F. Oliveira
Xiaoyu Ma
Eric Shiu
O. Mutlu
25
82
0
29 Sep 2021
Geometry-Entangled Visual Semantic Transformer for Image Captioning
Ling Cheng
Wei Wei
Feida Zhu
Yong Liu
Chunyan Miao
ViT
21
3
0
29 Sep 2021
CIDEr-R: Robust Consensus-based Image Description Evaluation
G. O. D. Santos
Esther Luna Colombini
Sandra Avila
47
30
0
28 Sep 2021
Weakly Supervised Contrastive Learning for Chest X-Ray Report Generation
An Yan
Zexue He
Xing Lu
Jingfeng Du
E. Chang
Amilcare Gentili
Julian McAuley
Chun-Nan Hsu
MedIm
88
64
0
25 Sep 2021
Scene Graph Generation for Better Image Captioning?
Maximilian Mozes
Martin Schmitt
Vladimir Golkov
Hinrich Schütze
Daniel Cremers
GNN
34
3
0
23 Sep 2021
Pix2seq: A Language Modeling Framework for Object Detection
Ting-Li Chen
Saurabh Saxena
Lala Li
David J. Fleet
Geoffrey E. Hinton
MLLM
ViT
VLM
244
344
0
22 Sep 2021
Caption Enriched Samples for Improving Hateful Memes Detection
Efrat Blaier
Itzik Malkiel
Lior Wolf
VLM
61
21
0
22 Sep 2021
Survey: Transformer based Video-Language Pre-training
Ludan Ruan
Qin Jin
VLM
ViT
72
44
0
21 Sep 2021
Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning
Shikha Dubey
Farrukh Olimov
M. Rafique
Joonmo Kim
M. Jeon
ViT
36
37
0
16 Sep 2021
SafeAccess+: An Intelligent System to make Smart Home Safer and Americans with Disability Act Compliant
Shahinur Alam
24
2
0
14 Sep 2021
Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation
Zechen Bai
Yuta Nakashima
Noa Garcia
68
43
0
13 Sep 2021
DSSL: Deep Surroundings-person Separation Learning for Text-based Person Retrieval
A. Zhu
Zijie Wang
Yifeng Li
Xili Wan
Jing Jin
Tian Wang
Fangqiang Hu
G. Hua
95
162
0
12 Sep 2021
We went to look for meaning and all we got were these lousy representations: aspects of meaning representation for computational semantics
Simon Dobnik
R. Cooper
Adam Ek
Bill Noble
Staffan Larsson
N. Ilinykh
Vladislav Maraev
Vidya Somashekarappa
30
0
0
10 Sep 2021
Sensor-Augmented Egocentric-Video Captioning with Dynamic Modal Attention
Katsuyuki Nakamura
Hiroki Ohashi
Mitsuhiro Okada
EgoV
36
13
0
07 Sep 2021
LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation
Mohammad Abuzar Shaikh
Zhanghexuan Ji
Dana Moukheiber
Yan Shen
S. Srihari
Mingchen Gao
VLM
22
1
0
04 Sep 2021
Working Memory Connections for LSTM
Federico Landi
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
KELM
29
158
0
31 Aug 2021
QACE: Asking Questions to Evaluate an Image Caption
Hwanhee Lee
Thomas Scialom
Seunghyun Yoon
Franck Dernoncourt
Kyomin Jung
CoGe
27
18
0
28 Aug 2021
Automated Generation of Accurate \& Fluent Medical X-ray Reports
Hoang T.N. Nguyen
Dong Nie
Taivanbat Badamdorj
Yujie Liu
Yingying Zhu
J. Truong
Li Cheng
MedIm
LM&MA
27
40
0
27 Aug 2021
SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments
Muhammad Zubair Irshad
Niluthpol Chowdhury Mithun
Zachary Seymour
Han-Pang Chiu
S. Samarasekera
Rakesh Kumar
LM&Ro
26
49
0
26 Aug 2021
Similar Scenes arouse Similar Emotions: Parallel Data Augmentation for Stylized Image Captioning
Guodun Li
Yuchen Zhai
Zehao Lin
Yin Zhang
59
21
0
26 Aug 2021
Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training
Yuqing Song
Shizhe Chen
Qin Jin
Wei Luo
Jun Xie
Fei Huang
31
18
0
25 Aug 2021
Auto-Parsing Network for Image Captioning and Visual Question Answering
Xu Yang
Chongyang Gao
Hanwang Zhang
Jianfei Cai
24
35
0
24 Aug 2021
Group-based Distinctive Image Captioning with Memory Attention
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
21
18
0
20 Aug 2021
Medical-VLBERT: Medical Visual Language BERT for COVID-19 CT Report Generation With Alternate Learning
Guangyi Liu
Yinghong Liao
Fuyu Wang
Bin Zhang
Lu Zhang
...
Xiang Wan
Shaolin Li
Zhen Li
Shuixing Zhang
Shuguang Cui
28
56
0
11 Aug 2021
Communicating Visualizations without Visuals: Investigation of Visualization Alternative Text for People with Visual Impairments
C. Jung
Shubham Mehta
Atharva Kulkarni
Yuhang Zhao
Yea-Seul Kim
144
55
0
08 Aug 2021
Screen2Words: Automatic Mobile UI Summarization with Multimodal Learning
Bryan Wang
Gang Li
Xin Zhou
Zhourong Chen
Tovi Grossman
Yang Li
170
154
0
07 Aug 2021
Tiny Neural Models for Seq2Seq
A. Kandoor
34
0
0
07 Aug 2021
Interpretable Visual Understanding with Cognitive Attention Network
Xuejiao Tang
Wenbin Zhang
Yi Yu
Kea Turner
Tyler Derr
Mengyu Wang
Eirini Ntoutsi
52
12
0
06 Aug 2021
Neural Twins Talk & Alternative Calculations
Zanyar Zohourianshahzadi
Jugal Kalita
25
0
0
05 Aug 2021
Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning
Xinzhi Dong
Chengjiang Long
Wenju Xu
Chunxia Xiao
ViT
83
66
0
05 Aug 2021
Previous
1
2
3
...
11
12
13
...
39
40
41
Next