Show and Tell: A Neural Image Caption Generator


17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
    3DV

Papers citing "Show and Tell: A Neural Image Caption Generator"

50 / 2,023 papers shown
Title
ShapeCaptioner: Generative Caption Network for 3D Shapes by Learning a Mapping from Parts Detected in Multiple Views to Sentences
Zhizhong Han
Chao Chen
Yu-Shen Liu
Matthias Zwicker
3DPC
27
46
0
31 Jul 2019
Automatic Generation of Personalized Comment Based on User Profile
Wenhuan Zeng
Abulikemu Abuduweili
Lei Li
Pengcheng Yang
5
22
0
24 Jul 2019
Modeling question asking using neural program generation
ZiYun Wang
Brenden M. Lake
24
7
0
23 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
29
133
0
22 Jul 2019
VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions
Pranava Madhyastha
Josiah Wang
Lucia Specia
13
32
0
22 Jul 2019
A Sufficient Statistic for Influence in Structured Multiagent Environments
F. Oliehoek
Stefan J. Witwicki
L. Kaelbling
23
23
0
22 Jul 2019
Generating Sentiment-Preserving Fake Online Reviews Using Neural Language Models and Their Human- and Machine-based Detection
David Ifeoluwa Adelani
H. Mai
Fuming Fang
H. Nguyen
Junichi Yamagishi
Isao Echizen
DeLMO
22
120
0
22 Jul 2019
Automatic Radiology Report Generation based on Multi-view Image Fusion and Medical Concept Enrichment
Jianbo Yuan
Haofu Liao
R. Luo
Jiebo Luo
MedIm
36
193
0
22 Jul 2019
Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling
Konstantinos Drossos
Shayan Gharib
P. Magron
Tuomas Virtanen
AI4TS
18
22
0
19 Jul 2019
MeetUp! A Corpus of Joint Activity Dialogues in a Visual Environment
N. Ilinykh
Sina Zarrieß
David Schlangen
27
43
0
11 Jul 2019
A Survey of Deep Learning-based Object Detection
L. Jiao
Fan Zhang
Fang Liu
Shuyuan Yang
Lingling Li
Zhixi Feng
Rong Qu
ObjD
31
962
0
11 Jul 2019
Aesthetic Attributes Assessment of Images
Xin Jin
Le Wu
Geng Zhao
Xiaodong Li
Xiaokun Zhang
Shiming Ge
Dongqing Zou
Bin Zhou
Xinghui Zhou
22
37
0
11 Jul 2019
M3D-GAN: Multi-Modal Multi-Domain Translation with Universal Attention
Shuang Ma
Daniel J. McDuff
Yale Song
14
4
0
09 Jul 2019
Variational Context: Exploiting Visual and Textual Context for Grounding Referring Expressions
Yulei Niu
Hanwang Zhang
Zhiwu Lu
Shih-Fu Chang
ObjD
BDL
36
24
0
08 Jul 2019
Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters
Federico Landi
Lorenzo Baraldi
M. Corsini
Rita Cucchiara
LM&Ro
36
26
0
05 Jul 2019
A Unified Framework of Online Learning Algorithms for Training Recurrent Neural Networks
O. Marschall
Kyunghyun Cho
Cristina Savin
FedML
36
73
0
05 Jul 2019
Towards Interpretable Deep Extreme Multi-label Learning
Yihuang Kang
I-Ling Cheng
W. Mao
Bowen Kuo
Pei-Ju Lee
16
0
0
03 Jul 2019
Kite: Automatic speech recognition for unmanned aerial vehicles
Dan Oneaţă
H. Cucu
35
13
0
02 Jul 2019
Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems
Hung Le
Doyen Sahoo
Nancy F. Chen
Guosheng Lin
22
111
0
02 Jul 2019
Densely Residual Laplacian Super-Resolution
Saeed Anwar
Nick Barnes
SupR
24
225
0
28 Jun 2019
A Deep Decoder Structure Based on WordEmbedding Regression for An Encoder-Decoder Based Model for Image Captioning
A. Asadi
Reza Safabakhsh
17
3
0
26 Jun 2019
Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments
K. Niu
Y. Huang
Wanli Ouyang
Liang Wang
27
139
0
23 Jun 2019
Sequence Generation: From Both Sides to the Middle
Long Zhou
Jiajun Zhang
Chengqing Zong
Heng Yu
36
22
0
23 Jun 2019
A Deep Generative Model for Code-Switched Text
Bidisha Samanta
Sharmila Reddy Nangi
Hussain Jagirdar
Niloy Ganguly
Soumen Chakrabarti
11
34
0
21 Jun 2019
Informative Image Captioning with External Sources of Information
Sanqiang Zhao
Piyush Sharma
Tomer Levinboim
Radu Soricut
21
45
0
20 Jun 2019
Expressing Visual Relationships via Language
Hao Tan
Franck Dernoncourt
Zhe Lin
Trung Bui
Joey Tianyi Zhou
29
63
0
18 Jun 2019
Neurally-Guided Structure Inference
Sidi Lu
Jiayuan Mao
J. Tenenbaum
Jiajun Wu
14
7
0
17 Jun 2019
Image Captioning with Integrated Bottom-Up and Multi-level Residual Top-Down Attention for Game Scene Understanding
Jian Zheng
S. Krishnamurthy
Ruxin Chen
Min-Hung Chen
Zhenhao Ge
Xiaohua Li
43
4
0
16 Jun 2019
Generating Diverse and Informative Natural Language Fashion Feedback
Gil Sadeh
L. Fritz
Gabi Shalev
Eduard Oks
16
5
0
15 Jun 2019
Automatic Conditional Generation of Personalized Social Media Short Texts
Ziwen Wang
Jie Wang
Haiqian Gu
Zhicheng Zhao
Bojin Zhuang
AI4CE
18
7
0
15 Jun 2019
Trimmed Action Recognition, Dense-Captioning Events in Videos, and Spatio-temporal Action Localization with Focus on ActivityNet Challenge 2019
Zhaofan Qiu
Dong Li
Yehao Li
Qi Cai
Yingwei Pan
Ting Yao
27
8
0
14 Jun 2019
Image Captioning: Transforming Objects into Words
Simão Herdade
Armin Kappeler
K. Boakye
Joao Soares
ViT
50
463
0
14 Jun 2019
Semantics to Space(S2S): Embedding semantics into spatial space for zero-shot verb-object query inferencing
Sungmin Eum
H. Kwon
18
3
0
13 Jun 2019
Know What You Don't Know: Modeling a Pragmatic Speaker that Refers to Objects of Unknown Categories
Sina Zarrieß
David Schlangen
22
16
0
13 Jun 2019
Character n-gram Embeddings to Improve RNN Language Models
Sho Takase
Jun Suzuki
Masaaki Nagata
27
25
0
13 Jun 2019
Vispi: Automatic Visual Perception and Interpretation of Chest X-rays
X. Li
Rui Cao
D. Zhu
19
20
0
12 Jun 2019
Mimic and Fool: A Task Agnostic Adversarial Attack
Akshay Chaturvedi
Utpal Garain
AAML
19
26
0
11 Jun 2019
Relationship-Embedded Representation Learning for Grounding Referring Expressions
Sibei Yang
Guanbin Li
Yizhou Yu
ObjD
33
52
0
11 Jun 2019
Figure Captioning with Reasoning and Sequence-Level Training
Charles C. Chen
Ruiyi Zhang
Eunyee Koh
Sungchul Kim
Scott D. Cohen
Tong Yu
Ryan Rossi
Razvan Bunescu
AIMat
31
38
0
07 Jun 2019
Learning in Gated Neural Networks
Ashok Vardhan Makkuva
Sewoong Oh
Sreeram Kannan
Pramod Viswanath
19
10
0
06 Jun 2019
Context-Aware Visual Policy Network for Fine-Grained Image Captioning
Zhengjun Zha
Daqing Liu
Hanwang Zhang
Yongdong Zhang
Feng Wu
25
120
0
06 Jun 2019
KERMIT: Generative Insertion-Based Modeling for Sequences
William Chan
Nikita Kitaev
Kelvin Guu
Mitchell Stern
Jakob Uszkoreit
VLM
23
65
0
04 Jun 2019
Masked Non-Autoregressive Image Captioning
Junlong Gao
Xi Meng
Shiqi Wang
Xia Li
Shanshe Wang
Siwei Ma
Wen Gao
30
36
0
03 Jun 2019
Reconstruct and Represent Video Contents for Captioning via Reinforcement Learning
Wei Zhang
Bairui Wang
Lin Ma
Wei Liu
20
67
0
03 Jun 2019
A Semi-Supervised Approach for Low-Resourced Text Generation
Hongyu Zang
Xiaojun Wan
OffRL
33
5
0
03 Jun 2019
Generating Question Relevant Captions to Aid Visual Question Answering
Jialin Wu
Zeyuan Hu
Raymond J. Mooney
31
42
0
03 Jun 2019
Minimax bounds for structured prediction
Kevin Bello
Asish Ghoshal
Jean Honorio
17
2
0
02 Jun 2019
Learning to Generate Grounded Visual Captions without Localization Supervision
Chih-Yao Ma
Yannis Kalantidis
Ghassan AlRegib
Peter Vajda
Marcus Rohrbach
Z. Kira
SSL
19
10
0
01 Jun 2019
Audio Caption in a Car Setting with a Sentence-Level Loss
Xuenan Xu
Heinrich Dinkel
Mengyue Wu
Kai Yu
13
2
0
31 May 2019
Fashion IQ: A New Dataset Towards Retrieving Images by Natural Language Feedback
Hui Wu
Yupeng Gao
Xiaoxiao Guo
Ziad Al-Halah
Steven J. Rennie
Kristen Grauman
Rogerio Feris
EgoV
28
63
0
30 May 2019