ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1411.4555
  4. Cited By
Show and Tell: A Neural Image Caption Generator

Show and Tell: A Neural Image Caption Generator

17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
    3DV
ArXivPDFHTML

Papers citing "Show and Tell: A Neural Image Caption Generator"

50 / 2,023 papers shown
Title
Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation
Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation
Vihan Jain
Gabriel Ilharco
Alexander Ku
Ashish Vaswani
Eugene Ie
Jason Baldridge
LM&Ro
16
179
0
29 May 2019
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
21
37
0
29 May 2019
Straight to Shapes++: Real-time Instance Segmentation Made More Accurate
Straight to Shapes++: Real-time Instance Segmentation Made More Accurate
Laurynas Miksys
Saumya Jetley
Michael Sapienza
Stuart Golodetz
Philip Torr
3DV
24
7
0
27 May 2019
Transcribing Content from Structural Images with Spotlight Mechanism
Transcribing Content from Structural Images with Spotlight Mechanism
Yu Yin
Zhenya Huang
Enhong Chen
Qi Liu
Fuzheng Zhang
Xing Xie
Guoping Hu
19
22
0
27 May 2019
A Survey on Biomedical Image Captioning
A Survey on Biomedical Image Captioning
Vasiliki Kougia
John Pavlopoulos
Ion Androutsopoulos
MedIm
22
80
0
26 May 2019
Bivariate Beta-LSTM
Bivariate Beta-LSTM
Kyungwoo Song
Joonho Jang
Seung-Jae Shin
Il-Chul Moon
20
6
0
25 May 2019
Recent Advances in Neural Question Generation
Recent Advances in Neural Question Generation
Liangming Pan
Wenqiang Lei
Tat-Seng Chua
Min-Yen Kan
22
116
0
22 May 2019
Image Captioning based on Deep Learning Methods: A Survey
Image Captioning based on Deep Learning Methods: A Survey
Yiyu Wang
Jungang Xu
Yingfei Sun
Xianpei Han
VLM
18
7
0
20 May 2019
Multimodal Transformer with Multi-View Visual Representation for Image
  Captioning
Multimodal Transformer with Multi-View Visual Representation for Image Captioning
Jun-chen Yu
Jing Li
Zhou Yu
Qingming Huang
ViT
27
377
0
20 May 2019
Exact-K Recommendation via Maximal Clique Optimization
Exact-K Recommendation via Maximal Clique Optimization
Yu Gong
Yu Zhu
Lu Duan
Qingwen Liu
Ziyu Guan
Fei Sun
Wenwu Ou
Kenny Q. Zhu
OffRL
CML
37
59
0
17 May 2019
Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image
  Representations
Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations
Fenglin Liu
Yuanxin Liu
Xuancheng Ren
Xiaodong He
Xu Sun
VLM
34
81
0
15 May 2019
CutMix: Regularization Strategy to Train Strong Classifiers with
  Localizable Features
CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features
Sangdoo Yun
Dongyoon Han
Seong Joon Oh
Sanghyuk Chun
Junsuk Choe
Y. Yoo
OOD
439
4,705
0
13 May 2019
Exact Adversarial Attack to Image Captioning via Structured Output
  Learning with Latent Variables
Exact Adversarial Attack to Image Captioning via Structured Output Learning with Latent Variables
Yan Xu
Baoyuan Wu
Fumin Shen
Yanbo Fan
Yong Zhang
Heng Tao Shen
Wei Liu
AAML
33
55
0
10 May 2019
Memory-Attended Recurrent Network for Video Captioning
Memory-Attended Recurrent Network for Video Captioning
Wenjie Pei
Jiyuan Zhang
Xiangrong Wang
Lei Ke
Xiaoyong Shen
Yu-Wing Tai
22
200
0
10 May 2019
Sketch2code: Generating a website from a paper mockup
Sketch2code: Generating a website from a paper mockup
Alex Robinson
3DV
30
37
0
09 May 2019
ShapeGlot: Learning Language for Shape Differentiation
ShapeGlot: Learning Language for Shape Differentiation
Panos Achlioptas
Judy Fan
Robert D. Hawkins
Noah D. Goodman
Leonidas J. Guibas
36
83
0
08 May 2019
Image Captioning with Clause-Focused Metrics in a Multi-Modal Setting
  for Marketing
Image Captioning with Clause-Focused Metrics in a Multi-Modal Setting for Marketing
Philipp Harzig
D. Zecha
Rainer Lienhart
Carolin Kaiser
René Schallner
19
2
0
06 May 2019
PAL: A Wearable Platform for Real-time, Personalized and Context-Aware
  Health and Cognition Support
PAL: A Wearable Platform for Real-time, Personalized and Context-Aware Health and Cognition Support
Mina Khan
Glenn Fernandes
U. Sarawgi
Prudhvi Rampey
Pattie Maes
15
9
0
03 May 2019
Impact of Artificial Intelligence on Businesses: from Research,
  Innovation, Market Deployment to Future Shifts in Business Models
Impact of Artificial Intelligence on Businesses: from Research, Innovation, Market Deployment to Future Shifts in Business Models
N. Soni
E. Sharma
Narotam Singh
A. Kapoor
24
68
0
03 May 2019
AI-Powered Text Generation for Harmonious Human-Machine Interaction:
  Current State and Future Directions
AI-Powered Text Generation for Harmonious Human-Machine Interaction: Current State and Future Directions
Qiuyun Zhang
Bin Guo
Hao Wang
Yunji Liang
Shaoyang Hao
Zhiwen Yu
20
6
0
01 May 2019
Knowing When to Stop: Evaluation and Verification of Conformity to
  Output-size Specifications
Knowing When to Stop: Evaluation and Verification of Conformity to Output-size Specifications
Chenglong Wang
Rudy Bunel
Krishnamurthy Dvijotham
Po-Sen Huang
Edward Grefenstette
Pushmeet Kohli
30
5
0
26 Apr 2019
Pointing Novel Objects in Image Captioning
Pointing Novel Objects in Image Captioning
Yehao Li
Ting Yao
Yingwei Pan
Hongyang Chao
Tao Mei
33
69
0
25 Apr 2019
3G structure for image caption generation
3G structure for image caption generation
Aihong Yuan
Xuelong Li
Xiaoqiang Lu
21
34
0
21 Apr 2019
Multi-modal gated recurrent units for image description
Multi-modal gated recurrent units for image description
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
GAN
21
26
0
20 Apr 2019
Learning to Collocate Neural Modules for Image Captioning
Learning to Collocate Neural Modules for Image Captioning
Xu Yang
Hanwang Zhang
Jianfei Cai
27
77
0
18 Apr 2019
Visual Relationship Detection with Language prior and Softmax
Visual Relationship Detection with Language prior and Softmax
Jaewon Jung
Jongyoul Park
27
9
0
16 Apr 2019
Positional Encoding to Control Output Sequence Length
Positional Encoding to Control Output Sequence Length
Sho Takase
Naoaki Okazaki
17
110
0
16 Apr 2019
Self-critical n-step Training for Image Captioning
Self-critical n-step Training for Image Captioning
Junlong Gao
Shiqi Wang
Shanshe Wang
Siwei Ma
Wen Gao
22
55
0
15 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
Alex Schwing
Tamir Hazan
27
69
0
11 Apr 2019
Reasoning Visual Dialogs with Structural and Partial Observations
Reasoning Visual Dialogs with Structural and Partial Observations
Zilong Zheng
Wenguan Wang
Siyuan Qi
Song-Chun Zhu
39
117
0
11 Apr 2019
UniVSE: Robust Visual Semantic Embeddings via Structured Semantic
  Representations
UniVSE: Robust Visual Semantic Embeddings via Structured Semantic Representations
Hao Wu
Jiayuan Mao
Yufeng Zhang
Yuning Jiang
Lei Li
Weiwei Sun
Wei-Ying Ma
22
8
0
11 Apr 2019
Streamlined Dense Video Captioning
Streamlined Dense Video Captioning
Jonghwan Mun
L. Yang
Zhou Ren
N. Xu
Bohyung Han
28
137
0
08 Apr 2019
VATEX: A Large-Scale, High-Quality Multilingual Dataset for
  Video-and-Language Research
VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research
Xin Eric Wang
Jiawei Wu
Junkun Chen
Lei Li
Yuan-fang Wang
William Yang Wang
32
539
0
06 Apr 2019
Reducing catastrophic forgetting when evolving neural networks
Reducing catastrophic forgetting when evolving neural networks
Joseph Early
19
2
0
05 Apr 2019
Clinically Accurate Chest X-Ray Report Generation
Clinically Accurate Chest X-Ray Report Generation
Guanxiong Liu
T. Hsu
Matthew B. A. McDermott
Willie Boag
W. Weng
Peter Szolovits
Marzyeh Ghassemi
MedIm
39
271
0
04 Apr 2019
MMED: A Multi-domain and Multi-modality Event Dataset
MMED: A Multi-domain and Multi-modality Event Dataset
Zhenguo Yang
Zehang Lin
Min Cheng
Qing Li
Wenyin Liu
31
9
0
04 Apr 2019
Good News, Everyone! Context driven entity-aware captioning for news
  images
Good News, Everyone! Context driven entity-aware captioning for news images
Ali Furkan Biten
Lluís Gómez
Marçal Rusiñol
Dimosthenis Karatzas
27
139
0
02 Apr 2019
HYPE: A Benchmark for Human eYe Perceptual Evaluation of Generative
  Models
HYPE: A Benchmark for Human eYe Perceptual Evaluation of Generative Models
Sharon Zhou
Mitchell L. Gordon
Ranjay Krishna
Austin Narcomey
Li Fei-Fei
Michael S. Bernstein
VLM
EGVM
6
118
0
01 Apr 2019
Describing like humans: on diversity in image captioning
Describing like humans: on diversity in image captioning
Qingzhong Wang
Antoni B. Chan
27
98
0
28 Mar 2019
Information Maximizing Visual Question Generation
Information Maximizing Visual Question Generation
Ranjay Krishna
Michael S. Bernstein
Li Fei-Fei
23
95
0
27 Mar 2019
Improve Diverse Text Generation by Self Labeling Conditional Variational
  Auto Encoder
Improve Diverse Text Generation by Self Labeling Conditional Variational Auto Encoder
Yuchi Zhang
Yongliang Wang
Liping Zhang
Qing Cui
Kun Gai
22
19
0
26 Mar 2019
Knowledge-driven Encode, Retrieve, Paraphrase for Medical Image Report
  Generation
Knowledge-driven Encode, Retrieve, Paraphrase for Medical Image Report Generation
Yuan Li
Xiaodan Liang
Zhiting Hu
Eric Xing
MedIm
25
268
0
25 Mar 2019
End-to-End Learning Using Cycle Consistency for Image-to-Caption
  Transformations
End-to-End Learning Using Cycle Consistency for Image-to-Caption Transformations
Keisuke Hagiwara
Yusuke Mukuta
Tatsuya Harada
24
0
0
25 Mar 2019
Comparison of State-of-the-Art Deep Learning APIs for Image Multi-Label
  Classification using Semantic Metrics
Comparison of State-of-the-Art Deep Learning APIs for Image Multi-Label Classification using Semantic Metrics
Adam Kubany
Shimon Ben Ishay
Ruben-sacha Ohayon
A. Shmilovici
Lior Rokach
Tomer Doitshman
VLM
11
2
0
21 Mar 2019
Neural Sequential Phrase Grounding (SeqGROUND)
Neural Sequential Phrase Grounding (SeqGROUND)
Pelin Dogan
Leonid Sigal
Markus Gross
ObjD
33
52
0
18 Mar 2019
Boosted Attention: Leveraging Human Attention for Image Captioning
Boosted Attention: Leveraging Human Attention for Image Captioning
Shi Chen
Qi Zhao
30
47
0
18 Mar 2019
Effects of padding on LSTMs and CNNs
Effects of padding on LSTMs and CNNs
Dwarampudi Mahidhar Reddy
Subba Reddy
19
96
0
18 Mar 2019
A Weighted Multi-Criteria Decision Making Approach for Image Captioning
A Weighted Multi-Criteria Decision Making Approach for Image Captioning
Hassan Maleki Galandouz
M. Moghaddam
M. Shamsfard
24
0
0
17 Mar 2019
Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for
  Sampling Sequences Without Replacement
Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement
W. Kool
H. V. Hoof
Max Welling
71
216
0
14 Mar 2019
Dense Relational Captioning: Triple-Stream Networks for
  Relationship-Based Captioning
Dense Relational Captioning: Triple-Stream Networks for Relationship-Based Captioning
Dong-Jin Kim
Jinsoo Choi
Tae-Hyun Oh
In So Kweon
14
84
0
14 Mar 2019
Previous
123...222324...394041
Next