ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1411.4555
  4. Cited By
Show and Tell: A Neural Image Caption Generator

Show and Tell: A Neural Image Caption Generator

17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
    3DV
ArXivPDFHTML

Papers citing "Show and Tell: A Neural Image Caption Generator"

50 / 2,023 papers shown
Title
Multi-Task Spatiotemporal Neural Networks for Structured Surface
  Reconstruction
Multi-Task Spatiotemporal Neural Networks for Structured Surface Reconstruction
Mingze Xu
Chenyou Fan
John Paden
Geoffrey C. Fox
David J. Crandall
23
13
0
11 Jan 2018
Foreground Segmentation Using a Triplet Convolutional Neural Network for
  Multiscale Feature Encoding
Foreground Segmentation Using a Triplet Convolutional Neural Network for Multiscale Feature Encoding
Long Ang Lim
H. Keles
22
197
0
07 Jan 2018
Deep Bidirectional and Unidirectional LSTM Recurrent Neural Network for
  Network-wide Traffic Speed Prediction
Deep Bidirectional and Unidirectional LSTM Recurrent Neural Network for Network-wide Traffic Speed Prediction
Zhiyong Cui
Ruimin Ke
Ziyuan Pu
Yinhai Wang
AI4TS
8
411
0
07 Jan 2018
Visual Text Correction
Visual Text Correction
Amir Mazaheri
M. Shah
52
11
0
06 Jan 2018
From Eliza to XiaoIce: Challenges and Opportunities with Social Chatbots
From Eliza to XiaoIce: Challenges and Opportunities with Social Chatbots
H. Shum
Xiaodong He
Di Li
27
549
0
06 Jan 2018
Deep Learning: A Critical Appraisal
Deep Learning: A Critical Appraisal
G. Marcus
HAI
VLM
50
1,034
0
02 Jan 2018
Learning Continuous User Representations through Hybrid Filtering with
  doc2vec
Learning Continuous User Representations through Hybrid Filtering with doc2vec
Simon Stiebellehner
Jun Wang
Shuai Yuan
24
6
0
31 Dec 2017
Exploring Models and Data for Remote Sensing Image Caption Generation
Exploring Models and Data for Remote Sensing Image Caption Generation
Xiaoqiang Lu
Binqiang Wang
Xiangtao Zheng
Xuelong Li
32
463
0
21 Dec 2017
Learning to Act Properly: Predicting and Explaining Affordances from
  Images
Learning to Act Properly: Predicting and Explaining Affordances from Images
Ching-Yao Chuang
Jiaman Li
Antonio Torralba
Sanja Fidler
24
101
0
20 Dec 2017
Synthesizing Novel Pairs of Image and Text
Synthesizing Novel Pairs of Image and Text
Jason Xie
Tingwen Bao
6
0
0
18 Dec 2017
Video Object Detection with an Aligned Spatial-Temporal Memory
Video Object Detection with an Aligned Spatial-Temporal Memory
Fanyi Xiao
Yong Jae Lee
49
189
0
18 Dec 2017
Learning Compact Recurrent Neural Networks with Block-Term Tensor
  Decomposition
Learning Compact Recurrent Neural Networks with Block-Term Tensor Decomposition
Jinmian Ye
Linnan Wang
Guangxi Li
Di Chen
Shandian Zhe
Xinqi Chu
Zenglin Xu
29
132
0
14 Dec 2017
Network Analysis for Explanation
Network Analysis for Explanation
Hiroshi Kuwajima
Masayuki Tanaka
FAtt
9
3
0
07 Dec 2017
Why Do Neural Dialog Systems Generate Short and Meaningless Replies? A
  Comparison between Dialog and Translation
Why Do Neural Dialog Systems Generate Short and Meaningless Replies? A Comparison between Dialog and Translation
Bolin Wei
Shuai Lu
Lili Mou
Hao Zhou
Pascal Poupart
Ge Li
Zhi Jin
35
29
0
06 Dec 2017
Attacking Visual Language Grounding with Adversarial Examples: A Case
  Study on Neural Image Captioning
Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning
Hongge Chen
Huan Zhang
Pin-Yu Chen
Jinfeng Yi
Cho-Jui Hsieh
GAN
AAML
35
49
0
06 Dec 2017
Examining Cooperation in Visual Dialog Models
Examining Cooperation in Visual Dialog Models
Mircea Mironenco
D. Kianfar
Ke M. Tran
Evangelos Kanoulas
E. Gavves
28
4
0
04 Dec 2017
SERKET: An Architecture for Connecting Stochastic Models to Realize a
  Large-Scale Cognitive Model
SERKET: An Architecture for Connecting Stochastic Models to Realize a Large-Scale Cognitive Model
Tomoaki Nakamura
Takayuki Nagai
T. Taniguchi
3DV
18
44
0
04 Dec 2017
Recurrent Neural Networks for Semantic Instance Segmentation
Recurrent Neural Networks for Semantic Instance Segmentation
Amaia Salvador
Míriam Bellver
Victor Campos
Manel Baradad
F. Marqués
Jordi Torres
Xavier Giró-i-Nieto
SSeg
24
62
0
02 Dec 2017
Improving Visually Grounded Sentence Representations with Self-Attention
Improving Visually Grounded Sentence Representations with Self-Attention
Kang Min Yoo
Youhyun Shin
Sang-goo Lee
34
5
0
02 Dec 2017
A Perceptual Measure for Deep Single Image Camera Calibration
A Perceptual Measure for Deep Single Image Camera Calibration
Yannick Hold-Geoffroy
Kalyan Sunkavalli
Jonathan Eisenmann
Matt Fisher
Emiliano Gambaretto
Sunil Hadap
Jean-François Lalonde
3DV
32
106
0
02 Dec 2017
Visual Features for Context-Aware Speech Recognition
Visual Features for Context-Aware Speech Recognition
Abhinav Gupta
Yajie Miao
Leonardo Neves
Florian Metze
25
42
0
01 Dec 2017
Multimodal Attribute Extraction
Multimodal Attribute Extraction
Robert L Logan IV
Samuel Humeau
Sameer Singh
21
27
0
29 Nov 2017
Learning to cluster in order to transfer across domains and tasks
Learning to cluster in order to transfer across domains and tasks
Yen-Chang Hsu
Zhaoyang Lv
Z. Kira
OOD
40
216
0
28 Nov 2017
End-to-end Adversarial Learning for Generative Conversational Agents
End-to-end Adversarial Learning for Generative Conversational Agents
Oswaldo Ludwig
GAN
16
9
0
28 Nov 2017
Unsupervised Domain Adaptation with Similarity Learning
Unsupervised Domain Adaptation with Similarity Learning
Pedro H. O. Pinheiro
SSL
OOD
41
269
0
24 Nov 2017
Self-view Grounding Given a Narrated 360° Video
Self-view Grounding Given a Narrated 360° Video
Shih-Han Chou
Yi-Chun Chen
Kuo-Hao Zeng
Hou-Ning Hu
Jianlong Fu
Min Sun
19
4
0
23 Nov 2017
On the Automatic Generation of Medical Imaging Reports
On the Automatic Generation of Medical Imaging Reports
Baoyu Jing
P. Xie
Eric Xing
MedIm
35
504
0
22 Nov 2017
The Devil is in the Middle: Exploiting Mid-level Representations for
  Cross-Domain Instance Matching
The Devil is in the Middle: Exploiting Mid-level Representations for Cross-Domain Instance Matching
Qian Yu
Xiaobin Chang
Yi-Zhe Song
Tao Xiang
Timothy M. Hospedales
24
91
0
22 Nov 2017
Integrating both Visual and Audio Cues for Enhanced Video Caption
Wangli Hao
Zhaoxiang Zhang
He Guan
Guibo Zhu
42
36
0
22 Nov 2017
Are You Talking to Me? Reasoned Visual Dialog Generation through
  Adversarial Learning
Are You Talking to Me? Reasoned Visual Dialog Generation through Adversarial Learning
Qi Wu
Peng Wang
Chunhua Shen
Ian Reid
Anton Van Den Hengel
GAN
35
129
0
21 Nov 2017
E-PUR: An Energy-Efficient Processing Unit for Recurrent Neural Networks
E-PUR: An Energy-Efficient Processing Unit for Recurrent Neural Networks
Franyell Silfa
Gem Dot
J. Arnau
Antonio González
33
39
0
20 Nov 2017
Diverse and Accurate Image Description Using a Variational Auto-Encoder
  with an Additive Gaussian Encoding Space
Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space
Liwei Wang
Alex Schwing
Svetlana Lazebnik
CoGe
37
175
0
19 Nov 2017
ADVISE: Symbolism and External Knowledge for Decoding Advertisements
ADVISE: Symbolism and External Knowledge for Decoding Advertisements
Keren Ye
Adriana Kovashka
35
50
0
17 Nov 2017
Neural Motifs: Scene Graph Parsing with Global Context
Neural Motifs: Scene Graph Parsing with Global Context
Rowan Zellers
Mark Yatskar
Sam Thomson
Yejin Choi
GNN
44
983
0
17 Nov 2017
ATRank: An Attention-Based User Behavior Modeling Framework for
  Recommendation
ATRank: An Attention-Based User Behavior Modeling Framework for Recommendation
Chang Zhou
Jinze Bai
Junshuai Song
Xiaofei Liu
Zhengchao Zhao
Xiusi Chen
Jun Gao
HAI
41
306
0
17 Nov 2017
AI Challenger : A Large-scale Dataset for Going Deeper in Image
  Understanding
AI Challenger : A Large-scale Dataset for Going Deeper in Image Understanding
Jiahong Wu
He Zheng
Bo Zhao
Yixin Li
Baoming Yan
...
Shipei Zhou
G. Lin
Yanwei Fu
Yizhou Wang
Yonggang Wang
VLM
38
149
0
17 Nov 2017
Parallel Attention: A Unified Framework for Visual Object Discovery
  through Dialogs and Queries
Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries
Bohan Zhuang
Qi Wu
Chunhua Shen
Ian Reid
Anton Van Den Hengel
ObjD
27
134
0
17 Nov 2017
Language-Based Image Editing with Recurrent Attentive Models
Language-Based Image Editing with Recurrent Attentive Models
Jianbo Chen
Yelong Shen
Jianfeng Gao
Jingjing Liu
Xiaodong Liu
35
122
0
16 Nov 2017
A Novel Framework for Robustness Analysis of Visual QA Models
A Novel Framework for Robustness Analysis of Visual QA Models
Jia-Hong Huang
Cuong Duc Dao
Modar Alfadly
Guohao Li
AAML
OOD
30
34
0
16 Nov 2017
Interpreting Deep Visual Representations via Network Dissection
Interpreting Deep Visual Representations via Network Dissection
Bolei Zhou
David Bau
A. Oliva
Antonio Torralba
FAtt
MILM
29
323
0
15 Nov 2017
Dual-Path Convolutional Image-Text Embeddings with Instance Loss
Dual-Path Convolutional Image-Text Embeddings with Instance Loss
Zhedong Zheng
Liang Zheng
Michael Garrett
Yi Yang
Mingliang Xu
Yi-Dong Shen
27
470
0
15 Nov 2017
Controllable Abstractive Summarization
Controllable Abstractive Summarization
Angela Fan
David Grangier
Michael Auli
42
306
0
14 Nov 2017
Improving Factor-Based Quantitative Investing by Forecasting Company
  Fundamentals
Improving Factor-Based Quantitative Investing by Forecasting Company Fundamentals
J. Alberg
Zachary Chase Lipton
AI4TS
37
48
0
13 Nov 2017
Building machines that adapt and compute like brains
Building machines that adapt and compute like brains
Brenden M. Lake
J. Tenenbaum
AI4CE
FedML
NAI
AILaw
254
888
0
11 Nov 2017
End-to-end Video-level Representation Learning for Action Recognition
End-to-end Video-level Representation Learning for Action Recognition
Jiagang Zhu
Wei Zou
Zheng Zhu
25
89
0
11 Nov 2017
Phrase-based Image Captioning with Hierarchical LSTM Model
Phrase-based Image Captioning with Hierarchical LSTM Model
Y. Tan
Chee Seng Chan
VLM
26
4
0
11 Nov 2017
Neural-Symbolic Learning and Reasoning: A Survey and Interpretation
Neural-Symbolic Learning and Reasoning: A Survey and Interpretation
Tarek R. Besold
Artur Garcez
Sebastian Bader
Howard L. Bowman
Pedro M. Domingos
...
P. Lima
L. Penning
Gadi Pinkas
Hoifung Poon
Gerson Zaverucha
LRM
AI4CE
28
332
0
10 Nov 2017
Object Referring in Visual Scene with Spoken Language
Object Referring in Visual Scene with Spoken Language
A. Vasudevan
Dengxin Dai
Luc Van Gool
37
18
0
10 Nov 2017
DLPaper2Code: Auto-generation of Code from Deep Learning Research Papers
DLPaper2Code: Auto-generation of Code from Deep Learning Research Papers
Akshay Sethi
A. Sankaran
Naveen Panwar
Shreya Khare
Senthil Mani
3DV
23
33
0
09 Nov 2017
Image Captioning and Classification of Dangerous Situations
Image Captioning and Classification of Dangerous Situations
Octavio Arriaga
Paul G. Plöger
Matias Valdenegro-Toro
27
8
0
07 Nov 2017
Previous
123...293031...394041
Next