ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1411.4555
  4. Cited By
Show and Tell: A Neural Image Caption Generator

Show and Tell: A Neural Image Caption Generator

17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
    3DV
ArXivPDFHTML

Papers citing "Show and Tell: A Neural Image Caption Generator"

50 / 2,023 papers shown
Title
Situation Recognition with Graph Neural Networks
Situation Recognition with Graph Neural Networks
Ruiyu Li
Makarand Tapaswi
Renjie Liao
Jiaya Jia
R. Urtasun
Sanja Fidler
GNN
24
131
0
14 Aug 2017
Recurrent Filter Learning for Visual Tracking
Recurrent Filter Learning for Visual Tracking
Tianyu Yang
Antoni B. Chan
VOT
27
84
0
13 Aug 2017
Early Stage Malware Prediction Using Recurrent Neural Networks
Early Stage Malware Prediction Using Recurrent Neural Networks
Matilda Rhode
Pete Burnap
K. Jones
AAML
22
253
0
11 Aug 2017
TandemNet: Distilling Knowledge from Medical Images Using Diagnostic
  Reports as Optional Semantic References
TandemNet: Distilling Knowledge from Medical Images Using Diagnostic Reports as Optional Semantic References
Zizhao Zhang
Pingjun Chen
Manish Sapkota
Ling Yang
MedIm
18
67
0
10 Aug 2017
Hierarchically-Attentive RNN for Album Summarization and Storytelling
Hierarchically-Attentive RNN for Album Summarization and Storytelling
Licheng Yu
Joey Tianyi Zhou
Tamara L. Berg
41
66
0
09 Aug 2017
Learning to Disambiguate by Asking Discriminative Questions
Learning to Disambiguate by Asking Discriminative Questions
Yining Li
Chen Huang
Xiaoou Tang
Chen Change Loy
18
22
0
09 Aug 2017
Tips and Tricks for Visual Question Answering: Learnings from the 2017
  Challenge
Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge
Damien Teney
Peter Anderson
Xiaodong He
Anton Van Den Hengel
50
381
0
09 Aug 2017
Recent Trends in Deep Learning Based Natural Language Processing
Recent Trends in Deep Learning Based Natural Language Processing
Tom Young
Devamanyu Hazarika
Soujanya Poria
Min Zhang
35
2,824
0
09 Aug 2017
Weakly Supervised Image Annotation and Segmentation with Objects and
  Attributes
Weakly Supervised Image Annotation and Segmentation with Objects and Attributes
Zhiyuan Shi
Yongxin Yang
Timothy M. Hospedales
Tao Xiang
21
46
0
08 Aug 2017
What is the Role of Recurrent Neural Networks (RNNs) in an Image Caption
  Generator?
What is the Role of Recurrent Neural Networks (RNNs) in an Image Caption Generator?
Marc Tanti
Albert Gatt
K. Camilleri
24
56
0
07 Aug 2017
Amulet: Aggregating Multi-level Convolutional Features for Salient
  Object Detection
Amulet: Aggregating Multi-level Convolutional Features for Salient Object Detection
Pingping Zhang
D. Wang
Huchuan Lu
Hongyu Wang
Xiang Ruan
16
734
0
07 Aug 2017
Identity-Aware Textual-Visual Matching with Latent Co-attention
Identity-Aware Textual-Visual Matching with Latent Co-attention
Shuang Li
Tong Xiao
Hongsheng Li
Wei Yang
Xiaogang Wang
22
227
0
07 Aug 2017
PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel
  Pairwise R-FCN
PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN
Hanwang Zhang
Zawlin Kyaw
Jinyang Yu
Shih-Fu Chang
22
141
0
07 Aug 2017
Learning to Infer Graphics Programs from Hand-Drawn Images
Learning to Infer Graphics Programs from Hand-Drawn Images
Kevin Ellis
Daniel E. Ritchie
Armando Solar-Lezama
J. Tenenbaum
NAI
13
226
0
30 Jul 2017
Graph Classification with 2D Convolutional Neural Networks
Graph Classification with 2D Convolutional Neural Networks
A. Tixier
Giannis Nikolentzos
Polykarpos Meladianos
Michalis Vazirgiannis
GNN
15
23
0
29 Jul 2017
Men Also Like Shopping: Reducing Gender Bias Amplification using
  Corpus-level Constraints
Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints
Jieyu Zhao
Tianlu Wang
Mark Yatskar
Vicente Ordonez
Kai-Wei Chang
FaML
32
964
0
29 Jul 2017
Deep Co-Space: Sample Mining Across Feature Transformation for
  Semi-Supervised Learning
Deep Co-Space: Sample Mining Across Feature Transformation for Semi-Supervised Learning
Ziliang Chen
Keze Wang
Tianlin Li
Pai Peng
E. Izquierdo
Liang Lin
34
9
0
28 Jul 2017
TensorLayer: A Versatile Library for Efficient Deep Learning Development
TensorLayer: A Versatile Library for Efficient Deep Learning Development
Hao Dong
A. Supratak
Luo Mai
Fangde Liu
A. Oehmichen
Simiao Yu
Yike Guo
59
114
0
26 Jul 2017
Deep Interactive Region Segmentation and Captioning
Deep Interactive Region Segmentation and Captioning
Ali Sharifi Boroujerdi
M. Khanian
M. Breuß
24
7
0
26 Jul 2017
Bottom-Up and Top-Down Attention for Image Captioning and Visual
  Question Answering
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
Peter Anderson
Xiaodong He
Chris Buehler
Damien Teney
Mark Johnson
Stephen Gould
Lei Zhang
AIMat
61
4,184
0
25 Jul 2017
Image Pivoting for Learning Multilingual Multimodal Representations
Image Pivoting for Learning Multilingual Multimodal Representations
Spandana Gella
Rico Sennrich
Frank Keller
Mirella Lapata
SSL
38
78
0
24 Jul 2017
OBJ2TEXT: Generating Visually Descriptive Language from Object Layouts
OBJ2TEXT: Generating Visually Descriptive Language from Object Layouts
Xuwang Yin
Vicente Ordonez
VLM
40
55
0
22 Jul 2017
cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600
  Papers Survey
cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600 Papers Survey
Hirokatsu Kataoka
Soma Shirakabe
Yun He
S. Ueta
Teppei Suzuki
...
Ryousuke Takasawa
Masataka Fuchida
Yudai Miyashita
Kazushige Okayasu
Yuta Matsuzaki
30
1
0
20 Jul 2017
Learning Visually Grounded Sentence Representations
Learning Visually Grounded Sentence Representations
Douwe Kiela
Alexis Conneau
Allan Jabri
Maximilian Nickel
SSL
29
69
0
19 Jul 2017
Grounding Spatio-Semantic Referring Expressions for Human-Robot
  Interaction
Grounding Spatio-Semantic Referring Expressions for Human-Robot Interaction
Mohit Shridhar
David Hsu
ObjD
27
20
0
18 Jul 2017
Auto-Conditioned Recurrent Networks for Extended Complex Human Motion
  Synthesis
Auto-Conditioned Recurrent Networks for Extended Complex Human Motion Synthesis
Zimo Li
Yi Zhou
Shuangjiu Xiao
C. He
Zeng Huang
Hao Li
3DH
27
47
0
17 Jul 2017
Knowledge-Guided Recurrent Neural Network Learning for Task-Oriented
  Action Prediction
Knowledge-Guided Recurrent Neural Network Learning for Task-Oriented Action Prediction
Liang Lin
Lili Huang
Tianshui Chen
Yukang Gan
Hui Cheng
20
16
0
15 Jul 2017
Large-scale Video Classification guided by Batch Normalized LSTM
  Translator
Large-scale Video Classification guided by Batch Normalized LSTM Translator
Jae Hyeon Yoo
VLM
20
11
0
13 Jul 2017
Deep Fisher Discriminant Learning for Mobile Hand Gesture Recognition
Deep Fisher Discriminant Learning for Mobile Hand Gesture Recognition
Chunyu Xie
Ce Li
Baochang Zhang
Chong Chen
Jungong Han
HAI
27
64
0
12 Jul 2017
Automatic Understanding of Image and Video Advertisements
Automatic Understanding of Image and Video Advertisements
Zaeem Hussain
Ruotong Wang
Xiaozhong Zhang
Keren Ye
Christopher Thomas
Zuha Agha
Nathan Ong
Adriana Kovashka
DiffM
22
161
0
10 Jul 2017
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis
  Network
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network
Zizhao Zhang
Yuanpu Xie
Fuyong Xing
M. McGough
Ling Yang
MedIm
21
301
0
08 Jul 2017
ShuffleNet: An Extremely Efficient Convolutional Neural Network for
  Mobile Devices
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
Xiangyu Zhang
Xinyu Zhou
Mengxiao Lin
Jian Sun
AI4TS
81
6,792
0
04 Jul 2017
Where to Play: Retrieval of Video Segments using Natural-Language
  Queries
Where to Play: Retrieval of Video Segments using Natural-Language Queries
Sangkuk Lee
Daesik Kim
Myunggi Lee
Jihye Hwang
Nojun Kwak
38
3
0
02 Jul 2017
Automated Audio Captioning with Recurrent Neural Networks
Automated Audio Captioning with Recurrent Neural Networks
K. Drossos
Sharath Adavanne
Tuomas Virtanen
25
128
0
30 Jun 2017
Actor-Critic Sequence Training for Image Captioning
Actor-Critic Sequence Training for Image Captioning
Li Zhang
Flood Sung
Feng Liu
Tao Xiang
S. Gong
Yongxin Yang
Timothy M. Hospedales
24
111
0
29 Jun 2017
Paying More Attention to Saliency: Image Captioning with Saliency and
  Context Attention
Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention
Marcella Cornia
Lorenzo Baraldi
Giuseppe Serra
Rita Cucchiara
33
79
0
26 Jun 2017
Skeleton-Based Action Recognition Using Spatio-Temporal LSTM Network
  with Trust Gates
Skeleton-Based Action Recognition Using Spatio-Temporal LSTM Network with Trust Gates
Jun Liu
Amir Shahroudy
Dong Xu
Alex C. Kot
G. Wang
25
452
0
26 Jun 2017
Neural-based Natural Language Generation in Dialogue using RNN
  Encoder-Decoder with Semantic Aggregation
Neural-based Natural Language Generation in Dialogue using RNN Encoder-Decoder with Semantic Aggregation
Van-Khanh Tran
Le-Minh Nguyen
31
33
0
21 Jun 2017
Using Artificial Tokens to Control Languages for Multilingual Image
  Caption Generation
Using Artificial Tokens to Control Languages for Multilingual Image Caption Generation
Satoshi Tsutsui
David J. Crandall
19
19
0
20 Jun 2017
An online sequence-to-sequence model for noisy speech recognition
An online sequence-to-sequence model for noisy speech recognition
Chung-Cheng Chiu
Dieterich Lawson
Yuping Luo
George Tucker
Kevin Swersky
Ilya Sutskever
Navdeep Jaitly
19
7
0
16 Jun 2017
Deep Learning Methods for Efficient Large Scale Video Labeling
Deep Learning Methods for Efficient Large Scale Video Labeling
Miha Škalič
M. Pekalski
Xin Pan
VLM
18
17
0
14 Jun 2017
Evaluating Personal Assistants on Mobile devices
Evaluating Personal Assistants on Mobile devices
Julia Kiseleva
Maarten de Rijke
11
21
0
14 Jun 2017
SEARNN: Training RNNs with Global-Local Losses
SEARNN: Training RNNs with Global-Local Losses
Rémi Leblond
Jean-Baptiste Alayrac
A. Osokin
Simon Lacoste-Julien
27
52
0
14 Jun 2017
Teaching Compositionality to CNNs
Teaching Compositionality to CNNs
Austin Stone
Hua-Yan Wang
Michael Stark
Yi Liu
D. Phoenix
Dileep George
CoGe
16
54
0
14 Jun 2017
Image Captioning with Object Detection and Localization
Image Captioning with Object Detection and Localization
Zhongliang Yang
Yujin Zhang
S. Rehman
Yongfeng Huang
ObjD
VLM
30
47
0
08 Jun 2017
Best of Both Worlds: Transferring Knowledge from Discriminative Learning
  to a Generative Visual Dialog Model
Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model
Jiasen Lu
A. Kannan
Jianwei Yang
Devi Parikh
Dhruv Batra
BDL
38
136
0
05 Jun 2017
Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning
Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning
Jingkuan Song
Zhao Guo
Lianli Gao
Wu Liu
Dongxiang Zhang
Heng Tao Shen
48
166
0
05 Jun 2017
Order embeddings and character-level convolutions for multimodal
  alignment
Order embeddings and character-level convolutions for multimodal alignment
Jonatas Wehrmann
Anderson Mattjie
Rodrigo C. Barros
28
27
0
03 Jun 2017
See, Hear, and Read: Deep Aligned Representations
See, Hear, and Read: Deep Aligned Representations
Y. Aytar
Carl Vondrick
Antonio Torralba
VLM
AI4TS
12
136
0
03 Jun 2017
Natural Language Generation for Spoken Dialogue System using RNN
  Encoder-Decoder Networks
Natural Language Generation for Spoken Dialogue System using RNN Encoder-Decoder Networks
Van-Khanh Tran
Le-Minh Nguyen
41
41
0
01 Jun 2017
Previous
123...313233...394041
Next