Show and Tell: A Neural Image Caption Generator

17 November 2014

Papers citing "Show and Tell: A Neural Image Caption Generator"

50 / 2,022 papers shown

Learning Deep Structure-Preserving Image-Text Embeddings Liwei Wang Yin Li Svetlana Lazebnik 35 780 0 19 Nov 2015
Reducing Overfitting in Deep Networks by Decorrelating Representations Michael Cogswell Faruk Ahmed Ross B. Girshick C. L. Zitnick Dhruv Batra 23 411 0 19 Nov 2015
ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering Kan Chen Jiang Wang Liang-Chieh Chen Haoyuan Gao Wenyuan Xu Ram Nevatia 22 287 0 18 Nov 2015
Learning Articulated Motion Models from Visual and Lingual Signals Zhengyang Wu Mohit Bansal Matthew R. Walter 22 0 0 17 Nov 2015
Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data Lisa Anne Hendricks Subhashini Venugopalan Marcus Rohrbach Raymond J. Mooney Kate Saenko Trevor Darrell CoGe 16 284 0 17 Nov 2015
Recurrent Neural Networks Hardware Implementation on FPGA Andre Xian Ming Chang B. Martini Eugenio Culurciello 11 126 0 17 Nov 2015
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering Huijuan Xu Kate Saenko 24 760 0 17 Nov 2015
How (not) to Train your Generative Model: Scheduled Sampling, Likelihood, Adversary? Ferenc Huszár OOD DiffM GAN 17 296 0 16 Nov 2015
Yin and Yang: Balancing and Answering Binary Visual Questions Peng Zhang Yash Goyal D. Summers-Stay Dhruv Batra Devi Parikh CoGe 19 349 0 16 Nov 2015
Sherlock: Scalable Fact Learning in Images Mohamed Elhoseiny Scott D. Cohen W. Chang Brian L. Price Ahmed Elgammal 19 26 0 16 Nov 2015
A Neural Transducer Navdeep Jaitly David Sussillo Quoc V. Le Oriol Vinyals Ilya Sutskever Samy Bengio AI4TS 14 48 0 16 Nov 2015
Neural Programmer: Inducing Latent Programs with Gradient Descent Arvind Neelakantan Quoc V. Le Ilya Sutskever ODL 27 260 0 16 Nov 2015
Uncovering Temporal Context for Video Question and Answering Linchao Zhu Zhongwen Xu Yi Yang Alexander G. Hauptmann BDL 19 44 0 15 Nov 2015
Oracle performance for visual captioning L. Yao Nicolas Ballas Kyunghyun Cho John R. Smith Yoshua Bengio VLM 31 8 0 14 Nov 2015
Symbol Grounding Association in Multimodal Sequences with Missing Elements Federico Raue Andreas Dengel Thomas Breuel Marcus Liwicki 11 1 0 13 Nov 2015
Sequence to Sequence Learning for Optical Character Recognition D. Sahu Mohak Sukhwani 14 13 0 13 Nov 2015
Natural Language Object Retrieval Ronghang Hu Huazhe Xu Marcus Rohrbach Jiashi Feng Kate Saenko Trevor Darrell ObjD 32 551 0 13 Nov 2015
Action Recognition using Visual Attention Shikhar Sharma Ryan Kiros Ruslan Salakhutdinov 24 666 0 12 Nov 2015
Improving performance of recurrent neural network with relu nonlinearity S. Talathi Aniket A. Vartak ODL 16 87 0 12 Nov 2015
Grounding of Textual Phrases in Images by Reconstruction Anna Rohrbach Marcus Rohrbach Ronghang Hu Trevor Darrell Bernt Schiele 18 494 0 12 Nov 2015
Deep Multimodal Semantic Embeddings for Speech and Images David Harwath James R. Glass 10 155 0 11 Nov 2015
Generative Concatenative Nets Jointly Learn to Write and Classify Reviews Zachary Chase Lipton Sharad Vikram Julian McAuley BDL 25 32 0 11 Nov 2015
Learning to Diagnose with LSTM Recurrent Neural Networks Zachary Chase Lipton David C. Kale Charles Elkan R. Wetzel 14 1,095 0 11 Nov 2015
Visual7W: Grounded Question Answering in Images Yuke Zhu Oliver Groth Michael S. Bernstein Li Fei-Fei 44 870 0 11 Nov 2015
From Images to Sentences through Scene Description Graphs using Commonsense Reasoning and Knowledge Somak Aditya Yezhou Yang Chitta Baral Cornelia Fermuller Yiannis Aloimonos 3DV 11 69 0 10 Nov 2015
Generating Images from Captions with Attention Elman Mansimov Emilio Parisotto Jimmy Lei Ba Ruslan Salakhutdinov VLM 40 449 0 09 Nov 2015
Generation and Comprehension of Unambiguous Object Descriptions Junhua Mao Jonathan Huang Alexander Toshev Oana-Maria Camburu Alan Yuille Kevin Patrick Murphy ObjD 31 1,310 0 07 Nov 2015
Stacked Attention Networks for Image Question Answering Zichao Yang Xiaodong He Jianfeng Gao Li Deng Alex Smola BDL 36 1,867 0 07 Nov 2015
Semi-supervised Sequence Learning Andrew M. Dai Quoc V. Le SSL 13 1,230 0 04 Nov 2015
Top-down Tree Long Short-Term Memory Networks Xingxing Zhang Liang Lu Mirella Lapata AIMat 17 101 0 31 Oct 2015
Generating Text with Deep Reinforcement Learning Hongyu Guo AIMat 17 50 0 30 Oct 2015
Deep Recurrent Regression for Facial Landmark Detection Hanjiang Lai Shengtao Xiao Yan Pan Zhen Cui Jiashi Feng Chunyan Xu Jian Yin Shuicheng Yan 3DV CVBM 31 57 0 30 Oct 2015
Learning Multi-Domain Convolutional Neural Networks for Visual Tracking Hyeonseob Nam Bohyung Han 49 2,464 0 27 Oct 2015
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks Haonan Yu Jiang Wang Zhiheng Huang Yi Yang Wenyuan Xu 42 560 0 26 Oct 2015
Phenotyping of Clinical Time Series with LSTM Recurrent Neural Networks Zachary Chase Lipton David C. Kale R. Wetzel 22 38 0 26 Oct 2015
Multilingual Image Description with Neural Sequence Models Desmond Elliott Stella Frank Eva Hasler VLM 22 75 0 15 Oct 2015
Resolving References to Objects in Photographs using the Words-As-Classifiers Model David Schlangen Sina Zarrieß C. Kennington 23 48 0 07 Oct 2015
SentiCap: Generating Image Descriptions with Sentiments A. Mathews Lexing Xie Xuming He 23 221 0 06 Oct 2015
Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing Hamid Izadinia Fereshteh Sadeghi S. Divvala Yejin Choi Ali Farhadi VLM 12 20 0 27 Sep 2015
Guiding Long-Short Term Memory for Image Caption Generation Xu Jia E. Gavves Basura Fernando Tinne Tuytelaars VLM 22 101 0 16 Sep 2015
Deep Learning Applied to Image and Text Matching Afroze Ibrahim Baqapuri VLM 16 0 0 14 Sep 2015
Structured Prediction with Output Embeddings for Semantic Image Annotation A. Quattoni Arnau Ramisa Pranava Swaroop Madhyastha E. Simo-Serra Francesc Moreno-Noguer 14 2 0 07 Sep 2015
Object Recognition from Short Videos for Robotic Perception Ivan Bogun A. Angelova Navdeep Jaitly 12 8 0 04 Sep 2015
What to talk about and how? Selective Generation using LSTMs with Coarse-to-Fine Alignment Hongyuan Mei Mohit Bansal Matthew R. Walter 22 288 0 02 Sep 2015
SentenceRacer: A Game with a Purpose for Image Sentence Annotation Kenji Hata Sherman Leung Ranjay Krishna Michael S. Bernstein Li Fei-Fei 26 0 0 27 Aug 2015
The SP theory of intelligence: distinctive features and advantages J. G. Wolff 8 31 0 17 Aug 2015
Image Representations and New Domains in Neural Image Captioning Jack Hessel Nicolas Savva Michael J. Wilber VLM 22 16 0 09 Aug 2015
Listen, Attend and Spell William Chan Navdeep Jaitly Quoc V. Le Oriol Vinyals RALM 50 2,250 0 05 Aug 2015
Recurrent Network Models for Human Dynamics Katerina Fragkiadaki Sergey Levine Panna Felsen Jitendra Malik 28 30 0 02 Aug 2015
Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models Iulian Serban Alessandro Sordoni Yoshua Bengio Aaron Courville Joelle Pineau AILaw 16 1,747 0 17 Jul 2015