ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1609.06647
  4. Cited By
Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning
  Challenge

Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge

21 September 2016
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
ArXivPDFHTML

Papers citing "Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge"

41 / 91 papers shown
Title
Semantic Object Accuracy for Generative Text-to-Image Synthesis
Semantic Object Accuracy for Generative Text-to-Image Synthesis
Tobias Hinz
Stefan Heinrich
S. Wermter
EGVM
24
158
0
29 Oct 2019
Compositional Generalization in Image Captioning
Compositional Generalization in Image Captioning
Mitja Nikolaus
Mostafa Abdou
Matthew Lamm
Rahul Aralikatte
Desmond Elliott
CoGe
27
49
0
10 Sep 2019
Aligning Linguistic Words and Visual Semantic Units for Image Captioning
Aligning Linguistic Words and Visual Semantic Units for Image Captioning
Longteng Guo
Jing Liu
Jinhui Tang
Jiangwei Li
W. Luo
Hanqing Lu
25
102
0
06 Aug 2019
Physical Cue based Depth-Sensing by Color Coding with Deaberration
  Network
Physical Cue based Depth-Sensing by Color Coding with Deaberration Network
Nao Mishima
Tatsuo Kozakaya
Akihisa Moriya
R. Okada
S. Hiura
3DV
26
3
0
01 Aug 2019
Predicting Motion of Vulnerable Road Users using High-Definition Maps
  and Efficient ConvNets
Predicting Motion of Vulnerable Road Users using High-Definition Maps and Efficient ConvNets
Fang-Chieh Chou
Tsung-Han Lin
Henggang Cui
Vladan Radosavljevic
Thi Nguyen
Tzu-Kuo Huang
Matthew Niedoba
J. Schneider
Nemanja Djuric
10
58
0
20 Jun 2019
Reconstruct and Represent Video Contents for Captioning via
  Reinforcement Learning
Reconstruct and Represent Video Contents for Captioning via Reinforcement Learning
Wei Zhang
Bairui Wang
Lin Ma
Wei Liu
20
67
0
03 Jun 2019
Automatically Dismantling Online Dating Fraud
Automatically Dismantling Online Dating Fraud
Guillermo Suarez-Tangil
M. Edwards
Claudia Peersman
Gianluca Stringhini
A. Rashid
M. Whitty
9
58
0
29 May 2019
3G structure for image caption generation
3G structure for image caption generation
Aihong Yuan
Xuelong Li
Xiaoqiang Lu
21
34
0
21 Apr 2019
End-to-End Video Captioning
End-to-End Video Captioning
Silvio Olivastri
Gurkirt Singh
Fabio Cuzzolin
16
18
0
04 Apr 2019
Good News, Everyone! Context driven entity-aware captioning for news
  images
Good News, Everyone! Context driven entity-aware captioning for news images
Ali Furkan Biten
Lluís Gómez
Marçal Rusiñol
Dimosthenis Karatzas
21
139
0
02 Apr 2019
Visual Entailment: A Novel Task for Fine-Grained Image Understanding
Visual Entailment: A Novel Task for Fine-Grained Image Understanding
Ning Xie
Farley Lai
Derek Doran
Asim Kadav
CoGe
51
322
0
20 Jan 2019
A Survey of the Recent Architectures of Deep Convolutional Neural
  Networks
A Survey of the Recent Architectures of Deep Convolutional Neural Networks
Asifullah Khan
A. Sohail
Umme Zahoora
Aqsa Saeed Qureshi
OOD
62
2,268
0
17 Jan 2019
Pre-gen metrics: Predicting caption quality metrics without generating
  captions
Pre-gen metrics: Predicting caption quality metrics without generating captions
Marc Tanti
Albert Gatt
K. Camilleri
26
2
0
12 Oct 2018
A Comprehensive Survey of Deep Learning for Image Captioning
A Comprehensive Survey of Deep Learning for Image Captioning
Md Zakir Hossain
Ferdous Sohel
M. Shiratuddin
Hamid Laga
VLM
3DV
45
760
0
06 Oct 2018
Context-Dependent Diffusion Network for Visual Relationship Detection
Context-Dependent Diffusion Network for Visual Relationship Detection
Zhen Cui
Chunyan Xu
Wenming Zheng
Jian Yang
GNN
22
50
0
11 Sep 2018
LUCSS: Language-based User-customized Colourization of Scene Sketches
LUCSS: Language-based User-customized Colourization of Scene Sketches
C. Zou
Haoran Mo
Ruofei Du
Xing Wu
Chengying Gao
Hongbo Fu
30
8
0
30 Aug 2018
Exploring the Applications of Faster R-CNN and Single-Shot Multi-box
  Detection in a Smart Nursery Domain
Exploring the Applications of Faster R-CNN and Single-Shot Multi-box Detection in a Smart Nursery Domain
S. Phon-Amnuaisuk
K. Murata
P. Pavarangkoon
Kazunori Yamamoto
Takamichi Mizuhara
ObjD
18
11
0
27 Aug 2018
Shuffle-Then-Assemble: Learning Object-Agnostic Visual Relationship
  Features
Shuffle-Then-Assemble: Learning Object-Agnostic Visual Relationship Features
Xu Yang
Hanwang Zhang
Jianfei Cai
47
74
0
01 Aug 2018
Big-Little Net: An Efficient Multi-Scale Feature Representation for
  Visual and Speech Recognition
Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition
Chun-Fu Chen
Quanfu Fan
Neil Rohit Mallinar
Tom Sercu
Rogerio Feris
20
96
0
10 Jul 2018
Topic-Guided Attention for Image Captioning
Topic-Guided Attention for Image Captioning
Zhihao Zhu
Zhan Xue
Zejian Yuan
30
23
0
10 Jul 2018
Natural Language Generation for Electronic Health Records
Natural Language Generation for Electronic Health Records
Scott H. Lee
SyDa
16
81
0
01 Jun 2018
Fast, Diverse and Accurate Image Captioning Guided By Part-of-Speech
Fast, Diverse and Accurate Image Captioning Guided By Part-of-Speech
Aditya Deshpande
J. Aneja
Liwei Wang
A. Schwing
David A. Forsyth
27
146
0
31 May 2018
SemStyle: Learning to Generate Stylised Image Captions using Unaligned
  Text
SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text
A. Mathews
Lexing Xie
Xuming He
VLM
24
115
0
18 May 2018
Object Counts! Bringing Explicit Detections Back into Image Captioning
Object Counts! Bringing Explicit Detections Back into Image Captioning
Josiah Wang
Pranava Madhyastha
Lucia Specia
ObjD
19
37
0
23 Apr 2018
Learning to Guide Decoding for Image Captioning
Learning to Guide Decoding for Image Captioning
Wenhao Jiang
Lin Ma
Xinpeng Chen
Hanwang Zhang
Wen Liu
16
69
0
03 Apr 2018
Reconstruction Network for Video Captioning
Reconstruction Network for Video Captioning
Bairui Wang
Lin Ma
Wei Zhang
Wen Liu
38
317
0
30 Mar 2018
Toolflows for Mapping Convolutional Neural Networks on FPGAs: A Survey
  and Future Directions
Toolflows for Mapping Convolutional Neural Networks on FPGAs: A Survey and Future Directions
Stylianos I. Venieris
Alexandros Kouris
C. Bouganis
19
184
0
15 Mar 2018
HoME: a Household Multimodal Environment
HoME: a Household Multimodal Environment
Simon Brodeur
Ethan Perez
Ankesh Anand
Florian Golemo
Luca Herranz-Celotti
Florian Strub
Jean Rouat
Hugo Larochelle
Aaron Courville
LM&Ro
44
103
0
29 Nov 2017
Convolutional Image Captioning
Convolutional Image Captioning
J. Aneja
Aditya Deshpande
A. Schwing
VLM
37
359
0
24 Nov 2017
Diverse and Accurate Image Description Using a Variational Auto-Encoder
  with an Additive Gaussian Encoding Space
Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space
Liwei Wang
A. Schwing
Svetlana Lazebnik
CoGe
37
175
0
19 Nov 2017
Dual-Path Convolutional Image-Text Embeddings with Instance Loss
Dual-Path Convolutional Image-Text Embeddings with Instance Loss
Zhedong Zheng
Liang Zheng
Michael Garrett
Yi Yang
Mingliang Xu
Yi-Dong Shen
27
470
0
15 Nov 2017
Semantic Image Retrieval via Active Grounding of Visual Situations
Semantic Image Retrieval via Active Grounding of Visual Situations
Max H. Quinn
E. Conser
Jordan M. Witte
Melanie Mitchell
13
9
0
31 Oct 2017
Recognizing and Curating Photo Albums via Event-Specific Image
  Importance
Recognizing and Curating Photo Albums via Event-Specific Image Importance
Yufei Wang
Zhe-nan Lin
Xiaohui Shen
R. Měch
G. Miller
G. Cottrell
30
13
0
19 Jul 2017
I2T2I: Learning Text to Image Synthesis with Textual Data Augmentation
I2T2I: Learning Text to Image Synthesis with Textual Data Augmentation
Hao Dong
Jingqing Zhang
Douglas McIlwraith
Yike Guo
35
58
0
20 Mar 2017
Recurrent Models for Situation Recognition
Recurrent Models for Situation Recognition
Arun Mallya
Svetlana Lazebnik
20
30
0
18 Mar 2017
Evolving Deep Neural Networks
Evolving Deep Neural Networks
Risto Miikkulainen
J. Liang
Elliot Meyerson
Aditya Rawal
Daniel Fink
...
B. Raju
H. Shahrzad
Arshak Navruzyan
Nigel P. Duffy
B. Hodjat
16
884
0
01 Mar 2017
Visual Translation Embedding Network for Visual Relation Detection
Visual Translation Embedding Network for Visual Relation Detection
Hanwang Zhang
Zawlin Kyaw
Shih-Fu Chang
Tat-Seng Chua
ViT
154
560
0
27 Feb 2017
Learning Visual N-Grams from Web Data
Learning Visual N-Grams from Web Data
Ang Li
Allan Jabri
Armand Joulin
L. V. D. van der Maaten
VLM
20
136
0
29 Dec 2016
An Empirical Study of Language CNN for Image Captioning
An Empirical Study of Language CNN for Image Captioning
Jiuxiang Gu
G. Wang
Jianfei Cai
Tsuhan Chen
31
132
0
21 Dec 2016
Self-critical Sequence Training for Image Captioning
Self-critical Sequence Training for Image Captioning
Steven J. Rennie
E. Marcheret
Youssef Mroueh
Jerret Ross
Vaibhava Goel
11
1,877
0
02 Dec 2016
Semantic Regularisation for Recurrent Image Annotation
Semantic Regularisation for Recurrent Image Annotation
Feng Liu
Tao Xiang
Timothy M. Hospedales
Wankou Yang
Changyin Sun
31
103
0
16 Nov 2016
Previous
12