ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1411.4555
  4. Cited By
Show and Tell: A Neural Image Caption Generator

Show and Tell: A Neural Image Caption Generator

17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
    3DV
ArXivPDFHTML

Papers citing "Show and Tell: A Neural Image Caption Generator"

50 / 2,023 papers shown
Title
O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable
  Video Captioning
O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video Captioning
Fenglin Liu
Xuancheng Ren
Xian Wu
Bang-ju Yang
Shen Ge
Yuexian Zou
Xu Sun
32
32
0
05 Aug 2021
Ordered Attention for Coherent Visual Storytelling
Ordered Attention for Coherent Visual Storytelling
Tom Braude
Idan Schwartz
Alex Schwing
Ariel Shamir
33
9
0
04 Aug 2021
Question-controlled Text-aware Image Captioning
Question-controlled Text-aware Image Captioning
Anwen Hu
Shizhe Chen
Qin Jin
19
15
0
04 Aug 2021
ICECAP: Information Concentrated Entity-aware Image Captioning
ICECAP: Information Concentrated Entity-aware Image Captioning
Anwen Hu
Shizhe Chen
Qin Jin
28
20
0
04 Aug 2021
Distributed Attention for Grounded Image Captioning
Distributed Attention for Grounded Image Captioning
Nenglun Chen
Xingjia Pan
Runnan Chen
Lei Yang
Zhiwen Lin
Yuqiang Ren
Haolei Yuan
Xiaowei Guo
Feiyue Huang
Wenping Wang
27
21
0
02 Aug 2021
Learning TFIDF Enhanced Joint Embedding for Recipe-Image Cross-Modal
  Retrieval Service
Learning TFIDF Enhanced Joint Embedding for Recipe-Image Cross-Modal Retrieval Service
Zhongwei Xie
Ling Liu
Yanzhao Wu
Lin Li
Luo Zhong
28
22
0
02 Aug 2021
Advances in adversarial attacks and defenses in computer vision: A
  survey
Advances in adversarial attacks and defenses in computer vision: A survey
Naveed Akhtar
Ajmal Mian
Navid Kardan
M. Shah
AAML
41
236
0
01 Aug 2021
UAV Trajectory Planning in Wireless Sensor Networks for Energy
  Consumption Minimization by Deep Reinforcement Learning
UAV Trajectory Planning in Wireless Sensor Networks for Energy Consumption Minimization by Deep Reinforcement Learning
Botao Zhu
E. Bedeer
Ha H. Nguyen
Robert Barton
Jérôme Henry
23
119
0
01 Aug 2021
Chest ImaGenome Dataset for Clinical Reasoning
Chest ImaGenome Dataset for Clinical Reasoning
Joy T. Wu
Nkechinyere N. Agu
Ismini Lourentzou
Arjun Sharma
J. Paguio
...
William Mitchell
Satyananda Kashyap
Andrea Giovannini
Leo Anthony Celi
Mehdi Moradi
21
65
0
31 Jul 2021
Experimenting with Self-Supervision using Rotation Prediction for Image
  Captioning
Experimenting with Self-Supervision using Rotation Prediction for Image Captioning
Ahmed Elhagry
Karima Kadaoui
SSL
16
0
0
28 Jul 2021
Language Models as Zero-shot Visual Semantic Learners
Language Models as Zero-shot Visual Semantic Learners
Yue Jiao
Jonathon S. Hare
Adam Prugel-Bennett
VLM
27
0
0
26 Jul 2021
Boosting Entity-aware Image Captioning with Multi-modal Knowledge Graph
Boosting Entity-aware Image Captioning with Multi-modal Knowledge Graph
Wentian Zhao
Yao Hu
Heda Wang
Xinxiao Wu
Jiebo Luo
23
47
0
26 Jul 2021
Transcript to Video: Efficient Clip Sequencing from Texts
Transcript to Video: Efficient Clip Sequencing from Texts
Yu Xiong
Fabian Caba Heilbron
Dahua Lin
CLIP
33
10
0
25 Jul 2021
Boosting Video Captioning with Dynamic Loss Network
Boosting Video Captioning with Dynamic Loss Network
Nasib Ullah
Partha Pratim Mohanta
30
1
0
25 Jul 2021
Explainable artificial intelligence (XAI) in deep learning-based medical
  image analysis
Explainable artificial intelligence (XAI) in deep learning-based medical image analysis
Bas H. M. van der Velden
Hugo J. Kuijf
K. Gilhuijs
M. Viergever
XAI
45
640
0
22 Jul 2021
GI-NNet \& RGI-NNet: Development of Robotic Grasp Pose Models, Trainable
  with Large as well as Limited Labelled Training Datasets, under supervised
  and semi supervised paradigms
GI-NNet \& RGI-NNet: Development of Robotic Grasp Pose Models, Trainable with Large as well as Limited Labelled Training Datasets, under supervised and semi supervised paradigms
Priya Shukla
Nilotpal Pramanik
Deepesh Mehta
G. C. Nandi
27
1
0
15 Jul 2021
Variational Topic Inference for Chest X-Ray Report Generation
Variational Topic Inference for Chest X-Ray Report Generation
Ivona Najdenkoska
Xiantong Zhen
M. Worring
Ling Shao
MedIm
48
28
0
15 Jul 2021
Surgical Instruction Generation with Transformers
Surgical Instruction Generation with Transformers
Jinglu Zhang
Y. Nie
Jian Chang
Jiangning Zhang
MedIm
27
13
0
14 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
From Show to Tell: A Survey on Deep Learning-based Image Captioning
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV
VLM
MLLM
69
256
0
14 Jul 2021
Controlled Caption Generation for Images Through Adversarial Attacks
Controlled Caption Generation for Images Through Adversarial Attacks
Nayyer Aafaq
Naveed Akhtar
Wei Liu
M. Shah
Ajmal Mian
AAML
41
9
0
07 Jul 2021
RATCHET: Medical Transformer for Chest X-ray Diagnosis and Reporting
RATCHET: Medical Transformer for Chest X-ray Diagnosis and Reporting
Benjamin Hou
Georgios Kaissis
Ronald M. Summers
Bernhard Kainz
ViT
LM&MA
MedIm
33
50
0
05 Jul 2021
Cognitive Visual Commonsense Reasoning Using Dynamic Working Memory
Cognitive Visual Commonsense Reasoning Using Dynamic Working Memory
Xuejiao Tang
Xin Huang
Wenbin Zhang
T. Child
Qiong Hu
Zhen Liu
Ji Zhang
LRM
22
18
0
04 Jul 2021
Productivity, Portability, Performance: Data-Centric Python
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
59
95
0
01 Jul 2021
Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake
  Monitoring
Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake Monitoring
Jianing Qiu
Frank P.-W. Lo
Xiao Gu
M. Jobarteh
Wenyan Jia
...
M. McCrory
Edward Sazonov
Mingui Sun
Gary Frost
Benny Lo
EgoV
38
18
0
01 Jul 2021
Contrastive Semantic Similarity Learning for Image Captioning Evaluation
  with Intrinsic Auto-encoder
Contrastive Semantic Similarity Learning for Image Captioning Evaluation with Intrinsic Auto-encoder
Chao Zeng
Tiesong Zhao
Sam Kwong
30
2
0
29 Jun 2021
Building a Video-and-Language Dataset with Human Actions for Multimodal
  Logical Inference
Building a Video-and-Language Dataset with Human Actions for Multimodal Logical Inference
Riko Suzuki
Hitomi Yanaka
K. Mineshima
D. Bekki
VGen
MLLM
21
1
0
27 Jun 2021
UMIC: An Unreferenced Metric for Image Captioning via Contrastive
  Learning
UMIC: An Unreferenced Metric for Image Captioning via Contrastive Learning
Hwanhee Lee
Seunghyun Yoon
Franck Dernoncourt
Trung Bui
Kyomin Jung
VLM
21
44
0
26 Jun 2021
Neural Fashion Image Captioning : Accounting for Data Diversity
Neural Fashion Image Captioning : Accounting for Data Diversity
Gilles Hacheme
Nouréini Sayouti
17
12
0
23 Jun 2021
TCIC: Theme Concepts Learning Cross Language and Vision for Image
  Captioning
TCIC: Theme Concepts Learning Cross Language and Vision for Image Captioning
Zhihao Fan
Zhongyu Wei
Siyuan Wang
Ruize Wang
Zejun Li
Haijun Shan
Xuanjing Huang
32
26
0
21 Jun 2021
Trust It or Not: Confidence-Guided Automatic Radiology Report Generation
Trust It or Not: Confidence-Guided Automatic Radiology Report Generation
Yixin Wang
Zihao Lin
Zhe Xu
Haoyu Dong
Jiang Tian
Jie Luo
Zhongchao Shi
Yang Zhang
Jianping Fan
Zhiqiang He
UQCV
MedIm
43
12
0
21 Jun 2021
Do Encoder Representations of Generative Dialogue Models Encode
  Sufficient Information about the Task ?
Do Encoder Representations of Generative Dialogue Models Encode Sufficient Information about the Task ?
Prasanna Parthasarathi
J. Pineau
Sarath Chandar
28
2
0
20 Jun 2021
A Brief Study on the Effects of Training Generative Dialogue Models with
  a Semantic loss
A Brief Study on the Effects of Training Generative Dialogue Models with a Semantic loss
Prasanna Parthasarathi
Mohamed Abdelsalam
J. Pineau
Sarath Chandar
22
0
0
20 Jun 2021
GEM: A General Evaluation Benchmark for Multimodal Tasks
GEM: A General Evaluation Benchmark for Multimodal Tasks
Lin Su
Nan Duan
Edward Cui
Lei Ji
Chenfei Wu
Huaishao Luo
Yongfei Liu
Ming Zhong
Taroon Bharti
Arun Sacheti
VLM
19
19
0
18 Jun 2021
Semi-Autoregressive Transformer for Image Captioning
Semi-Autoregressive Transformer for Image Captioning
Yuanen Zhou
Yong Zhang
Zhenzhen Hu
Meng Wang
VLM
34
24
0
17 Jun 2021
Evolving Image Compositions for Feature Representation Learning
Evolving Image Compositions for Feature Representation Learning
Paola Cascante-Bonilla
Arshdeep Sekhon
Yanjun Qi
Vicente Ordonez
SSL
24
7
0
16 Jun 2021
Redefining Neural Architecture Search of Heterogeneous Multi-Network
  Models by Characterizing Variation Operators and Model Components
Redefining Neural Architecture Search of Heterogeneous Multi-Network Models by Characterizing Variation Operators and Model Components
Unai Garciarena
Roberto Santana
A. Mendiburu
25
2
0
16 Jun 2021
Understanding and Evaluating Racial Biases in Image Captioning
Understanding and Evaluating Racial Biases in Image Captioning
Dora Zhao
Angelina Wang
Olga Russakovsky
30
134
0
16 Jun 2021
Pre-Trained Models: Past, Present and Future
Pre-Trained Models: Past, Present and Future
Xu Han
Zhengyan Zhang
Ning Ding
Yuxian Gu
Xiao Liu
...
Jie Tang
Ji-Rong Wen
Jinhui Yuan
Wayne Xin Zhao
Jun Zhu
AIFin
MQ
AI4MH
60
818
0
14 Jun 2021
Contrastive Attention for Automatic Chest X-ray Report Generation
Contrastive Attention for Automatic Chest X-ray Report Generation
Fenglin Liu
Changchang Yin
Xian Wu
Shen Ge
Yuexian Zou
Ping Zhang
Yuexian Zou
Xu Sun
MedIm
16
146
0
13 Jun 2021
Exploring and Distilling Posterior and Prior Knowledge for Radiology
  Report Generation
Exploring and Distilling Posterior and Prior Knowledge for Radiology Report Generation
Fenglin Liu
Xian Wu
Shen Ge
Wei Fan
Yuexian Zou
MedIm
37
249
0
13 Jun 2021
Conversational Fashion Image Retrieval via Multiturn Natural Language
  Feedback
Conversational Fashion Image Retrieval via Multiturn Natural Language Feedback
Yifei Yuan
W. Lam
21
43
0
08 Jun 2021
An End-to-End Breast Tumour Classification Model Using Context-Based
  Patch Modelling- A BiLSTM Approach for Image Classification
An End-to-End Breast Tumour Classification Model Using Context-Based Patch Modelling- A BiLSTM Approach for Image Classification
S. Tripathi
S. Singh
H. Lee
21
43
0
05 Jun 2021
Attention mechanisms and deep learning for machine vision: A survey of
  the state of the art
Attention mechanisms and deep learning for machine vision: A survey of the state of the art
A. M. Hafiz
S. A. Parah
R. A. Bhat
26
45
0
03 Jun 2021
SMURF: SeMantic and linguistic UndeRstanding Fusion for Caption
  Evaluation via Typicality Analysis
SMURF: SeMantic and linguistic UndeRstanding Fusion for Caption Evaluation via Typicality Analysis
Joshua Forster Feinglass
Yezhou Yang
24
21
0
02 Jun 2021
multiPRover: Generating Multiple Proofs for Improved Interpretability in
  Rule Reasoning
multiPRover: Generating Multiple Proofs for Improved Interpretability in Rule Reasoning
Swarnadeep Saha
Prateek Yadav
Joey Tianyi Zhou
ReLM
LRM
24
26
0
02 Jun 2021
Fine-grained Generalization Analysis of Structured Output Prediction
Fine-grained Generalization Analysis of Structured Output Prediction
Waleed Mustafa
Yunwen Lei
Antoine Ledent
Marius Kloft
13
9
0
31 May 2021
Longer Version for "Deep Context-Encoding Network for Retinal Image
  Captioning"
Longer Version for "Deep Context-Encoding Network for Retinal Image Captioning"
Jia-Hong Huang
Ting-Wei Wu
Chao-Han Huck Yang
M. Worring
MedIm
22
28
0
30 May 2021
Towards Diverse Paragraph Captioning for Untrimmed Videos
Towards Diverse Paragraph Captioning for Untrimmed Videos
Yuqing Song
Shizhe Chen
Qin Jin
21
37
0
30 May 2021
Maria: A Visual Experience Powered Conversational Agent
Maria: A Visual Experience Powered Conversational Agent
Zujie Liang
Huang Hu
Can Xu
Chongyang Tao
Xiubo Geng
Yining Chen
Fan Liang
Daxin Jiang
27
29
0
27 May 2021
ST-HOI: A Spatial-Temporal Baseline for Human-Object Interaction
  Detection in Videos
ST-HOI: A Spatial-Temporal Baseline for Human-Object Interaction Detection in Videos
Meng-Jiun Chiou
Chun-Yu Liao
Li-Wei Wang
Roger Zimmermann
Jiashi Feng
43
24
0
25 May 2021
Previous
123...121314...394041
Next