ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1411.4555
  4. Cited By
Show and Tell: A Neural Image Caption Generator

Show and Tell: A Neural Image Caption Generator

17 November 2014
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
    3DV
ArXivPDFHTML

Papers citing "Show and Tell: A Neural Image Caption Generator"

50 / 2,023 papers shown
Title
VirTex: Learning Visual Representations from Textual Annotations
VirTex: Learning Visual Representations from Textual Annotations
Karan Desai
Justin Johnson
SSL
VLM
30
433
0
11 Jun 2020
RTEX: A novel methodology for Ranking, Tagging, and Explanatory
  diagnostic captioning of radiography exams
RTEX: A novel methodology for Ranking, Tagging, and Explanatory diagnostic captioning of radiography exams
Vasiliki Kougia
John Pavlopoulos
P. Papapetrou
Max Gordon
32
0
0
11 Jun 2020
Toward Building Safer Smart Homes for the People with Disabilities
Toward Building Safer Smart Homes for the People with Disabilities
Shahinur Alam
M. Mahmud
M. Yeasin
16
4
0
10 Jun 2020
Auxiliary Signal-Guided Knowledge Encoder-Decoder for Medical Report
  Generation
Auxiliary Signal-Guided Knowledge Encoder-Decoder for Medical Report Generation
Mingjie Li
Fuyu Wang
Xiaojun Chang
Xiaodan Liang
MedIm
34
101
0
06 Jun 2020
Pick-Object-Attack: Type-Specific Adversarial Attack for Object
  Detection
Pick-Object-Attack: Type-Specific Adversarial Attack for Object Detection
Omid Mohamad Nezami
Akshay Chaturvedi
Mark Dras
Utpal Garain
AAML
ObjD
26
19
0
05 Jun 2020
An embedded system for the automated generation of labeled plant images
  to enable machine learning applications in agriculture
An embedded system for the automated generation of labeled plant images to enable machine learning applications in agriculture
Michael A. Beck
Chen-Yi Liu
C. Bidinosti
C. Henry
Cara M. Godee
Manisha Ajmani
VLM
19
21
0
01 Jun 2020
JPD-SE: High-Level Semantics for Joint Perception-Distortion Enhancement
  in Image Compression
JPD-SE: High-Level Semantics for Joint Perception-Distortion Enhancement in Image Compression
Shiyu Duan
Huaijin Chen
Liang Feng
32
5
0
24 May 2020
PruneNet: Channel Pruning via Global Importance
PruneNet: Channel Pruning via Global Importance
A. Khetan
Zohar Karnin
26
11
0
22 May 2020
Rethinking and Improving Natural Language Generation with Layer-Wise
  Multi-View Decoding
Rethinking and Improving Natural Language Generation with Layer-Wise Multi-View Decoding
Fenglin Liu
Xuancheng Ren
Guangxiang Zhao
Chenyu You
Xuewei Ma
Xian Wu
Xu Sun
45
2
0
16 May 2020
Flight Time Prediction for Fuel Loading Decisions with a Deep Learning
  Approach
Flight Time Prediction for Fuel Loading Decisions with a Deep Learning Approach
Xinting Zhu
Lishuai Li
11
32
0
12 May 2020
Towards QoS-Aware and Resource-Efficient GPU Microservices Based on
  Spatial Multitasking GPUs In Datacenters
Towards QoS-Aware and Resource-Efficient GPU Microservices Based on Spatial Multitasking GPUs In Datacenters
Wei Zhang
Quan Chen
Kaihua Fu
Ningxin Zheng
Zhiyi Huang
Jingwen Leng
Chao Li
Wenli Zheng
Minyi Guo
27
3
0
05 May 2020
Global Table Extractor (GTE): A Framework for Joint Table Identification
  and Cell Structure Recognition Using Visual Context
Global Table Extractor (GTE): A Framework for Joint Table Identification and Cell Structure Recognition Using Visual Context
Xinyi Zheng
Doug Burdick
Lucian Popa
Xu Zhong
N. Wang
LMTD
35
142
0
01 May 2020
Computing the Testing Error without a Testing Set
Computing the Testing Error without a Testing Set
C. Corneanu
Meysam Madadi
Sergio Escalera
Aleix M. Martinez
AAML
10
69
0
01 May 2020
Towards Embodied Scene Description
Towards Embodied Scene Description
Sinan Tan
Huaping Liu
Di Guo
Xinyu Zhang
F. Sun
LM&Ro
10
9
0
30 Apr 2020
memeBot: Towards Automatic Image Meme Generation
memeBot: Towards Automatic Image Meme Generation
Aadhavan Sadasivam
K. Gunasekar
H. Davulcu
Yezhou Yang
14
9
0
30 Apr 2020
Explainable Deep Learning: A Field Guide for the Uninitiated
Explainable Deep Learning: A Field Guide for the Uninitiated
Gabrielle Ras
Ning Xie
Marcel van Gerven
Derek Doran
AAML
XAI
55
371
0
30 Apr 2020
Pragmatic Issue-Sensitive Image Captioning
Pragmatic Issue-Sensitive Image Captioning
Allen Nie
Reuben Cohn-Gordon
Christopher Potts
20
24
0
29 Apr 2020
Image Captioning through Image Transformer
Image Captioning through Image Transformer
Sen He
Wentong Liao
Hamed R. Tavakoli
M. Yang
Bodo Rosenhahn
N. Pugeault
ViT
41
91
0
29 Apr 2020
Cross-modal Speaker Verification and Recognition: A Multilingual
  Perspective
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective
M. S. Saeed
Shah Nawaz
Pietro Morerio
Arif Mahmood
I. Gallo
Muhammad Haroon Yousaf
Alessio Del Bue
CVBM
28
26
0
28 Apr 2020
Show, Describe and Conclude: On Exploiting the Structure Information of
  Chest X-Ray Reports
Show, Describe and Conclude: On Exploiting the Structure Information of Chest X-Ray Reports
Baoyu Jing
Zeya Wang
Eric Xing
22
139
0
26 Apr 2020
Detective: An Attentive Recurrent Model for Sparse Object Detection
Detective: An Attentive Recurrent Model for Sparse Object Detection
A. Kechaou
Manuel Martínez
Monica Haurilet
Rainer Stiefelhagen
ObjD
12
3
0
25 Apr 2020
VisualCOMET: Reasoning about the Dynamic Context of a Still Image
VisualCOMET: Reasoning about the Dynamic Context of a Still Image
J. S. Park
Chandra Bhagavatula
Roozbeh Mottaghi
Ali Farhadi
Yejin Choi
ReLM
LRM
27
6
0
22 Apr 2020
Textual Visual Semantic Dataset for Text Spotting
Textual Visual Semantic Dataset for Text Spotting
Ahmed Sabir
Francesc Moreno-Noguer
Lluís Padró
24
3
0
21 Apr 2020
ParaCNN: Visual Paragraph Generation via Adversarial Twin Contextual
  CNNs
ParaCNN: Visual Paragraph Generation via Adversarial Twin Contextual CNNs
Shiyang Yan
Yang Hua
N. Robertson
19
7
0
21 Apr 2020
Transform and Tell: Entity-Aware News Image Captioning
Transform and Tell: Entity-Aware News Image Captioning
Alasdair Tran
A. Mathews
Lexing Xie
VLM
28
95
0
17 Apr 2020
Context-Aware Group Captioning via Self-Attention and Contrastive
  Features
Context-Aware Group Captioning via Self-Attention and Contrastive Features
Zhuowan Li
Quan Hung Tran
Long Mai
Zhe Lin
Alan Yuille
VLM
14
44
0
07 Apr 2020
Character-level Japanese Text Generation with Attention Mechanism for
  Chest Radiography Diagnosis
Character-level Japanese Text Generation with Attention Mechanism for Chest Radiography Diagnosis
Kenya Sakka
Kotaro Nakayama
Nisei Kimura
Taiki Inoue
Yusuke Iwasawa
Ryohei Yamaguchi
Yosimasa Kawazoe
K. Ohe
Y. Matsuo
14
2
0
06 Apr 2020
B-SCST: Bayesian Self-Critical Sequence Training for Image Captioning
B-SCST: Bayesian Self-Critical Sequence Training for Image Captioning
Shashank Bujimalla
Mahesh Subedar
Omesh Tickoo
BDL
UQCV
25
10
0
06 Apr 2020
Adding A Filter Based on The Discriminator to Improve Unconditional Text
  Generation
Adding A Filter Based on The Discriminator to Improve Unconditional Text Generation
Xingyuan Chen
Ping Cai
Peng Jin
Hongjun Wang
Xingyu Dai
Jiajun Chen
26
2
0
05 Apr 2020
Open Domain Dialogue Generation with Latent Images
Open Domain Dialogue Generation with Latent Images
Ze Yang
Wei Wu
Huang Hu
Can Xu
Wei Wang
Zhoujun Li
30
29
0
04 Apr 2020
PaStaNet: Toward Human Activity Knowledge Engine
PaStaNet: Toward Human Activity Knowledge Engine
Yong-Lu Li
Liang Xu
Xinpeng Liu
Xijie Huang
Yue Xu
Shiyi Wang
Haoshu Fang
Ze Ma
Mingyang Chen
Cewu Lu
28
151
0
02 Apr 2020
Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal
  Transformers
Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Transformers
Zhicheng Huang
Zhaoyang Zeng
Bei Liu
Dongmei Fu
Jianlong Fu
ViT
50
436
0
02 Apr 2020
Consistent Multiple Sequence Decoding
Consistent Multiple Sequence Decoding
Bicheng Xu
Leonid Sigal
34
0
0
02 Apr 2020
More Grounded Image Captioning by Distilling Image-Text Matching Model
More Grounded Image Captioning by Distilling Image-Text Matching Model
Yuanen Zhou
Meng Wang
Daqing Liu
Zhenzhen Hu
Hanwang Zhang
25
125
0
01 Apr 2020
X-Linear Attention Networks for Image Captioning
X-Linear Attention Networks for Image Captioning
Yingwei Pan
Ting Yao
Yehao Li
Tao Mei
39
510
0
31 Mar 2020
Detection and Description of Change in Visual Streams
Detection and Description of Change in Visual Streams
Davis Gilton
Ruotian Luo
Rebecca Willett
Gregory Shakhnarovich
AI4TS
18
4
0
27 Mar 2020
Grounded Situation Recognition
Grounded Situation Recognition
Sarah M Pratt
Mark Yatskar
Luca Weihs
Ali Farhadi
Aniruddha Kembhavi
30
112
0
26 Mar 2020
Egoshots, an ego-vision life-logging dataset and semantic fidelity
  metric to evaluate diversity in image captioning models
Egoshots, an ego-vision life-logging dataset and semantic fidelity metric to evaluate diversity in image captioning models
Pranav Agarwal
Alejandro Betancourt
V. Panagiotou
Natalia Díaz Rodríguez
EGVM
14
10
0
26 Mar 2020
Learning Compact Reward for Image Captioning
Learning Compact Reward for Image Captioning
Nannan Li
Zhenzhong Chen
23
3
0
24 Mar 2020
Normalized and Geometry-Aware Self-Attention Network for Image
  Captioning
Normalized and Geometry-Aware Self-Attention Network for Image Captioning
Longteng Guo
Jing Liu
Xinxin Zhu
Peng Yao
Shichen Lu
Hanqing Lu
ViT
135
189
0
19 Mar 2020
Fast Distance-based Anomaly Detection in Images Using an Inception-like
  Autoencoder
Fast Distance-based Anomaly Detection in Images Using an Inception-like Autoencoder
Natasa Sarafijanovic-Djukic
Jesse Davis
30
24
0
12 Mar 2020
"An Image is Worth a Thousand Features": Scalable Product
  Representations for In-Session Type-Ahead Personalization
"An Image is Worth a Thousand Features": Scalable Product Representations for In-Session Type-Ahead Personalization
Bingqing Yu
Jacopo Tagliabue
C. Greco
Federico Bianchi
66
10
0
11 Mar 2020
Visual Grounding in Video for Unsupervised Word Translation
Visual Grounding in Video for Unsupervised Word Translation
Gunnar Sigurdsson
Jean-Baptiste Alayrac
Aida Nematzadeh
Lucas Smaira
Mateusz Malinowski
João Carreira
Phil Blunsom
Andrew Zisserman
VGen
29
49
0
11 Mar 2020
Deconfounded Image Captioning: A Causal Retrospect
Deconfounded Image Captioning: A Causal Retrospect
Xu Yang
Hanwang Zhang
Jianfei Cai
CML
18
119
0
09 Mar 2020
Better Captioning with Sequence-Level Exploration
Better Captioning with Sequence-Level Exploration
Jia Chen
Qin Jin
37
12
0
08 Mar 2020
Investigating the Decoders of Maximum Likelihood Sequence Models: A
  Look-ahead Approach
Investigating the Decoders of Maximum Likelihood Sequence Models: A Look-ahead Approach
Yu-Siang Wang
Yen-Ling Kuo
Boris Katz
31
3
0
08 Mar 2020
Noise Estimation Using Density Estimation for Self-Supervised Multimodal
  Learning
Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning
Elad Amrani
Rami Ben-Ari
Daniel Rotman
A. Bronstein
27
121
0
06 Mar 2020
Show, Edit and Tell: A Framework for Editing Image Captions
Show, Edit and Tell: A Framework for Editing Image Captions
Fawaz Sammani
Luke Melas-Kyriazi
KELM
DiffM
48
59
0
06 Mar 2020
Say As You Wish: Fine-grained Control of Image Caption Generation with
  Abstract Scene Graphs
Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs
Shizhe Chen
Qin Jin
Peng Wang
Qi Wu
DiffM
39
215
0
01 Mar 2020
Unblind Your Apps: Predicting Natural-Language Labels for Mobile GUI
  Components by Deep Learning
Unblind Your Apps: Predicting Natural-Language Labels for Mobile GUI Components by Deep Learning
Jieshan Chen
Chunyang Chen
Zhenchang Xing
Xiwei Xu
Liming Zhu
Guoqiang Li
Jinshui Wang
19
139
0
01 Mar 2020
Previous
123...171819...394041
Next