ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1412.6632
  4. Cited By
Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)

Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)

20 December 2014
Junhua Mao
Wenyuan Xu
Yi Yang
Jiang Wang
Zhiheng Huang
Alan Yuille
    VLM
ArXivPDFHTML

Papers citing "Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)"

50 / 417 papers shown
Title
Aesthetic Image Captioning From Weakly-Labelled Photographs
Aesthetic Image Captioning From Weakly-Labelled Photographs
Koustav Ghosal
A. Rana
A. Smolic
27
25
0
29 Aug 2019
Adversarial Representation Learning for Text-to-Image Matching
Adversarial Representation Learning for Text-to-Image Matching
N. Sarafianos
Xiang Xu
I. Kakadiaris
GAN
46
186
0
28 Aug 2019
Sequential Latent Spaces for Modeling the Intention During Diverse Image
  Captioning
Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
J. Aneja
Harsh Agrawal
Dhruv Batra
A. Schwing
BDL
VLM
23
66
0
22 Aug 2019
Dynamic Stale Synchronous Parallel Distributed Training for Deep
  Learning
Dynamic Stale Synchronous Parallel Distributed Training for Deep Learning
Xing Zhao
Aijun An
Junfeng Liu
B. Chen
26
57
0
16 Aug 2019
Semi Supervised Phrase Localization in a Bidirectional Caption-Image
  Retrieval Framework
Semi Supervised Phrase Localization in a Bidirectional Caption-Image Retrieval Framework
Deepan Das
Noor Mohammed Ghouse
Shashank Verma
Yin Li
11
0
0
08 Aug 2019
Image Captioning using Facial Expression and Attention
Image Captioning using Facial Expression and Attention
Omid Mohamad Nezami
Mark Dras
Stephen Wan
Cécile Paris
CVBM
17
8
0
08 Aug 2019
Scene-based Factored Attention for Image Captioning
Scene-based Factored Attention for Image Captioning
Chen Shen
Rongrong Ji
Fuhai Chen
Xiaoshuai Sun
Xiangming Li
24
0
0
07 Aug 2019
Cascaded Revision Network for Novel Object Captioning
Cascaded Revision Network for Novel Object Captioning
Qianyu Feng
Yu Wu
Hehe Fan
C. Yan
Yezhou Yang
29
35
0
06 Aug 2019
Image Captioning with Unseen Objects
Image Captioning with Unseen Objects
B. Demirel
R. G. Cinbis
Nazli Ikizler-Cinbis
VLM
24
16
0
31 Jul 2019
An Empirical Study on Leveraging Scene Graphs for Visual Question
  Answering
An Empirical Study on Leveraging Scene Graphs for Visual Question Answering
Cheng Zhang
Wei-Lun Chao
D. Xuan
23
50
0
28 Jul 2019
Learning Visual Actions Using Multiple Verb-Only Labels
Learning Visual Actions Using Multiple Verb-Only Labels
Michael Wray
Dima Damen
28
7
0
25 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of
  Tasks, Datasets, and Methods
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
25
132
0
22 Jul 2019
Image Captioning: Transforming Objects into Words
Image Captioning: Transforming Objects into Words
Simão Herdade
Armin Kappeler
K. Boakye
Joao Soares
ViT
45
462
0
14 Jun 2019
Figure Captioning with Reasoning and Sequence-Level Training
Figure Captioning with Reasoning and Sequence-Level Training
Charles C. Chen
Ruiyi Zhang
Eunyee Koh
Sungchul Kim
Scott D. Cohen
Tong Yu
Ryan Rossi
Razvan Bunescu
AIMat
31
38
0
07 Jun 2019
Context-Aware Visual Policy Network for Fine-Grained Image Captioning
Context-Aware Visual Policy Network for Fine-Grained Image Captioning
Zhengjun Zha
Daqing Liu
Hanwang Zhang
Yongdong Zhang
Feng Wu
25
120
0
06 Jun 2019
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Vision-to-Language Tasks Based on Attributes and Attention Mechanism
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
21
37
0
29 May 2019
Multimodal Transformer with Multi-View Visual Representation for Image
  Captioning
Multimodal Transformer with Multi-View Visual Representation for Image Captioning
Jun-chen Yu
Jing Li
Zhou Yu
Qingming Huang
ViT
27
377
0
20 May 2019
AI in the media and creative industries
AI in the media and creative industries
Giuseppe Amato
Malte Behrmann
Frédéric Bimbot
Baptiste Caramiaux
Fabrizio Falchi
...
Andrew Perkis
R. Redondo
Enrico Turrin
T. Viéville
Emmanuel Vincent
19
42
0
10 May 2019
3G structure for image caption generation
3G structure for image caption generation
Aihong Yuan
Xuelong Li
Xiaoqiang Lu
21
34
0
21 Apr 2019
Saliency-Guided Attention Network for Image-Sentence Matching
Saliency-Guided Attention Network for Image-Sentence Matching
Zhong Ji
Haoran Wang
J. Han
Yanwei Pang
17
88
0
20 Apr 2019
Multi-modal gated recurrent units for image description
Multi-modal gated recurrent units for image description
Xuelong Li
Aihong Yuan
Xiaoqiang Lu
GAN
21
26
0
20 Apr 2019
Challenges and Prospects in Vision and Language Research
Challenges and Prospects in Vision and Language Research
Kushal Kafle
Robik Shrestha
Christopher Kanan
22
41
0
19 Apr 2019
Self-critical n-step Training for Image Captioning
Self-critical n-step Training for Image Captioning
Junlong Gao
Shiqi Wang
Shanshe Wang
Siwei Ma
Wen Gao
19
55
0
15 Apr 2019
Factor Graph Attention
Factor Graph Attention
Idan Schwartz
Seunghak Yu
Tamir Hazan
A. Schwing
24
110
0
11 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
A. Schwing
Tamir Hazan
27
69
0
11 Apr 2019
Reasoning Visual Dialogs with Structural and Partial Observations
Reasoning Visual Dialogs with Structural and Partial Observations
Zilong Zheng
Wenguan Wang
Siyuan Qi
Song-Chun Zhu
39
117
0
11 Apr 2019
Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption
  Alignment
Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment
Samyak Datta
Karan Sikka
Anirban Roy
Karuna Ahuja
Devi Parikh
Ajay Divakaran
14
102
0
27 Mar 2019
Recurrent Back-Projection Network for Video Super-Resolution
Recurrent Back-Projection Network for Video Super-Resolution
Muhammad Haris
Gregory Shakhnarovich
Norimichi Ukita
SupR
28
431
0
25 Mar 2019
Neural Sequential Phrase Grounding (SeqGROUND)
Neural Sequential Phrase Grounding (SeqGROUND)
Pelin Dogan
Leonid Sigal
Markus Gross
ObjD
30
51
0
18 Mar 2019
Image captioning with weakly-supervised attention penalty
Image captioning with weakly-supervised attention penalty
Jiayun Li
M. K. Ebrahimpour
Azadeh Moghtaderi
Yen-Yun Yu
20
5
0
06 Mar 2019
JECL: Joint Embedding and Cluster Learning for Image-Text Pairs
JECL: Joint Embedding and Cluster Learning for Image-Text Pairs
Sean T. Yang
Kuan-Hao Huang
Bill Howe
VLM
19
3
0
04 Jan 2019
CLEVR-Ref+: Diagnosing Visual Reasoning with Referring Expressions
CLEVR-Ref+: Diagnosing Visual Reasoning with Referring Expressions
Runtao Liu
Chenxi Liu
Yutong Bai
Alan Yuille
NAI
ObjD
22
122
0
03 Jan 2019
Transfer learning from language models to image caption generators:
  Better models may not transfer better
Transfer learning from language models to image caption generators: Better models may not transfer better
Marc Tanti
Albert Gatt
K. Camilleri
VLM
23
3
0
01 Jan 2019
Coupled Recurrent Network (CRN)
Coupled Recurrent Network (CRN)
Lin Sun
Kui Jia
Yuejia Shen
Silvio Savarese
Dit-Yan Yeung
Bertram E. Shi
23
4
0
25 Dec 2018
Attend More Times for Image Captioning
Attend More Times for Image Captioning
Jiajun Du
Yu Qin
Hongtao Lu
Yonghua Zhang
VLM
21
5
0
08 Dec 2018
An Attempt towards Interpretable Audio-Visual Video Captioning
An Attempt towards Interpretable Audio-Visual Video Captioning
Yapeng Tian
Chenxiao Guan
Justin Goodman
Marc Moore
Chenliang Xu
36
20
0
07 Dec 2018
Layer Flexible Adaptive Computational Time
Layer Flexible Adaptive Computational Time
Lida Zhang
Abdolghani Ebrahimi
Diego Klabjan
AI4CE
36
1
0
06 Dec 2018
Neural Rejuvenation: Improving Deep Network Training by Enhancing
  Computational Resource Utilization
Neural Rejuvenation: Improving Deep Network Training by Enhancing Computational Resource Utilization
Siyuan Qiao
Zhe-nan Lin
Jianming Zhang
Alan Yuille
8
23
0
02 Dec 2018
Intention Oriented Image Captions with Guiding Objects
Intention Oriented Image Captions with Guiding Objects
Yue Zheng
Yali Li
Shengjin Wang
24
55
0
19 Nov 2018
Image Captioning Based on a Hierarchical Attention Mechanism and Policy
  Gradient Optimization
Image Captioning Based on a Hierarchical Attention Mechanism and Policy Gradient Optimization
Shiyang Yan
Yuan Xie
F. Wu
Jeremy S. Smith
Wenjin Lu
Bailing Zhang
14
5
0
13 Nov 2018
A sequential guiding network with attention for image captioning
A sequential guiding network with attention for image captioning
Daouda Sow
Zengchang Qin
Mouhamed Niasse
T. Wan
26
3
0
01 Nov 2018
Session-based Recommendation with Graph Neural Networks
Session-based Recommendation with Graph Neural Networks
Shu Wu
Yuyuan Tang
Yanqiao Zhu
Liang Wang
Xing Xie
Tieniu Tan
GNN
16
1,538
0
01 Nov 2018
Gated Hierarchical Attention for Image Captioning
Gated Hierarchical Attention for Image Captioning
Qingzhong Wang
Antoni B. Chan
24
18
0
30 Oct 2018
Using Deep Learning for price prediction by exploiting stationary limit
  order book features
Using Deep Learning for price prediction by exploiting stationary limit order book features
Avraam Tsantekidis
Nikolaos Passalis
Anastasios Tefas
Juho Kanniainen
Moncef Gabbouj
Alexandros Iosifidis
OOD
24
88
0
23 Oct 2018
A Comprehensive Survey of Deep Learning for Image Captioning
A Comprehensive Survey of Deep Learning for Image Captioning
Md Zakir Hossain
Ferdous Sohel
M. Shiratuddin
Hamid Laga
VLM
3DV
45
761
0
06 Oct 2018
Zoom-RNN: A Novel Method for Person Recognition Using Recurrent Neural
  Networks
Zoom-RNN: A Novel Method for Person Recognition Using Recurrent Neural Networks
Sina Mokhtarzadeh Azar
Sajjad Azami
Mina Ghadimi Atigh
Mohammad Javadi
A. Nickabadi
23
2
0
24 Sep 2018
Neural Approaches to Conversational AI
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
46
670
0
21 Sep 2018
Lessons learned in multilingual grounded language learning
Lessons learned in multilingual grounded language learning
Ákos Kádár
Desmond Elliott
Marc-Alexandre Côté
Grzegorz Chrupała
A. Alishahi
VLM
20
24
0
20 Sep 2018
Image Captioning based on Deep Reinforcement Learning
Image Captioning based on Deep Reinforcement Learning
Haichao Shi
Peng Li
Bo Wang
Zhenyu Wang
20
25
0
13 Sep 2018
End-to-end Image Captioning Exploits Multimodal Distributional
  Similarity
End-to-end Image Captioning Exploits Multimodal Distributional Similarity
Pranava Madhyastha
Josiah Wang
Lucia Specia
CoGe
35
7
0
11 Sep 2018
Previous
123456789
Next