ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.09151
  4. Cited By
Convolutional Image Captioning

Convolutional Image Captioning

24 November 2017
J. Aneja
Aditya Deshpande
A. Schwing
    VLM
ArXivPDFHTML

Papers citing "Convolutional Image Captioning"

50 / 103 papers shown
Title
ChatBEV: A Visual Language Model that Understands BEV Maps
ChatBEV: A Visual Language Model that Understands BEV Maps
Qingyao Xu
Tian Jin
Guang Chen
Yanfeng Wang
Yuyao Zhang
51
0
0
18 Mar 2025
Pixels to Prose: Understanding the art of Image Captioning
Pixels to Prose: Understanding the art of Image Captioning
Hrishikesh Singh
Aarti Sharma
Millie Pant
3DV
VLM
25
0
0
28 Aug 2024
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis
Uri Berger
Gabriel Stanovsky
Omri Abend
Lea Frermann
35
0
0
09 Aug 2024
Compressed Image Captioning using CNN-based Encoder-Decoder Framework
Compressed Image Captioning using CNN-based Encoder-Decoder Framework
Md Alif
Mahmudul Hasan
Shovon Bhowmick
50
1
0
28 Apr 2024
Context-Guided Spatio-Temporal Video Grounding
Context-Guided Spatio-Temporal Video Grounding
Xin Gu
Hengrui Fan
Yan Huang
Tiejian Luo
Libo Zhang
35
14
0
03 Jan 2024
Survey of Social Bias in Vision-Language Models
Survey of Social Bias in Vision-Language Models
Nayeon Lee
Yejin Bang
Holy Lovenia
Samuel Cahyawijaya
Wenliang Dai
Pascale Fung
VLM
47
16
0
24 Sep 2023
Diagnosing Human-object Interaction Detectors
Diagnosing Human-object Interaction Detectors
Fangrui Zhu
Yiming Xie
Weidi Xie
Huaizu Jiang
30
7
0
16 Aug 2023
MMNet: Multi-Collaboration and Multi-Supervision Network for Sequential
  Deepfake Detection
MMNet: Multi-Collaboration and Multi-Supervision Network for Sequential Deepfake Detection
Ruiyang Xia
Decheng Liu
Jie Li
Lin Yuan
N. Wang
Xinbo Gao
28
17
0
06 Jul 2023
GEST: the Graph of Events in Space and Time as a Common Representation
  between Vision and Language
GEST: the Graph of Events in Space and Time as a Common Representation between Vision and Language
Mihai Masala
Nicolae Cudlenco
Traian Rebedea
Marius Leordeanu
14
0
0
22 May 2023
Image-to-Text Translation for Interactive Image Recognition: A
  Comparative User Study with Non-Expert Users
Image-to-Text Translation for Interactive Image Recognition: A Comparative User Study with Non-Expert Users
Wataru Kawabe
Yusuke Sugano
VLM
35
2
0
11 May 2023
Multi-modal Machine Learning in Engineering Design: A Review and Future
  Directions
Multi-modal Machine Learning in Engineering Design: A Review and Future Directions
Binyang Song
Ruilin Zhou
Faez Ahmed
AI4CE
37
40
0
14 Feb 2023
Overcoming Catastrophic Forgetting by XAI
Overcoming Catastrophic Forgetting by XAI
Giang Nguyen
18
0
0
25 Nov 2022
Improving Radiology Summarization with Radiograph and Anatomy Prompts
Improving Radiology Summarization with Radiograph and Anatomy Prompts
Jinpeng Hu
Zhihong Chen
Yang Liu
Xiang Wan
Tsung-Hui Chang
MedIm
34
8
0
15 Oct 2022
M^4I: Multi-modal Models Membership Inference
M^4I: Multi-modal Models Membership Inference
Pingyi Hu
Zihan Wang
Ruoxi Sun
Hu Wang
Minhui Xue
39
26
0
15 Sep 2022
Facial Expression Recognition and Image Description Generation in
  Vietnamese
Facial Expression Recognition and Image Description Generation in Vietnamese
Khang Nhut Lam
Kim Thi-Thanh Nguyen
Loc Huu Nguy
Jugal Kalita
3DH
CVBM
28
1
0
12 Aug 2022
Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2
Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2
Xinghui Zhou
Xin Jin
Jianwen Lv
Heng Huang
Ming Mao
Shuai Cui
CoGe
18
0
0
09 Aug 2022
Retrieval-Augmented Transformer for Image Captioning
Retrieval-Augmented Transformer for Image Captioning
Sara Sarto
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
24
57
0
26 Jul 2022
Are metrics measuring what they should? An evaluation of image
  captioning task metrics
Are metrics measuring what they should? An evaluation of image captioning task metrics
Othón González-Chávez
Guillermo Ruiz
Daniela Moctezuma
Tania A. Ramirez-delreal
21
9
0
04 Jul 2022
Measuring Representational Harms in Image Captioning
Measuring Representational Harms in Image Captioning
Angelina Wang
Solon Barocas
Kristen Laird
Hanna M. Wallach
21
51
0
14 Jun 2022
Beyond Greedy Search: Tracking by Multi-Agent Reinforcement
  Learning-based Beam Search
Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam Search
Tianlin Li
Zhe Chen
Bo Jiang
Jin Tang
Bin Luo
Dacheng Tao
45
18
0
19 May 2022
Diverse Image Captioning with Grounded Style
Diverse Image Captioning with Grounded Style
Franz Klein
Shweta Mahajan
S. Roth
22
7
0
03 May 2022
Controllable Image Captioning
Luka Maxwell
33
0
0
28 Apr 2022
On Distinctive Image Captioning via Comparing and Reweighting
On Distinctive Image Captioning via Comparing and Reweighting
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
38
16
0
08 Apr 2022
CaMEL: Mean Teacher Learning for Image Captioning
CaMEL: Mean Teacher Learning for Image Captioning
Manuele Barraco
Matteo Stefanini
Marcella Cornia
S. Cascianelli
Lorenzo Baraldi
Rita Cucchiara
ViT
VLM
38
27
0
21 Feb 2022
Deep Learning Approaches on Image Captioning: A Review
Deep Learning Approaches on Image Captioning: A Review
Taraneh Ghandi
H. Pourreza
H. Mahyar
VLM
19
89
0
31 Jan 2022
Do Smart Glasses Dream of Sentimental Visions? Deep Emotionship Analysis
  for Eyewear Devices
Do Smart Glasses Dream of Sentimental Visions? Deep Emotionship Analysis for Eyewear Devices
Yingying Zhao
Yuhu Chang
Yutian Lu
Yujiang Wang
Mingzhi Dong
...
Robert P. Dick
Fan Yang
T. Lu
Ning Gu
L. Shang
41
9
0
24 Jan 2022
An Integrated Approach for Video Captioning and Applications
An Integrated Approach for Video Captioning and Applications
Soheyla Amirian
T. Taha
Khaled Rasheed
H. Arabnia
31
1
0
23 Jan 2022
Interactive Attention AI to translate low light photos to captions for
  night scene understanding in women safety
Interactive Attention AI to translate low light photos to captions for night scene understanding in women safety
A. Rajagopal
V. Nirmala
Arun Muthuraj Vedamanickam
19
0
0
04 Jan 2022
Neural Attention for Image Captioning: Review of Outstanding Methods
Neural Attention for Image Captioning: Review of Outstanding Methods
Zanyar Zohourianshahzadi
Jugal Kalita
VLM
32
45
0
29 Nov 2021
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic
  Arithmetic
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Yoad Tewel
Yoav Shalev
Idan Schwartz
Lior Wolf
VLM
34
192
0
29 Nov 2021
Cross Modification Attention Based Deliberation Model for Image
  Captioning
Cross Modification Attention Based Deliberation Model for Image Captioning
Zheng Lian
Yanan Zhang
Haichang Li
Rui Wang
Xiaohui Hu
24
4
0
17 Sep 2021
Bornon: Bengali Image Captioning with Transformer-based Deep learning
  approach
Bornon: Bengali Image Captioning with Transformer-based Deep learning approach
Faisal Muhammad Shah
Mayeesha Humaira
Md Abidur Rahman Khan Jim
Amit Saha Ami
Shimul Paul
23
17
0
11 Sep 2021
Journalistic Guidelines Aware News Image Captioning
Journalistic Guidelines Aware News Image Captioning
Xuewen Yang
Svebor Karaman
Joel R. Tetreault
Alex Jaimes
16
27
0
07 Sep 2021
Group-based Distinctive Image Captioning with Memory Attention
Group-based Distinctive Image Captioning with Memory Attention
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
21
18
0
20 Aug 2021
X-modaler: A Versatile and High-performance Codebase for Cross-modal
  Analytics
X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics
Yehao Li
Yingwei Pan
Jingwen Chen
Ting Yao
Tao Mei
VLM
19
31
0
18 Aug 2021
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language
  Models
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models
Zheyuan Liu
Cristian Rodriguez-Opazo
Damien Teney
Stephen Gould
VLM
19
192
0
09 Aug 2021
ReFormer: The Relational Transformer for Image Captioning
ReFormer: The Relational Transformer for Image Captioning
Xuewen Yang
Yingru Liu
Xin Wang
ViT
17
54
0
29 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
From Show to Tell: A Survey on Deep Learning-based Image Captioning
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV
VLM
MLLM
67
254
0
14 Jul 2021
Multi-Modal Image Captioning for the Visually Impaired
Multi-Modal Image Captioning for the Visually Impaired
Hiba Ahsan
Nikita Bhalla
Daivat Bhatt
Kaivankumar Shah
25
20
0
17 May 2021
Discrete-continuous Action Space Policy Gradient-based Attention for
  Image-Text Matching
Discrete-continuous Action Space Policy Gradient-based Attention for Image-Text Matching
Shiyang Yan
Li Yu
Yuan Xie
39
34
0
21 Apr 2021
Automatic Generation of Descriptive Titles for Video Clips Using Deep
  Learning
Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning
Soheyla Amirian
Khaled Rasheed
T. Taha
H. Arabnia
VLM
VGen
19
23
0
07 Apr 2021
Dynamic Attention guided Multi-Trajectory Analysis for Single Object
  Tracking
Dynamic Attention guided Multi-Trajectory Analysis for Single Object Tracking
Tianlin Li
Zhe Chen
Jin Tang
Bin Luo
Yaowei Wang
Yonghong Tian
Feng Wu
26
44
0
30 Mar 2021
Analysis of Convolutional Decoder for Image Caption Generation
Analysis of Convolutional Decoder for Image Caption Generation
Sulabh Katiyar
S. Borgohain
18
0
0
08 Mar 2021
Comparative evaluation of CNN architectures for Image Caption Generation
Comparative evaluation of CNN architectures for Image Caption Generation
Sulabh Katiyar
S. Borgohain
19
24
0
23 Feb 2021
Image Captioning using Deep Stacked LSTMs, Contextual Word Embeddings
  and Data Augmentation
Image Captioning using Deep Stacked LSTMs, Contextual Word Embeddings and Data Augmentation
Sulabh Katiyar
S. Borgohain
VLM
24
14
0
22 Feb 2021
Intrinsic Image Captioning Evaluation
Intrinsic Image Captioning Evaluation
Chao Zeng
Sam Kwong
21
0
0
14 Dec 2020
Robust Image Captioning
Robust Image Captioning
Daniel Yarnell
Xian Wang
21
0
0
06 Dec 2020
Dual Attention on Pyramid Feature Maps for Image Captioning
Dual Attention on Pyramid Feature Maps for Image Captioning
Litao Yu
Jian Zhang
Qiang Wu
21
47
0
02 Nov 2020
Diverse Image Captioning with Context-Object Split Latent Spaces
Diverse Image Captioning with Context-Object Split Latent Spaces
Shweta Mahajan
Stefan Roth
19
41
0
02 Nov 2020
Pedestrian Trajectory Prediction with Convolutional Neural Networks
Pedestrian Trajectory Prediction with Convolutional Neural Networks
Simone Zamboni
Zekarias T. Kefato
Sarunas Girdzijauskas
Noren Christoffer
L. D. Col
HAI
13
93
0
12 Oct 2020
123
Next