Convolutional Image Captioning

24 November 2017

Papers citing "Convolutional Image Captioning"

50 / 103 papers shown

Title
ChatBEV: A Visual Language Model that Understands BEV Maps Qingyao Xu Tian Jin Guang Chen Yanfeng Wang Yuyao Zhang 51 0 0 18 Mar 2025
Pixels to Prose: Understanding the art of Image Captioning Hrishikesh Singh Aarti Sharma Millie Pant 3DV VLM 25 0 0 28 Aug 2024
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis Uri Berger Gabriel Stanovsky Omri Abend Lea Frermann 35 0 0 09 Aug 2024
Compressed Image Captioning using CNN-based Encoder-Decoder Framework Md Alif Mahmudul Hasan Shovon Bhowmick 50 1 0 28 Apr 2024
Context-Guided Spatio-Temporal Video Grounding Xin Gu Hengrui Fan Yan Huang Tiejian Luo Libo Zhang 35 14 0 03 Jan 2024
Survey of Social Bias in Vision-Language Models Nayeon Lee Yejin Bang Holy Lovenia Samuel Cahyawijaya Wenliang Dai Pascale Fung VLM 47 16 0 24 Sep 2023
Diagnosing Human-object Interaction Detectors Fangrui Zhu Yiming Xie Weidi Xie Huaizu Jiang 30 7 0 16 Aug 2023
MMNet: Multi-Collaboration and Multi-Supervision Network for Sequential Deepfake Detection Ruiyang Xia Decheng Liu Jie Li Lin Yuan N. Wang Xinbo Gao 28 17 0 06 Jul 2023
GEST: the Graph of Events in Space and Time as a Common Representation between Vision and Language Mihai Masala Nicolae Cudlenco Traian Rebedea Marius Leordeanu 14 0 0 22 May 2023
Image-to-Text Translation for Interactive Image Recognition: A Comparative User Study with Non-Expert Users Wataru Kawabe Yusuke Sugano VLM 35 2 0 11 May 2023
Multi-modal Machine Learning in Engineering Design: A Review and Future Directions Binyang Song Ruilin Zhou Faez Ahmed AI4CE 37 40 0 14 Feb 2023
Overcoming Catastrophic Forgetting by XAI Giang Nguyen 18 0 0 25 Nov 2022
Improving Radiology Summarization with Radiograph and Anatomy Prompts Jinpeng Hu Zhihong Chen Yang Liu Xiang Wan Tsung-Hui Chang MedIm 34 8 0 15 Oct 2022
M^4I: Multi-modal Models Membership Inference Pingyi Hu Zihan Wang Ruoxi Sun Hu Wang Minhui Xue 39 26 0 15 Sep 2022
Facial Expression Recognition and Image Description Generation in Vietnamese Khang Nhut Lam Kim Thi-Thanh Nguyen Loc Huu Nguy Jugal Kalita 3DH CVBM 28 1 0 12 Aug 2022
Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2 Xinghui Zhou Xin Jin Jianwen Lv Heng Huang Ming Mao Shuai Cui CoGe 18 0 0 09 Aug 2022
Retrieval-Augmented Transformer for Image Captioning Sara Sarto Marcella Cornia Lorenzo Baraldi Rita Cucchiara 24 57 0 26 Jul 2022
Are metrics measuring what they should? An evaluation of image captioning task metrics Othón González-Chávez Guillermo Ruiz Daniela Moctezuma Tania A. Ramirez-delreal 21 9 0 04 Jul 2022
Measuring Representational Harms in Image Captioning Angelina Wang Solon Barocas Kristen Laird Hanna M. Wallach 21 51 0 14 Jun 2022
Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam Search Tianlin Li Zhe Chen Bo Jiang Jin Tang Bin Luo Dacheng Tao 45 18 0 19 May 2022
Diverse Image Captioning with Grounded Style Franz Klein Shweta Mahajan S. Roth 22 7 0 03 May 2022
Controllable Image Captioning Luka Maxwell 33 0 0 28 Apr 2022
On Distinctive Image Captioning via Comparing and Reweighting Jiuniu Wang Wenjia Xu Qingzhong Wang Antoni B. Chan 38 16 0 08 Apr 2022
CaMEL: Mean Teacher Learning for Image Captioning Manuele Barraco Matteo Stefanini Marcella Cornia S. Cascianelli Lorenzo Baraldi Rita Cucchiara ViT VLM 38 27 0 21 Feb 2022
Deep Learning Approaches on Image Captioning: A Review Taraneh Ghandi H. Pourreza H. Mahyar VLM 19 89 0 31 Jan 2022
Do Smart Glasses Dream of Sentimental Visions? Deep Emotionship Analysis for Eyewear Devices Yingying Zhao Yuhu Chang Yutian Lu Yujiang Wang Mingzhi Dong ... Robert P. Dick Fan Yang T. Lu Ning Gu L. Shang 41 9 0 24 Jan 2022
An Integrated Approach for Video Captioning and Applications Soheyla Amirian T. Taha Khaled Rasheed H. Arabnia 31 1 0 23 Jan 2022
Interactive Attention AI to translate low light photos to captions for night scene understanding in women safety A. Rajagopal V. Nirmala Arun Muthuraj Vedamanickam 19 0 0 04 Jan 2022
Neural Attention for Image Captioning: Review of Outstanding Methods Zanyar Zohourianshahzadi Jugal Kalita VLM 32 45 0 29 Nov 2021
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic Yoad Tewel Yoav Shalev Idan Schwartz Lior Wolf VLM 34 192 0 29 Nov 2021
Cross Modification Attention Based Deliberation Model for Image Captioning Zheng Lian Yanan Zhang Haichang Li Rui Wang Xiaohui Hu 24 4 0 17 Sep 2021
Bornon: Bengali Image Captioning with Transformer-based Deep learning approach Faisal Muhammad Shah Mayeesha Humaira Md Abidur Rahman Khan Jim Amit Saha Ami Shimul Paul 23 17 0 11 Sep 2021
Journalistic Guidelines Aware News Image Captioning Xuewen Yang Svebor Karaman Joel R. Tetreault Alex Jaimes 16 27 0 07 Sep 2021
Group-based Distinctive Image Captioning with Memory Attention Jiuniu Wang Wenjia Xu Qingzhong Wang Antoni B. Chan 21 18 0 20 Aug 2021
X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics Yehao Li Yingwei Pan Jingwen Chen Ting Yao Tao Mei VLM 19 31 0 18 Aug 2021
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models Zheyuan Liu Cristian Rodriguez-Opazo Damien Teney Stephen Gould VLM 19 192 0 09 Aug 2021
ReFormer: The Relational Transformer for Image Captioning Xuewen Yang Yingru Liu Xin Wang ViT 17 54 0 29 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning Matteo Stefanini Marcella Cornia Lorenzo Baraldi S. Cascianelli G. Fiameni Rita Cucchiara 3DV VLM MLLM 67 254 0 14 Jul 2021
Multi-Modal Image Captioning for the Visually Impaired Hiba Ahsan Nikita Bhalla Daivat Bhatt Kaivankumar Shah 25 20 0 17 May 2021
Discrete-continuous Action Space Policy Gradient-based Attention for Image-Text Matching Shiyang Yan Li Yu Yuan Xie 39 34 0 21 Apr 2021
Automatic Generation of Descriptive Titles for Video Clips Using Deep Learning Soheyla Amirian Khaled Rasheed T. Taha H. Arabnia VLM VGen 19 23 0 07 Apr 2021
Dynamic Attention guided Multi-Trajectory Analysis for Single Object Tracking Tianlin Li Zhe Chen Jin Tang Bin Luo Yaowei Wang Yonghong Tian Feng Wu 26 44 0 30 Mar 2021
Analysis of Convolutional Decoder for Image Caption Generation Sulabh Katiyar S. Borgohain 18 0 0 08 Mar 2021
Comparative evaluation of CNN architectures for Image Caption Generation Sulabh Katiyar S. Borgohain 19 24 0 23 Feb 2021
Image Captioning using Deep Stacked LSTMs, Contextual Word Embeddings and Data Augmentation Sulabh Katiyar S. Borgohain VLM 24 14 0 22 Feb 2021
Intrinsic Image Captioning Evaluation Chao Zeng Sam Kwong 21 0 0 14 Dec 2020
Robust Image Captioning Daniel Yarnell Xian Wang 21 0 0 06 Dec 2020
Dual Attention on Pyramid Feature Maps for Image Captioning Litao Yu Jian Zhang Qiang Wu 21 47 0 02 Nov 2020
Diverse Image Captioning with Context-Object Split Latent Spaces Shweta Mahajan Stefan Roth 19 41 0 02 Nov 2020
Pedestrian Trajectory Prediction with Convolutional Neural Networks Simone Zamboni Zekarias T. Kefato Sarunas Girdzijauskas Noren Christoffer L. D. Col HAI 13 93 0 12 Oct 2020