SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text

18 May 2018

Papers citing "SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text"

50 / 50 papers shown

Title
Semi-supervised Chinese Poem-to-Painting Generation via Cycle-consistent Adversarial Networks Zhengyang Lu Tianhao Guo Feng Wang GAN 31 1 0 25 Oct 2024
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis Uri Berger Gabriel Stanovsky Omri Abend Lea Frermann 35 0 0 09 Aug 2024
A Survey of Personality, Persona, and Profile in Conversational Agents and Chatbots Richard Sutcliffe 30 3 0 31 Dec 2023
Emotional Theory of Mind: Bridging Fast Visual Processing with Slow Linguistic Reasoning Yasaman Etesam Özge Nilay Yalçin Chuxuan Zhang Angelica Lim 35 2 0 30 Oct 2023
ADS-Cap: A Framework for Accurate and Diverse Stylized Captioning with Unpaired Stylistic Corpora Ka Leong Cheng Zheng Ma Shi Zong Jianbing Zhang Xinyu Dai Jiajun Chen DiffM 27 3 0 02 Aug 2023
Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences Di Yang Hongyu Chen Xinglin Hou T. Ge Yuning Jiang Qin Jin 36 0 0 31 Jul 2023
Generating Visual Spatial Description via Holistic 3D Scene Understanding Yu Zhao Hao Fei Wei Ji Jianguo Wei Meishan Zhang Hao Fei Tat-Seng Chua 28 33 0 19 May 2023
Learning Combinatorial Prompts for Universal Controllable Image Captioning Zhen Wang Jun Xiao Yueting Zhuang Fei Gao Jian Shao Long Chen 60 5 0 11 Mar 2023
ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing Zequn Zeng Hao Zhang Zhengjue Wang Ruiying Lu Dongsheng Wang Bo Chen BDL DiffM 19 33 0 04 Mar 2023
Style-Aware Contrastive Learning for Multi-Style Image Captioning Yucheng Zhou Guodong Long 25 22 0 26 Jan 2023
CLID: Controlled-Length Image Descriptions with Limited Data Elad Hirsch A. Tal VLM 3DV 22 4 0 27 Nov 2022
Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation Yu Zhao Jianguo Wei Zhichao Lin Yueheng Sun Meishan Zhang Hao Fei 25 16 0 20 Oct 2022
Learning Distinct and Representative Styles for Image Captioning Qi Chen Chaorui Deng Qi Wu VLM 37 23 0 17 Sep 2022
Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2 Xinghui Zhou Xin Jin Jianwen Lv Heng Huang Ming Mao Shuai Cui CoGe 18 0 0 09 Aug 2022
Diverse Image Captioning with Grounded Style Franz Klein Shweta Mahajan S. Roth 22 7 0 03 May 2022
Vision Transformers in Medical Computer Vision -- A Contemplative Retrospection Arshi Parvaiz Muhammad Anwaar Khalid Rukhsana Zafar Huma Ameer M. Ali M. Fraz MedIm 18 59 0 29 Mar 2022
Controllable Video Captioning with an Exemplar Sentence Yitian Yuan Lin Ma Jingwen Wang Wenwu Zhu 18 20 0 02 Dec 2021
Syntax Customized Video Captioning by Imitating Exemplar Sentences Yitian Yuan Lin Ma Wenwu Zhu 22 6 0 02 Dec 2021
Generating More Pertinent Captions by Leveraging Semantics and Style on Multi-Source Datasets Marcella Cornia Lorenzo Baraldi G. Fiameni Rita Cucchiara 20 12 0 24 Nov 2021
Similar Scenes arouse Similar Emotions: Parallel Data Augmentation for Stylized Image Captioning Guodun Li Yuchen Zhai Zehao Lin Yin Zhang 56 21 0 26 Aug 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning Matteo Stefanini Marcella Cornia Lorenzo Baraldi S. Cascianelli G. Fiameni Rita Cucchiara 3DV VLM MLLM 67 254 0 14 Jul 2021
SMURF: SeMantic and linguistic UndeRstanding Fusion for Caption Evaluation via Typicality Analysis Joshua Forster Feinglass Yezhou Yang 21 21 0 02 Jun 2021
Towards Accurate Text-based Image Captioning with Content Diversity Exploration Guanghui Xu Shuaicheng Niu Mingkui Tan Yucheng Luo Qing Du Qi Wu DiffM 17 56 0 23 Apr 2021
Human-like Controllable Image Captioning with Verb-specific Semantic Roles Long Chen Zhihong Jiang Jun Xiao Wei Liu 30 74 0 22 Mar 2021
Structural and Functional Decomposition for Personality Image Captioning in a Communication Game Minh-Thu Nguyen Duy Phung Minh Hoai Thien Huu Nguyen 25 4 0 17 Nov 2020
Multimodal Research in Vision and Language: A Review of Current and Emerging Trends Shagun Uppal Sarthak Bhagat Devamanyu Hazarika Navonil Majumdar Soujanya Poria Roger Zimmermann Amir Zadeh 23 6 0 19 Oct 2020
Denoising Large-Scale Image Captioning from Alt-text Data using Content Selection Models Khyathi Raghavi Chandu Piyush Sharma Soravit Changpinyo Ashish V. Thapliyal Radu Soricut DiffM VLM 27 3 0 10 Sep 2020
Length-Controllable Image Captioning Chaorui Deng Ning Ding Mingkui Tan Qi Wu VLM 33 56 0 19 Jul 2020
Diverse and Styled Image Captioning Using SVD-Based Mixture of Recurrent Experts Marzi Heidari M. Ghatee A. Nickabadi Arash Pourhasan Nezhad DiffM MoE 35 1 0 07 Jul 2020
Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs Shizhe Chen Qin Jin Peng Wang Qi Wu DiffM 36 215 0 01 Mar 2020
Knowledge-Enriched Visual Storytelling Chao-Chun Hsu Zi-Yuan Chen Chi-Yang Hsu Chih-Chia Li Tzu-Yuan Lin Ting-Hao 'Kenneth' Huang Lun-Wei Ku DiffM 27 43 0 03 Dec 2019
Aesthetic Image Captioning From Weakly-Labelled Photographs Koustav Ghosal A. Rana A. Smolic 27 25 0 29 Aug 2019
Towards Unsupervised Image Captioning with Shared Multimodal Embeddings Iro Laina Christian Rupprecht Nassir Navab SSL 21 103 0 25 Aug 2019
Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck Shuang Ma Daniel J. McDuff Yale Song 25 22 0 19 Aug 2019
Towards Generating Stylized Image Captions via Adversarial Training Omid Mohamad Nezami Mark Dras Stephen Wan Cécile Paris Len Hamey GAN 14 18 0 08 Aug 2019
Image Captioning using Facial Expression and Attention Omid Mohamad Nezami Mark Dras Stephen Wan Cécile Paris CVBM 17 8 0 08 Aug 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods Aditya Mogadala M. Kalimuthu Dietrich Klakow VLM 20 132 0 22 Jul 2019
Aesthetic Attributes Assessment of Images Xin Jin Le Wu Geng Zhao Xiaodong Li Xiaokun Zhang Shiming Ge Dongqing Zou Bin Zhou Xinghui Zhou 22 36 0 11 Jul 2019
Visual Story Post-Editing Ting-Yao Hsu Huang Chieh-Yang Yen-Chia Hsu Ting-Hao 'Kenneth' Huang 11 20 0 05 Jun 2019
Reasoning Visual Dialogs with Structural and Partial Observations Zilong Zheng Wenguan Wang Siyuan Qi Song-Chun Zhu 39 117 0 11 Apr 2019
Good News, Everyone! Context driven entity-aware captioning for news images Ali Furkan Biten Lluís Gómez Marçal Rusiñol Dimosthenis Karatzas 19 139 0 02 Apr 2019
Dixit: Interactive Visual Storytelling via Term Manipulation Chao-Chun Hsu Yu-Hua Chen Zi-Yuan Chen Hsin-Yu Lin Ting-Hao 'Kenneth' Huang Lun-Wei Ku DiffM VGen 11 1 0 06 Mar 2019
On How Users Edit Computer-Generated Visual Stories Ting-Yao Hsu Yen-Chia Hsu Ting-Hao 'Kenneth' Huang 18 14 0 22 Feb 2019
Pedestrian Attribute Recognition: A Survey Tianlin Li Shaofei Zheng Rui Yang Aihua Zheng Zhe Chen Jin Tang Bin Luo CVBM 28 127 0 22 Jan 2019
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions Marcella Cornia Lorenzo Baraldi Rita Cucchiara DiffM 28 175 0 26 Nov 2018
Image Chat: Engaging Grounded Conversations Kurt Shuster Samuel Humeau Antoine Bordes Jason Weston 23 115 0 02 Nov 2018
Engaging Image Captioning Via Personality Kurt Shuster Samuel Humeau Hexiang Hu Antoine Bordes Jason Weston 31 149 0 25 Oct 2018
Unsupervised Stylish Image Description Generation via Domain Layer Norm Cheng Kuan Chen Zhufeng Pan Min Sun Ming-Yu Liu 20 29 0 11 Sep 2018
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation Yonghui Wu M. Schuster Z. Chen Quoc V. Le Mohammad Norouzi ... Alex Rudnick Oriol Vinyals G. Corrado Macduff Hughes J. Dean AIMat 716 6,746 0 26 Sep 2016
Effective Approaches to Attention-based Neural Machine Translation Thang Luong Hieu H. Pham Christopher D. Manning 218 7,925 0 17 Aug 2015