Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space

19 November 2017
Liwei Wang
A. Schwing
Svetlana Lazebnik
    CoGe

Papers citing "Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space"

50 / 79 papers shown
Variational Prefix Tuning for Diverse and Accurate Code Summarization Using Pre-trained Language Models
Junda Zhao
Yuliang Song
Eldan Cohen
21
0
0
14 May 2025
Group-based Distinctive Image Captioning with Memory Difference Encoding and Attention
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
45
0
0
03 Apr 2025
Harnessing the Power of LLMs: Automating Unit Test Generation for High-Performance Computing
Rabimba Karanjai
Aftab Hussain
Md Rafiqul Islam Rabin
Lei Xu
Weidong Shi
Mohammad Amin Alipour
62
2
0
06 Jul 2024
Self-Supervised Learning Based Handwriting Verification
Mihir Chauhan
Mohammad Abuzar Shaikh
Abhishek Satbhai
Mir Basheer Ali
B. Ramamurthy
Mingchen Gao
Siwei Lyu
Sargur Srihari
24
2
0
28 May 2024
Sentiment-oriented Transformer-based Variational Autoencoder Network for Live Video Commenting
Fengyi Fu
Shancheng Fang
Weidong Chen
Zhendong Mao
ViT
VGen
31
4
0
19 Apr 2024
Conditional Unscented Autoencoders for Trajectory Prediction
Faris Janjos
Marcel Hallgarten
Anthony Knittel
Maxim Dolgov
Andreas Zell
J. Marius Zöllner
37
7
0
30 Oct 2023
ADS-Cap: A Framework for Accurate and Diverse Stylized Captioning with Unpaired Stylistic Corpora
Ka Leong Cheng
Zheng Ma
Shi Zong
Jianbing Zhang
Xinyu Dai
Jiajun Chen
DiffM
19
3
0
02 Aug 2023
ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing
Zequn Zeng
Hao Zhang
Zhengjue Wang
Ruiying Lu
Dongsheng Wang
Bo Chen
BDL
DiffM
19
33
0
04 Mar 2023
CodeLMSec Benchmark: Systematically Evaluating and Finding Security Vulnerabilities in Black-Box Code Language Models
Hossein Hajipour
Keno Hassler
Thorsten Holz
Lea Schönherr
Mario Fritz
ELM
40
20
0
08 Feb 2023
IC3: Image Captioning by Committee Consensus
David M. Chan
Austin Myers
Sudheendra Vijayanarasimhan
David A. Ross
John F. Canny
32
17
0
02 Feb 2023
DuNST: Dual Noisy Self Training for Semi-Supervised Controllable Text Generation
Yuxi Feng
Xiaoyuan Yi
Xiting Wang
L. Lakshmanan
Xing Xie
DiffM
35
5
0
16 Dec 2022
Towards Diverse, Relevant and Coherent Open-Domain Dialogue Generation via Hybrid Latent Variables
Bin Sun
Yitong Li
Fei Mi
Weichao Wang
Yiwei Li
Kan Li
BDL
35
5
0
02 Dec 2022
Make-A-Story: Visual Memory Conditioned Consistent Story Generation
Tanzila Rahman
Hsin-Ying Lee
Jian Ren
Sergey Tulyakov
Shweta Mahajan
Leonid Sigal
DiffM
19
68
0
23 Nov 2022
PCAE: A Framework of Plug-in Conditional Auto-Encoder for Controllable Text Generation
Haoqin Tu
Zhongliang Yang
Jinshuai Yang
Siyu Zhang
Yong Huang
23
7
0
07 Oct 2022
Vision+X: A Survey on Multimodal Learning in the Light of Data
Ye Zhu
Yuehua Wu
N. Sebe
Yan Yan
33
16
0
05 Oct 2022
Learning Distinct and Representative Styles for Image Captioning
Qi Chen
Chaorui Deng
Qi Wu
VLM
34
23
0
17 Sep 2022
Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text Generation
Jinyi Hu
Xiaoyuan Yi
Wenhao Li
Maosong Sun
Xing Xie
21
21
0
13 Jul 2022
Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation
Ye Zhu
Yuehua Wu
Kyle Olszewski
Jian Ren
Sergey Tulyakov
Yan Yan
DiffM
25
47
0
15 Jun 2022
Variational Transfer Learning using Cross-Domain Latent Modulation
Jinyong Hou
Jeremiah D. Deng
Stephen Cranefield
Xuejie Ding
DRL
25
1
0
31 May 2022
Fine-grained Image Captioning with CLIP Reward
Jaemin Cho
Seunghyun Yoon
Ajinkya Kale
Franck Dernoncourt
Trung Bui
Joey Tianyi Zhou
CLIP
131
76
0
26 May 2022
Diverse Image Captioning with Grounded Style
Franz Klein
Shweta Mahajan
S. Roth
22
7
0
03 May 2022
SceneTrilogy: On Human Scene-Sketch and its Complementarity with Photo and Text
Pinaki Nath Chowdhury
A. Bhunia
Aneeshan Sain
Subhadeep Koley
Tao Xiang
Yi-Zhe Song
38
29
0
25 Apr 2022
FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context
Pinaki Nath Chowdhury
Aneeshan Sain
A. Bhunia
Tao Xiang
Yulia Gryaditskaya
Yi-Zhe Song
3DV
41
52
0
04 Mar 2022
FlowEval: A Consensus-Based Dialogue Evaluation Framework Using Segment Act Flows
Jianqiao Zhao
Yanyang Li
Wanyu Du
Yangfeng Ji
Dong Yu
M. Lyu
Liwei Wang
20
4
0
14 Feb 2022
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Yoad Tewel
Yoav Shalev
Idan Schwartz
Lior Wolf
VLM
34
192
0
29 Nov 2021
Group-based Distinctive Image Captioning with Memory Attention
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
16
18
0
20 Aug 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV
VLM
MLLM
67
254
0
14 Jul 2021
Saying the Unseen: Video Descriptions via Dialog Agents
Ye Zhu
Yu Wu
Yi Yang
Yan Yan
22
6
0
26 Jun 2021
Human-like Controllable Image Captioning with Verb-specific Semantic Roles
Long Chen
Zhihong Jiang
Jun Xiao
Wei Liu
30
74
0
22 Mar 2021
Cross-Domain Latent Modulation for Variational Transfer Learning
Jinyong Hou
Jeremiah D. Deng
Stephen Cranefield
Xuejie Ding
DRL
24
1
0
21 Dec 2020
Image Captioning with Context-Aware Auxiliary Guidance
Zeliang Song
Xiaofei Zhou
Zhendong Mao
Jianlong Tan
28
31
0
10 Dec 2020
CapWAP: Captioning with a Purpose
Adam Fisch
Kenton Lee
Ming-Wei Chang
J. Clark
Regina Barzilay
8
11
0
09 Nov 2020
Diverse Image Captioning with Context-Object Split Latent Spaces
Shweta Mahajan
Stefan Roth
11
41
0
02 Nov 2020
Dense Relational Image Captioning via Multi-task Triple-Stream Networks
Dong-Jin Kim
Tae-Hyun Oh
Jinsoo Choi
In So Kweon
24
27
0
08 Oct 2020
Target Conditioning for One-to-Many Generation
Marie-Anne Lachaux
Armand Joulin
Guillaume Lample
6
13
0
21 Sep 2020
MPCC: Matching Priors and Conditionals for Clustering
N. Astorga
P. Huijse
P. Protopapas
P. Estévez
14
2
0
21 Aug 2020
Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents
Ye Zhu
Yu Wu
Yi Yang
Yan Yan
11
13
0
18 Aug 2020
Comprehensive Image Captioning via Scene Graph Decomposition
Yiwu Zhong
Liwei Wang
Jianshu Chen
Dong Yu
Yin Li
87
124
0
23 Jul 2020
Length-Controllable Image Captioning
Chaorui Deng
Ning Ding
Mingkui Tan
Qi Wu
VLM
25
56
0
19 Jul 2020
Diverse and Styled Image Captioning Using SVD-Based Mixture of Recurrent Experts
Marzi Heidari
M. Ghatee
A. Nickabadi
Arash Pourhasan Nezhad
DiffM
MoE
27
1
0
07 Jul 2020
Geodesics in fibered latent spaces: A geometric approach to learning correspondences between conditions
Tariq Daouda
Reda Chhaibi
Prudencio Tossou
A. Villani
6
2
0
16 May 2020
Better Captioning with Sequence-Level Exploration
Jia Chen
Qin Jin
37
12
0
08 Mar 2020
Analysis of diversity-accuracy tradeoff in image captioning
Ruotian Luo
Gregory Shakhnarovich
14
12
0
27 Feb 2020
Latent Normalizing Flows for Many-to-Many Cross-Domain Mappings
Shweta Mahajan
Iryna Gurevych
Stefan Roth
DRL
18
36
0
16 Feb 2020
A Discrete CVAE for Response Generation on Short-Text Conversation
Jun Gao
Wei Bi
Xiaojiang Liu
Junhui Li
Guodong Zhou
Shuming Shi
DRL
20
49
0
22 Nov 2019
Pre-train and Plug-in: Flexible Conditional Text Generation with Variational Auto-Encoders
Yu Duan
Canwen Xu
Jiaxin Pei
Jialong Han
Chenliang Li
16
42
0
10 Nov 2019
An Information Theory Approach on Deciding Spectroscopic Follow Ups
Javiera Astudillo
P. Protopapas
K. Pichara
P. Huijse
6
4
0
06 Nov 2019
Diverse Video Captioning Through Latent Variable Expansion
Huanhou Xiao
Jinglun Shi
DiffM
30
15
0
26 Oct 2019
Zero-shot Learning via Simultaneous Generating and Learning
Hyeonwoo Yu
Beomhee Lee
VLM
GAN
17
54
0
21 Oct 2019
Probabilistic framework for solving Visual Dialog
Badri N. Patro
Anupriy
Vinay P. Namboodiri
BDL
30
13
0
11 Sep 2019