Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.13792
Cited By
LAFITE: Towards Language-Free Training for Text-to-Image Generation
27 November 2021
Yufan Zhou
Ruiyi Zhang
Changyou Chen
Chunyuan Li
Chris Tensmeyer
Tong Yu
Jiuxiang Gu
Jinhui Xu
Tong Sun
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LAFITE: Towards Language-Free Training for Text-to-Image Generation"
50 / 107 papers shown
Title
Evaluating Text-to-Image Synthesis with a Conditional Fréchet Distance
Jaywon Koo
J. Hernandez
Moayed Haji-Ali
Ziyan Yang
Vicente Ordonez
EGVM
72
0
0
27 Mar 2025
Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving
Lucas Nunes
Rodrigo Marcuzzi
Jens Behley
C. Stachniss
3DPC
83
0
0
27 Mar 2025
Towards Improved Text-Aligned Codebook Learning: Multi-Hierarchical Codebook-Text Alignment with Long Text
Guotao Liang
Baoquan Zhang
Zhiyuan Wen
Junteng Zhao
Yunming Ye
Kola Ye
Yao He
52
0
0
03 Mar 2025
GANFusion: Feed-Forward Text-to-3D with Diffusion in GAN Space
Souhaib Attaiki
Paul Guerrero
Duygu Ceylan
Niloy J. Mitra
M. Ovsjanikov
90
0
0
21 Dec 2024
CLIP-SR: Collaborative Linguistic and Image Processing for Super-Resolution
Bingwen Hu
Heng Liu
Zhedong Zheng
Ping Liu
SupR
83
0
0
16 Dec 2024
Reward Incremental Learning in Text-to-Image Generation
Maorong Wang
Jiafeng Mao
Xueting Wang
Toshihiko Yamasaki
EGVM
103
0
0
26 Nov 2024
Quantum-Brain: Quantum-Inspired Neural Network Approach to Vision-Brain Understanding
Hoang-Quan Nguyen
Xuan-Bac Nguyen
Hugh Churchill
Arabinda Kumar Choudhary
Pawan Sinha
S. Khan
Khoa Luu
69
1
0
20 Nov 2024
Phrase Decoupling Cross-Modal Hierarchical Matching and Progressive Position Correction for Visual Grounding
Minghong Xie
Hao Wu
Huafeng Li
Yafei Zhang
Dapeng Tao
Z. Yu
ObjD
40
1
0
31 Oct 2024
Diff-Instruct*: Towards Human-Preferred One-step Text-to-image Generative Models
Weijian Luo
C. Zhang
Debing Zhang
Zhengyang Geng
28
3
0
28 Oct 2024
Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences
Weijian Luo
EGVM
36
6
0
24 Oct 2024
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Jinbin Bai
Tian-Chun Ye
Wei Chow
Enxin Song
Qing-Guo Chen
Hefei Ling
Zhen Dong
Lei Zhu
63
13
0
10 Oct 2024
Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow
Fu-Yun Wang
Ling Yang
Zhaoyang Huang
Mengdi Wang
Hongsheng Li
34
13
0
09 Oct 2024
Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning
Jianxiong Li
Zhihao Wang
Jinliang Zheng
Xiaoai Zhou
Guanming Wang
...
Yu Liu
Jingjing Liu
Ya-Qin Zhang
Junzhi Yu
Xianyuan Zhan
38
2
0
02 Oct 2024
MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance
Debin Meng
Christos Tzelepis
Ioannis Patras
Georgios Tzimiropoulos
DiffM
31
0
0
17 Sep 2024
Language-Queried Target Sound Extraction Without Parallel Training Data
Hao Ma
Zhiyuan Peng
Xu Li
Yukai Li
Mingjie Shao
Qiuqiang Kong
Ju Liu
VLM
71
1
0
14 Sep 2024
Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques
Davide Clode da Silva
Marina Musse Bernardes
Nathalia Giacomini Ceretta
Gabriel Vaz de Souza
Gabriel Fonseca Silva
Rafael Heitor Bordini
S. Musse
MedIm
LM&MA
31
0
0
06 Sep 2024
A New Chinese Landscape Paintings Generation Model based on Stable Diffusion using DreamBooth
Yujia Gu
Xinyu Fang
Xueyuan Deng
Zihan Peng
Yinan Peng
DiffM
18
0
0
16 Aug 2024
Text2LiDAR: Text-guided LiDAR Point Cloud Generation via Equirectangular Transformer
Yang Wu
Kaihua Zhang
Jianjun Qian
Jin Xie
Jian Yang
DiffM
47
4
0
29 Jul 2024
GVDIFF: Grounded Text-to-Video Generation with Diffusion Models
Huanzhang Dou
Ruixiang Li
Wei Su
Xi Li
DiffM
39
1
0
02 Jul 2024
FastCLIP: A Suite of Optimization Techniques to Accelerate CLIP Training with Limited Resources
Xiyuan Wei
Fanjiang Ye
Ori Yonay
Xingyu Chen
Baixi Sun
Dingwen Tao
Tianbao Yang
VLM
CLIP
56
2
0
01 Jul 2024
Analyzing Quality, Bias, and Performance in Text-to-Image Generative Models
Nila Masrourisaadat
Nazanin Sedaghatkish
Fatemeh Sarshartehrani
Edward A. Fox
37
6
0
28 Jun 2024
Vision-Language Consistency Guided Multi-modal Prompt Learning for Blind AI Generated Image Quality Assessment
Jun Fu
Wei Zhou
Qiuping Jiang
Hantao Liu
Guangtao Zhai
VLM
CLIP
40
8
0
24 Jun 2024
Open-World Human-Object Interaction Detection via Multi-modal Prompts
Jie-jin Yang
Bingliang Li
Ailing Zeng
L. Zhang
Ruimao Zhang
VLM
32
8
0
11 Jun 2024
Improved Distribution Matching Distillation for Fast Image Synthesis
Tianwei Yin
Michael Gharbi
Taesung Park
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
DiffM
44
95
0
23 May 2024
RATLIP: Generative Adversarial CLIP Text-to-Image Synthesis Based on Recurrent Affine Transformations
Chengde Lin
Xijun Lu
Guangxi Chen
33
0
0
13 May 2024
PKU-AIGIQA-4K: A Perceptual Quality Assessment Database for Both Text-to-Image and Image-to-Image AI-Generated Images
Jiquan Yuan
Fanyi Yang
Jihe Li
Xinyan Cao
Jinming Che
Jinlong Lin
Xixin Cao
EGVM
31
2
0
29 Apr 2024
Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment
Tianwei Zhou
Songbai Tan
Wei Zhou
Yu Luo
Yuan-Gen Wang
Guanghui Yue
EGVM
36
10
0
23 Apr 2024
Accelerating Image Generation with Sub-path Linear Approximation Model
Chen Xu
Tian-Shu Song
Weixin Feng
Xubin Li
Tiezheng Ge
Bo Zheng
Limin Wang
39
11
0
22 Apr 2024
ANCHOR: LLM-driven News Subject Conditioning for Text-to-Image Synthesis
Aashish Anantha Ramakrishnan
Sharon X. Huang
Dongwon Lee
32
0
0
15 Apr 2024
Would Deep Generative Models Amplify Bias in Future Models?
Tianwei Chen
Yusuke Hirota
Mayu Otani
Noa Garcia
Yuta Nakashima
45
12
0
04 Apr 2024
Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction
Inhwan Bae
Junoh Lee
Hae-Gon Jeon
33
15
0
27 Mar 2024
CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusion model
S. Han
Joohee Kim
DiffM
CLIP
34
1
0
22 Mar 2024
Scaling Diffusion Models to Real-World 3D LiDAR Scene Completion
Lucas Nunes
Rodrigo Marcuzzi
Benedikt Mersch
Jens Behley
C. Stachniss
DiffM
31
15
0
20 Mar 2024
Can AI Outperform Human Experts in Creating Social Media Creatives?
Eunkyung Park
Raymond K. Wong
Junbum Kwon
38
0
0
19 Mar 2024
Rethinking cluster-conditioned diffusion models
Nikolas Adaloglou
Tim Kaiser
Félix D. P. Michels
M. Kollmann
VLM
37
3
0
01 Mar 2024
Contextualized Diffusion Models for Text-Guided Image and Video Generation
Ling Yang
Zhilong Zhang
Zhaochen Yu
Jingwei Liu
Minkai Xu
Stefano Ermon
Bin Cui
41
4
0
26 Feb 2024
Separable Multi-Concept Erasure from Diffusion Models
Mengnan Zhao
Lihe Zhang
Tianhang Zheng
Yuqiu Kong
Baocai Yin
50
9
0
03 Feb 2024
TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment
Jiquan Yuan
Xinyan Cao
Jinming Che
Qinyuan Wang
Sen Liang
Wei Ren
Jinlong Lin
Xixin Cao
EGVM
16
1
0
08 Jan 2024
Latte: Latent Diffusion Transformer for Video Generation
Xin Ma
Yaohui Wang
Gengyun Jia
Xinyuan Chen
Ziqiang Liu
Yuan-Fang Li
Cunjian Chen
Yu Qiao
DiffM
VGen
125
233
0
05 Jan 2024
Improving Diffusion-Based Image Synthesis with Context Prediction
Ling Yang
Jingwei Liu
Shenda Hong
Zhilong Zhang
Zhilin Huang
Zheming Cai
Wentao Zhang
Bin Cui
DiffM
43
33
0
04 Jan 2024
BrainVis: Exploring the Bridge between Brain and Visual Signals via Image Reconstruction
Honghao Fu
Zhiqi Shen
Jing Jih Chin
Hao Wang
DiffM
24
5
0
22 Dec 2023
Textual Prompt Guided Image Restoration
Qiuhai Yan
Aiwen Jiang
Kang Chen
Long Peng
Qiaosi Yi
Chunjie Zhang
DiffM
VLM
23
10
0
11 Dec 2023
PSCR: Patches Sampling-based Contrastive Regression for AIGC Image Quality Assessment
Jiquan Yuan
Xinyan Cao
Linjing Cao
Jinlong Lin
Xixin Cao
EGVM
33
11
0
10 Dec 2023
ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations
Maitreya Patel
Changhoon Kim
Sheng Cheng
Chitta Baral
Yezhou Yang
VLM
27
18
0
07 Dec 2023
One-step Diffusion with Distribution Matching Distillation
Tianwei Yin
Michael Gharbi
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
Taesung Park
DiffM
124
219
0
30 Nov 2023
MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices
Yang Zhao
Yanwu Xu
Zhisheng Xiao
Haolin Jia
Tingbo Hou
VLM
39
11
0
28 Nov 2023
Efficient Multimodal Diffusion Models Using Joint Data Infilling with Partially Shared U-Net
Zizhao Hu
Shaochong Jia
Mohammad Rostami
DiffM
MedIm
19
0
0
28 Nov 2023
PKU-I2IQA: An Image-to-Image Quality Assessment Database for AI Generated Images
Jiquan Yuan
Xinyan Cao
Changjin Li
Fanyi Yang
Jinlong Lin
Xixin Cao
EGVM
38
18
0
27 Nov 2023
UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
Yanwu Xu
Yang Zhao
Zhisheng Xiao
Tingbo Hou
131
106
0
14 Nov 2023
DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing
Yueming Lyu
Kang Zhao
Bo Peng
Yue Jiang
Yingya Zhang
Jing Dong
31
2
0
12 Oct 2023
1
2
3
Next