ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.13792
  4. Cited By
LAFITE: Towards Language-Free Training for Text-to-Image Generation

LAFITE: Towards Language-Free Training for Text-to-Image Generation

27 November 2021
Yufan Zhou
Ruiyi Zhang
Changyou Chen
Chunyuan Li
Chris Tensmeyer
Tong Yu
Jiuxiang Gu
Jinhui Xu
Tong Sun
    VLM
ArXivPDFHTML

Papers citing "LAFITE: Towards Language-Free Training for Text-to-Image Generation"

50 / 107 papers shown
Title
Evaluating Text-to-Image Synthesis with a Conditional Fréchet Distance
Evaluating Text-to-Image Synthesis with a Conditional Fréchet Distance
Jaywon Koo
J. Hernandez
Moayed Haji-Ali
Ziyan Yang
Vicente Ordonez
EGVM
72
0
0
27 Mar 2025
Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving
Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving
Lucas Nunes
Rodrigo Marcuzzi
Jens Behley
C. Stachniss
3DPC
83
0
0
27 Mar 2025
Towards Improved Text-Aligned Codebook Learning: Multi-Hierarchical Codebook-Text Alignment with Long Text
Guotao Liang
Baoquan Zhang
Zhiyuan Wen
Junteng Zhao
Yunming Ye
Kola Ye
Yao He
54
0
0
03 Mar 2025
GANFusion: Feed-Forward Text-to-3D with Diffusion in GAN Space
GANFusion: Feed-Forward Text-to-3D with Diffusion in GAN Space
Souhaib Attaiki
Paul Guerrero
Duygu Ceylan
Niloy J. Mitra
M. Ovsjanikov
90
0
0
21 Dec 2024
CLIP-SR: Collaborative Linguistic and Image Processing for Super-Resolution
CLIP-SR: Collaborative Linguistic and Image Processing for Super-Resolution
Bingwen Hu
Heng Liu
Zhedong Zheng
Ping Liu
SupR
86
0
0
16 Dec 2024
Reward Incremental Learning in Text-to-Image Generation
Reward Incremental Learning in Text-to-Image Generation
Maorong Wang
Jiafeng Mao
Xueting Wang
Toshihiko Yamasaki
EGVM
103
0
0
26 Nov 2024
Quantum-Brain: Quantum-Inspired Neural Network Approach to Vision-Brain
  Understanding
Quantum-Brain: Quantum-Inspired Neural Network Approach to Vision-Brain Understanding
Hoang-Quan Nguyen
Xuan-Bac Nguyen
Hugh Churchill
Arabinda Kumar Choudhary
Pawan Sinha
S. Khan
Khoa Luu
69
1
0
20 Nov 2024
Phrase Decoupling Cross-Modal Hierarchical Matching and Progressive
  Position Correction for Visual Grounding
Phrase Decoupling Cross-Modal Hierarchical Matching and Progressive Position Correction for Visual Grounding
Minghong Xie
Hao Wu
Huafeng Li
Yafei Zhang
Dapeng Tao
Z. Yu
ObjD
40
1
0
31 Oct 2024
Diff-Instruct*: Towards Human-Preferred One-step Text-to-image
  Generative Models
Diff-Instruct*: Towards Human-Preferred One-step Text-to-image Generative Models
Weijian Luo
C. Zhang
Debing Zhang
Zhengyang Geng
28
3
0
28 Oct 2024
Diff-Instruct++: Training One-step Text-to-image Generator Model to
  Align with Human Preferences
Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences
Weijian Luo
EGVM
36
6
0
24 Oct 2024
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Jinbin Bai
Tian-Chun Ye
Wei Chow
Enxin Song
Qing-Guo Chen
Hefei Ling
Zhen Dong
Lei Zhu
66
13
0
10 Oct 2024
Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow
Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow
Fu-Yun Wang
Ling Yang
Zhaoyang Huang
Mengdi Wang
Hongsheng Li
34
14
0
09 Oct 2024
Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning
Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning
Jianxiong Li
Zhihao Wang
Jinliang Zheng
Xiaoai Zhou
Guanming Wang
...
Yu Liu
Jingjing Liu
Ya-Qin Zhang
Junzhi Yu
Xianyuan Zhan
38
2
0
02 Oct 2024
MM2Latent: Text-to-facial image generation and editing in GANs with
  multimodal assistance
MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance
Debin Meng
Christos Tzelepis
Ioannis Patras
Georgios Tzimiropoulos
DiffM
31
0
0
17 Sep 2024
Language-Queried Target Sound Extraction Without Parallel Training Data
Language-Queried Target Sound Extraction Without Parallel Training Data
Hao Ma
Zhiyuan Peng
Xu Li
Yukai Li
Mingjie Shao
Qiuqiang Kong
Ju Liu
VLM
74
1
0
14 Sep 2024
Exploring Foundation Models for Synthetic Medical Imaging: A Study on
  Chest X-Rays and Fine-Tuning Techniques
Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques
Davide Clode da Silva
Marina Musse Bernardes
Nathalia Giacomini Ceretta
Gabriel Vaz de Souza
Gabriel Fonseca Silva
Rafael Heitor Bordini
S. Musse
MedIm
LM&MA
31
0
0
06 Sep 2024
A New Chinese Landscape Paintings Generation Model based on Stable
  Diffusion using DreamBooth
A New Chinese Landscape Paintings Generation Model based on Stable Diffusion using DreamBooth
Yujia Gu
Xinyu Fang
Xueyuan Deng
Zihan Peng
Yinan Peng
DiffM
18
0
0
16 Aug 2024
Text2LiDAR: Text-guided LiDAR Point Cloud Generation via Equirectangular
  Transformer
Text2LiDAR: Text-guided LiDAR Point Cloud Generation via Equirectangular Transformer
Yang Wu
Kaihua Zhang
Jianjun Qian
Jin Xie
Jian Yang
DiffM
47
4
0
29 Jul 2024
GVDIFF: Grounded Text-to-Video Generation with Diffusion Models
GVDIFF: Grounded Text-to-Video Generation with Diffusion Models
Huanzhang Dou
Ruixiang Li
Wei Su
Xi Li
DiffM
42
1
0
02 Jul 2024
FastCLIP: A Suite of Optimization Techniques to Accelerate CLIP Training
  with Limited Resources
FastCLIP: A Suite of Optimization Techniques to Accelerate CLIP Training with Limited Resources
Xiyuan Wei
Fanjiang Ye
Ori Yonay
Xingyu Chen
Baixi Sun
Dingwen Tao
Tianbao Yang
VLM
CLIP
56
2
0
01 Jul 2024
Analyzing Quality, Bias, and Performance in Text-to-Image Generative
  Models
Analyzing Quality, Bias, and Performance in Text-to-Image Generative Models
Nila Masrourisaadat
Nazanin Sedaghatkish
Fatemeh Sarshartehrani
Edward A. Fox
37
6
0
28 Jun 2024
Vision-Language Consistency Guided Multi-modal Prompt Learning for Blind
  AI Generated Image Quality Assessment
Vision-Language Consistency Guided Multi-modal Prompt Learning for Blind AI Generated Image Quality Assessment
Jun Fu
Wei Zhou
Qiuping Jiang
Hantao Liu
Guangtao Zhai
VLM
CLIP
42
8
0
24 Jun 2024
Open-World Human-Object Interaction Detection via Multi-modal Prompts
Open-World Human-Object Interaction Detection via Multi-modal Prompts
Jie-jin Yang
Bingliang Li
Ailing Zeng
L. Zhang
Ruimao Zhang
VLM
32
8
0
11 Jun 2024
Improved Distribution Matching Distillation for Fast Image Synthesis
Improved Distribution Matching Distillation for Fast Image Synthesis
Tianwei Yin
Michael Gharbi
Taesung Park
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
DiffM
44
95
0
23 May 2024
RATLIP: Generative Adversarial CLIP Text-to-Image Synthesis Based on
  Recurrent Affine Transformations
RATLIP: Generative Adversarial CLIP Text-to-Image Synthesis Based on Recurrent Affine Transformations
Chengde Lin
Xijun Lu
Guangxi Chen
35
0
0
13 May 2024
PKU-AIGIQA-4K: A Perceptual Quality Assessment Database for Both
  Text-to-Image and Image-to-Image AI-Generated Images
PKU-AIGIQA-4K: A Perceptual Quality Assessment Database for Both Text-to-Image and Image-to-Image AI-Generated Images
Jiquan Yuan
Fanyi Yang
Jihe Li
Xinyan Cao
Jinming Che
Jinlong Lin
Xixin Cao
EGVM
33
2
0
29 Apr 2024
Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image
  Quality Assessment
Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment
Tianwei Zhou
Songbai Tan
Wei Zhou
Yu Luo
Yuan-Gen Wang
Guanghui Yue
EGVM
38
10
0
23 Apr 2024
Accelerating Image Generation with Sub-path Linear Approximation Model
Accelerating Image Generation with Sub-path Linear Approximation Model
Chen Xu
Tian-Shu Song
Weixin Feng
Xubin Li
Tiezheng Ge
Bo Zheng
Limin Wang
42
11
0
22 Apr 2024
ANCHOR: LLM-driven News Subject Conditioning for Text-to-Image Synthesis
ANCHOR: LLM-driven News Subject Conditioning for Text-to-Image Synthesis
Aashish Anantha Ramakrishnan
Sharon X. Huang
Dongwon Lee
34
0
0
15 Apr 2024
Would Deep Generative Models Amplify Bias in Future Models?
Would Deep Generative Models Amplify Bias in Future Models?
Tianwei Chen
Yusuke Hirota
Mayu Otani
Noa Garcia
Yuta Nakashima
45
12
0
04 Apr 2024
Can Language Beat Numerical Regression? Language-Based Multimodal
  Trajectory Prediction
Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction
Inhwan Bae
Junoh Lee
Hae-Gon Jeon
33
15
0
27 Mar 2024
CLIP-VQDiffusion : Langauge Free Training of Text To Image generation
  using CLIP and vector quantized diffusion model
CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusion model
S. Han
Joohee Kim
DiffM
CLIP
34
1
0
22 Mar 2024
Scaling Diffusion Models to Real-World 3D LiDAR Scene Completion
Scaling Diffusion Models to Real-World 3D LiDAR Scene Completion
Lucas Nunes
Rodrigo Marcuzzi
Benedikt Mersch
Jens Behley
C. Stachniss
DiffM
33
15
0
20 Mar 2024
Can AI Outperform Human Experts in Creating Social Media Creatives?
Can AI Outperform Human Experts in Creating Social Media Creatives?
Eunkyung Park
Raymond K. Wong
Junbum Kwon
41
0
0
19 Mar 2024
Rethinking cluster-conditioned diffusion models
Rethinking cluster-conditioned diffusion models
Nikolas Adaloglou
Tim Kaiser
Félix D. P. Michels
M. Kollmann
VLM
37
3
0
01 Mar 2024
Contextualized Diffusion Models for Text-Guided Image and Video
  Generation
Contextualized Diffusion Models for Text-Guided Image and Video Generation
Ling Yang
Zhilong Zhang
Zhaochen Yu
Jingwei Liu
Minkai Xu
Stefano Ermon
Bin Cui
44
4
0
26 Feb 2024
Separable Multi-Concept Erasure from Diffusion Models
Separable Multi-Concept Erasure from Diffusion Models
Mengnan Zhao
Lihe Zhang
Tianhang Zheng
Yuqiu Kong
Baocai Yin
50
9
0
03 Feb 2024
TIER: Text-Image Encoder-based Regression for AIGC Image Quality
  Assessment
TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment
Jiquan Yuan
Xinyan Cao
Jinming Che
Qinyuan Wang
Sen Liang
Wei Ren
Jinlong Lin
Xixin Cao
EGVM
18
1
0
08 Jan 2024
Latte: Latent Diffusion Transformer for Video Generation
Latte: Latent Diffusion Transformer for Video Generation
Xin Ma
Yaohui Wang
Gengyun Jia
Xinyuan Chen
Ziqiang Liu
Yuan-Fang Li
Cunjian Chen
Yu Qiao
DiffM
VGen
125
233
0
05 Jan 2024
Improving Diffusion-Based Image Synthesis with Context Prediction
Improving Diffusion-Based Image Synthesis with Context Prediction
Ling Yang
Jingwei Liu
Shenda Hong
Zhilong Zhang
Zhilin Huang
Zheming Cai
Wentao Zhang
Bin Cui
DiffM
46
33
0
04 Jan 2024
BrainVis: Exploring the Bridge between Brain and Visual Signals via
  Image Reconstruction
BrainVis: Exploring the Bridge between Brain and Visual Signals via Image Reconstruction
Honghao Fu
Zhiqi Shen
Jing Jih Chin
Hao Wang
DiffM
26
5
0
22 Dec 2023
Textual Prompt Guided Image Restoration
Textual Prompt Guided Image Restoration
Qiuhai Yan
Aiwen Jiang
Kang Chen
Long Peng
Qiaosi Yi
Chunjie Zhang
DiffM
VLM
25
10
0
11 Dec 2023
PSCR: Patches Sampling-based Contrastive Regression for AIGC Image
  Quality Assessment
PSCR: Patches Sampling-based Contrastive Regression for AIGC Image Quality Assessment
Jiquan Yuan
Xinyan Cao
Linjing Cao
Jinlong Lin
Xixin Cao
EGVM
33
11
0
10 Dec 2023
ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations
ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations
Maitreya Patel
Changhoon Kim
Sheng Cheng
Chitta Baral
Yezhou Yang
VLM
27
18
0
07 Dec 2023
One-step Diffusion with Distribution Matching Distillation
One-step Diffusion with Distribution Matching Distillation
Tianwei Yin
Michael Gharbi
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
Taesung Park
DiffM
124
219
0
30 Nov 2023
MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices
MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices
Yang Zhao
Yanwu Xu
Zhisheng Xiao
Haolin Jia
Tingbo Hou
VLM
41
11
0
28 Nov 2023
Efficient Multimodal Diffusion Models Using Joint Data Infilling with
  Partially Shared U-Net
Efficient Multimodal Diffusion Models Using Joint Data Infilling with Partially Shared U-Net
Zizhao Hu
Shaochong Jia
Mohammad Rostami
DiffM
MedIm
21
0
0
28 Nov 2023
PKU-I2IQA: An Image-to-Image Quality Assessment Database for AI
  Generated Images
PKU-I2IQA: An Image-to-Image Quality Assessment Database for AI Generated Images
Jiquan Yuan
Xinyan Cao
Changjin Li
Fanyi Yang
Jinlong Lin
Xixin Cao
EGVM
38
18
0
27 Nov 2023
UFOGen: You Forward Once Large Scale Text-to-Image Generation via
  Diffusion GANs
UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
Yanwu Xu
Yang Zhao
Zhisheng Xiao
Tingbo Hou
134
107
0
14 Nov 2023
DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided
  Image Editing
DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing
Yueming Lyu
Kang Zhao
Bo Peng
Yue Jiang
Yingya Zhang
Jing Dong
31
2
0
12 Oct 2023
123
Next