ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.03206
  4. Cited By
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

5 March 2024
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
Harry Saini
Yam Levi
Dominik Lorenz
Axel Sauer
Frederic Boesel
Dustin Podell
Tim Dockhorn
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
    DiffM
ArXivPDFHTML

Papers citing "Scaling Rectified Flow Transformers for High-Resolution Image Synthesis"

50 / 814 papers shown
Title
Learning Multimodal Behaviors from Scratch with Diffusion Policy
  Gradient
Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient
Zechu Li
Rickmer Krohn
Tao Chen
Anurag Ajay
Pulkit Agrawal
Georgia Chalvatzaki
DiffM
50
8
0
02 Jun 2024
Improving Text Generation on Images with Synthetic Captions
Improving Text Generation on Images with Synthetic Captions
Jun Young Koh
Sang Hyun Park
Joy Song
DiffM
51
2
0
01 Jun 2024
Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching
Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching
Yongqi Wang
Wenxiang Guo
Rongjie Huang
Jia-Bin Huang
Zehan Wang
Fuming You
Ruiqi Li
Zhou Zhao
VGen
DiffM
31
11
0
01 Jun 2024
Bootstrap3D: Improving 3D Content Creation with Synthetic Data
Bootstrap3D: Improving 3D Content Creation with Synthetic Data
Zeyi Sun
Tong Wu
Pan Zhang
Yuhang Zang
Xiao-wen Dong
Yuanjun Xiong
Dahua Lin
Jiaqi Wang
47
0
0
31 May 2024
Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models
Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models
Xinxi Zhang
Song Wen
Ligong Han
Felix Juefei Xu
Akash Srivastava
Junzhou Huang
Hao Wang
Molei Tao
Dimitris N. Metaxas
DiffM
36
5
0
31 May 2024
Improving the Training of Rectified Flows
Improving the Training of Rectified Flows
Sangyun Lee
Zinan Lin
Giulia Fanti
41
19
0
30 May 2024
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified
  Flow
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
Chaoyang Wang
Xiangtai Li
Lu Qi
Henghui Ding
Yunhai Tong
Ming-Hsuan Yang
DiffM
81
6
0
30 May 2024
CV-VAE: A Compatible Video VAE for Latent Generative Video Models
CV-VAE: A Compatible Video VAE for Latent Generative Video Models
Sijie Zhao
Yong Zhang
Xiaodong Cun
Shaoshu Yang
Muyao Niu
Xiaoyu Li
Wenbo Hu
Ying Shan
DiffM
61
23
0
30 May 2024
Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching
Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching
Yasi Zhang
Peiyu Yu
Yaxuan Zhu
Yingshan Chang
Feng Gao
Yingnian Wu
Oscar Leong
83
7
0
29 May 2024
T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model
  with Mixed Reward Feedback
T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback
Jiachen Li
Weixi Feng
Tsu-jui Fu
Xinyi Wang
Sugato Basu
Wenhu Chen
William Yang Wang
VGen
34
27
0
29 May 2024
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
Lianghui Zhu
Zilong Huang
Bencheng Liao
Jun Hao Liew
Hanshu Yan
Jiashi Feng
Xinggang Wang
70
13
0
28 May 2024
FAIntbench: A Holistic and Precise Benchmark for Bias Evaluation in
  Text-to-Image Models
FAIntbench: A Holistic and Precise Benchmark for Bias Evaluation in Text-to-Image Models
Hanjun Luo
Ziye Deng
Ruizhe Chen
Zuo-Qiang Liu
EGVM
48
9
0
28 May 2024
FlowSDF: Flow Matching for Medical Image Segmentation Using Distance Transforms
FlowSDF: Flow Matching for Medical Image Segmentation Using Distance Transforms
L. Bogensperger
Dominik Narnhofer
Alexander Falk
Konrad Schindler
Thomas Pock
MedIm
43
3
0
28 May 2024
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
Kai Wang
Yukun Zhou
Mingjia Shi
Zhihang Yuan
Yuzhang Shang
Yuzhang Shang
Hanwang Zhang
Hanwang Zhang
Yang You
71
10
0
27 May 2024
Automatic Jailbreaking of the Text-to-Image Generative AI Systems
Automatic Jailbreaking of the Text-to-Image Generative AI Systems
Minseon Kim
Hyomin Lee
Boqing Gong
Huishuai Zhang
Sung Ju Hwang
32
12
0
26 May 2024
Towards Black-Box Membership Inference Attack for Diffusion Models
Towards Black-Box Membership Inference Attack for Diffusion Models
Jingwei Li
Jingyi Dong
Tianxing He
Jingzhao Zhang
35
3
0
25 May 2024
Lateralization MLP: A Simple Brain-inspired Architecture for Diffusion
Lateralization MLP: A Simple Brain-inspired Architecture for Diffusion
Zizhao Hu
Mohammad Rostami
34
0
0
25 May 2024
A Misleading Gallery of Fluid Motion by Generative Artificial
  Intelligence
A Misleading Gallery of Fluid Motion by Generative Artificial Intelligence
Ali Kashefi
VGen
51
5
0
24 May 2024
Bitune: Bidirectional Instruction-Tuning
Bitune: Bidirectional Instruction-Tuning
D. J. Kopiczko
Tijmen Blankevoort
Yuki Markus Asano
27
2
0
23 May 2024
RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance
RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance
Zhicheng Sun
Zhenhao Yang
Yang Jin
Haozhe Chi
Kun Xu
...
Hao Jiang
Di Zhang
Yang Song
Kun Gai
Yadong Mu
37
3
0
23 May 2024
Fisher Flow Matching for Generative Modeling over Discrete Data
Fisher Flow Matching for Generative Modeling over Discrete Data
Oscar Davis
Samuel Kessler
Mircea Petrache
.Ismail .Ilkan Ceylan
Michael M. Bronstein
A. Bose
48
16
0
23 May 2024
LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models
LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models
Seyedmorteza Sadat
Jakob Buhmann
Derek Bradley
Otmar Hilliges
Romann M. Weber
49
9
0
23 May 2024
Adversarial Schrödinger Bridge Matching
Adversarial Schrödinger Bridge Matching
Nikita Gushchin
Daniil Selikhanovych
Sergei Kholkin
Evgeny Burnaev
Alexander Korotin
OT
DiffM
31
1
0
23 May 2024
DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis
DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis
Yao Teng
Yue Wu
Han Shi
Xuefei Ning
Guohao Dai
Yu-Xiang Wang
Zhenguo Li
Xihui Liu
Mamba
53
34
0
23 May 2024
Survey on Visual Signal Coding and Processing with Generative Models:
  Technologies, Standards and Optimization
Survey on Visual Signal Coding and Processing with Generative Models: Technologies, Standards and Optimization
Zhibo Chen
Heming Sun
Li Zhang
Fan Zhang
40
3
0
23 May 2024
TerDiT: Ternary Diffusion Models with Transformers
TerDiT: Ternary Diffusion Models with Transformers
Xudong Lu
Aojun Zhou
Ziyi Lin
Qi Liu
Yuhui Xu
Renrui Zhang
Yafei Wen
Shuai Ren
Peng Gao
Junchi Yan
MQ
53
2
0
23 May 2024
DisenStudio: Customized Multi-subject Text-to-Video Generation with
  Disentangled Spatial Control
DisenStudio: Customized Multi-subject Text-to-Video Generation with Disentangled Spatial Control
Hong Chen
Xin Wang
Yipeng Zhang
Yuwei Zhou
Zeyang Zhang
Siao Tang
Wenwu Zhu
VGen
DiffM
47
9
0
21 May 2024
On the Trajectory Regularity of ODE-based Diffusion Sampling
On the Trajectory Regularity of ODE-based Diffusion Sampling
Defang Chen
Zhenyu Zhou
Can Wang
Chunhua Shen
Siwei Lyu
37
14
0
18 May 2024
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with
  Fine-Grained Chinese Understanding
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Zhimin Li
Jianwei Zhang
Qin Lin
Jiangfeng Xiong
Yanxin Long
...
Wei Liu
Dingyong Wang
Yong Yang
Jie Jiang
Qinglin Lu
ViT
48
91
0
14 May 2024
Erasing Concepts from Text-to-Image Diffusion Models with Few-shot
  Unlearning
Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning
Masane Fuchi
Tomohiro Takagi
DiffM
VLM
53
13
0
12 May 2024
Lumina-T2X: Transforming Text into Any Modality, Resolution, and
  Duration via Flow-based Large Diffusion Transformers
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
Peng Gao
Le Zhuo
Ziyi Lin
Ruoyi Du
Xu Luo
...
Weicai Ye
He Tong
Jingwen He
Yu Qiao
Hongsheng Li
VGen
37
83
0
09 May 2024
Video Diffusion Models: A Survey
Video Diffusion Models: A Survey
Andrew Melnik
Michal Ljubljanac
Cong Lu
Qi Yan
Weiming Ren
Helge J. Ritter
VGen
71
12
0
06 May 2024
ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion
ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion
Ziyue Zhang
Mingbao Lin
Rongrong Ji
Yuxin Zhang
Rongrong Ji
DiffM
59
3
0
26 Apr 2024
ReflectanceFusion: Diffusion-based text to SVBRDF Generation
ReflectanceFusion: Diffusion-based text to SVBRDF Generation
Bowen Xue
G. C. Guarnera
Shuang Zhao
Zahra Montazeri
25
2
0
25 Apr 2024
Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
Amirmojtaba Sabour
Sanja Fidler
Karsten Kreis
DiffM
40
24
0
22 Apr 2024
TextCenGen: Attention-Guided Text-Centric Background Adaptation for Text-to-Image Generation
TextCenGen: Attention-Guided Text-Centric Background Adaptation for Text-to-Image Generation
Tianyi Liang
Jiangqi Liu
Sicheng Song
Shiqi Jiang
Yifei Huang
Changbo Wang
Chenhui Li
42
0
0
18 Apr 2024
Long-form music generation with latent diffusion
Long-form music generation with latent diffusion
Zach Evans
Julian Parker
CJ Carr
Zack Zukowski
Josiah Taylor
Jordi Pons
MGen
DiffM
44
39
0
16 Apr 2024
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse
  Controls to Any Diffusion Model
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
Han Lin
Jaemin Cho
Abhaysinh Zala
Mohit Bansal
DiffM
VGen
69
20
0
15 Apr 2024
Magic Clothing: Controllable Garment-Driven Image Synthesis
Magic Clothing: Controllable Garment-Driven Image Synthesis
Weifeng Chen
Tao Gu
Yuhao Xu
Chengcai Chen
48
16
0
15 Apr 2024
An Overview of Diffusion Models: Applications, Guided Generation,
  Statistical Rates and Optimization
An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization
Minshuo Chen
Song Mei
Jianqing Fan
Mengdi Wang
VLM
MedIm
DiffM
37
48
0
11 Apr 2024
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency
  Determines Multimodal Model Performance
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance
Vishaal Udandarao
Ameya Prabhu
Adhiraj Ghosh
Yash Sharma
Philip Torr
Adel Bibi
Samuel Albanie
Matthias Bethge
VLM
128
45
0
04 Apr 2024
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale
  Prediction
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction
Keyu Tian
Yi-Xin Jiang
Zehuan Yuan
Bingyue Peng
Liwei Wang
VGen
42
250
0
03 Apr 2024
On the Scalability of Diffusion-based Text-to-Image Generation
On the Scalability of Diffusion-based Text-to-Image Generation
Hao Li
Yang Zou
Ying Wang
Orchid Majumder
Yusheng Xie
R. Manmatha
Ashwin Swaminathan
Zhuowen Tu
Stefano Ermon
Stefano Soatto
64
20
0
03 Apr 2024
Faster Diffusion via Temporal Attention Decomposition
Faster Diffusion via Temporal Attention Decomposition
Haozhe Liu
Wentian Zhang
Jinheng Xie
Francesco Faccio
Mengmeng Xu
Tao Xiang
Mike Zheng Shou
Juan-Manuel Perez-Rua
Jürgen Schmidhuber
DiffM
75
19
0
03 Apr 2024
SLEDGE: Synthesizing Driving Environments with Generative Models and
  Rule-Based Traffic
SLEDGE: Synthesizing Driving Environments with Generative Models and Rule-Based Traffic
Kashyap Chitta
D. Dauner
Andreas Geiger
3DGS
48
5
0
26 Mar 2024
DepthFM: Fast Monocular Depth Estimation with Flow Matching
DepthFM: Fast Monocular Depth Estimation with Flow Matching
Ming Gui
Johannes S. Fischer
Ulrich Prestel
Pingchuan Ma
Dmytro Kotovenko
Olga Grebenkova
S. A. Baumann
Vincent Tao Hu
Bjorn Ommer
MDE
36
52
0
20 Mar 2024
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework
Zhengqing Yuan
Ruoxi Chen
Zhaoxu Li
Haolong Jia
Lifang He
Chi Wang
Lichao Sun
VGen
63
27
0
20 Mar 2024
Diffusion Model for Data-Driven Black-Box Optimization
Diffusion Model for Data-Driven Black-Box Optimization
Zihao Li
Hui Yuan
Kaixuan Huang
Chengzhuo Ni
Yinyu Ye
Minshuo Chen
Mengdi Wang
DiffM
40
9
0
20 Mar 2024
Optimal Flow Matching: Learning Straight Trajectories in Just One Step
Optimal Flow Matching: Learning Straight Trajectories in Just One Step
Nikita Kornilov
Petr Mokrov
Alexander Gasnikov
Alexander Korotin
34
11
0
19 Mar 2024
Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion
  Distillation
Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation
Axel Sauer
Frederic Boesel
Tim Dockhorn
A. Blattmann
Patrick Esser
Robin Rombach
DiffM
46
107
0
18 Mar 2024
Previous
123...151617
Next