ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.03206
  4. Cited By
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

5 March 2024
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
Harry Saini
Yam Levi
Dominik Lorenz
Axel Sauer
Frederic Boesel
Dustin Podell
Tim Dockhorn
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
    DiffM
ArXivPDFHTML

Papers citing "Scaling Rectified Flow Transformers for High-Resolution Image Synthesis"

50 / 829 papers shown
Title
AuthFace: Towards Authentic Blind Face Restoration with Face-oriented
  Generative Diffusion Prior
AuthFace: Towards Authentic Blind Face Restoration with Face-oriented Generative Diffusion Prior
Guoqiang Liang
Qingnan Fan
Bingtao Fu
Jinwei Chen
Hong Gu
Lin Wang
DiffM
34
0
0
13 Oct 2024
CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation
CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation
Yifeng Xu
Zhenliang He
Shiguang Shan
Xilin Chen
DiffM
32
3
0
12 Oct 2024
Losing dimensions: Geometric memorization in generative diffusion
Losing dimensions: Geometric memorization in generative diffusion
Beatrice Achilli
Enrico Ventura
Gianluigi Silvestri
Bao Pham
G. Raya
Dmitry Krotov
Carlo Lucibello
L. Ambrogioni
50
4
0
11 Oct 2024
CryoFM: A Flow-based Foundation Model for Cryo-EM Densities
CryoFM: A Flow-based Foundation Model for Cryo-EM Densities
Yi Zhou
Yilai Li
Jing Yuan
Quanquan Gu
35
1
0
11 Oct 2024
Diffusion Models Need Visual Priors for Image Generation
Diffusion Models Need Visual Priors for Image Generation
Xiaoyu Yue
Zidong Wang
Zeyu Lu
S. Sun
Meng Wei
Wanli Ouyang
Junlin Wu
Luping Zhou
VLM
53
1
0
11 Oct 2024
Scaling Laws For Diffusion Transformers
Scaling Laws For Diffusion Transformers
Zhengyang Liang
Hao He
Ceyuan Yang
Bo Dai
35
9
0
10 Oct 2024
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Jiatao Gu
Yuyang Wang
Yizhe Zhang
Qihang Zhang
Dinghuai Zhang
Navdeep Jaitly
Josh Susskind
Shuangfei Zhai
DiffM
39
13
0
10 Oct 2024
$\textit{Jump Your Steps}$: Optimizing Sampling Schedule of Discrete
  Diffusion Models
Jump Your Steps\textit{Jump Your Steps}Jump Your Steps: Optimizing Sampling Schedule of Discrete Diffusion Models
Yong-Hyun Park
Chieh-Hsin Lai
Satoshi Hayakawa
Yuhta Takida
Yuki Mitsufuji
54
4
0
10 Oct 2024
I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow
  Transformers with Projected Flow
I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow
Ruoyi Du
Dongyang Liu
Le Zhuo
Qin Qi
Hongsheng Li
Zhanyu Ma
Peng Gao
34
1
0
10 Oct 2024
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Jinbin Bai
Tian-Chun Ye
Wei Chow
Enxin Song
Qing-Guo Chen
Hefei Ling
Zhen Dong
Lei Zhu
71
14
0
10 Oct 2024
MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion
MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion
Onkar Susladkar
Jishu Sen Gupta
Chirag Sehgal
Sparsh Mittal
Rekha Singhal
DiffM
VGen
44
0
0
10 Oct 2024
EvolveDirector: Approaching Advanced Text-to-Image Generation with Large
  Vision-Language Models
EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models
Rui Zhao
Hangjie Yuan
Yujie Wei
Shiwei Zhang
Yuchao Gu
...
Xiang Wang
Zhangjie Wu
Junhao Zhang
Yingya Zhang
Mike Zheng Shou
DiffM
VLM
57
4
0
09 Oct 2024
Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow
Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow
Fu-Yun Wang
Ling Yang
Zhaoyang Huang
Mengdi Wang
Hongsheng Li
44
15
0
09 Oct 2024
Jointly Generating Multi-view Consistent PBR Textures using
  Collaborative Control
Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control
Shimon Vainer
Konstantin Kutsy
Dante De Nigris
Ciara Rowles
Slava Elizarov
Simon Donné
DiffM
68
1
0
09 Oct 2024
G2D2: Gradient-guided Discrete Diffusion for image inverse problem
  solving
G2D2: Gradient-guided Discrete Diffusion for image inverse problem solving
Naoki Murata
Chieh-Hsin Lai
Yuhta Takida
Toshimitsu Uesaka
Bac Nguyen
Stefano Ermon
Yuki Mitsufuji
DiffM
65
1
0
09 Oct 2024
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Sihyun Yu
Sangkyung Kwak
Huiwon Jang
Jongheon Jeong
Jonathan Huang
Jinwoo Shin
Saining Xie
OCL
78
69
0
09 Oct 2024
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching
Yushen Chen
Zhikang Niu
Ziyang Ma
Keqi Deng
Chunhui Wang
Jian Zhao
Kai Yu
Xie Chen
35
55
0
09 Oct 2024
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
Xinchen Zhang
Ling Yang
Ge Li
Yaqi Cai
Jiake Xie
Yong Tang
Yujiu Yang
Mengdi Wang
Bin Cui
EGVM
CoGe
46
7
0
09 Oct 2024
AP-LDM: Attentive and Progressive Latent Diffusion Model for
  Training-Free High-Resolution Image Generation
AP-LDM: Attentive and Progressive Latent Diffusion Model for Training-Free High-Resolution Image Generation
Boyuan Cao
Jiaxin Ye
Yujie Wei
Hongming Shan
45
3
0
08 Oct 2024
Sparse Repellency for Shielded Generation in Text-to-image Diffusion
  Models
Sparse Repellency for Shielded Generation in Text-to-image Diffusion Models
Michael Kirchhof
James Thornton
Pierre Ablin
Louis Béthune
Eugène Ndiaye
Marco Cuturi
54
2
0
08 Oct 2024
Pyramidal Flow Matching for Efficient Video Generative Modeling
Pyramidal Flow Matching for Efficient Video Generative Modeling
Yang Jin
Zhicheng Sun
Ningyuan Li
Kun Xu
K. Xu
...
Nan Zhuang
Quzhe Huang
Yang Song
Yadong Mu
Zhouchen Lin
VGen
79
66
0
08 Oct 2024
Active Fine-Tuning of Generalist Policies
Active Fine-Tuning of Generalist Policies
Marco Bagatella
Jonas Hübotter
Georg Martius
Andreas Krause
37
0
0
07 Oct 2024
CAR: Controllable Autoregressive Modeling for Visual Generation
CAR: Controllable Autoregressive Modeling for Visual Generation
Ziyu Yao
Jialin Li
Yifeng Zhou
Yong Liu
Xi Jiang
Chengjie Wang
Feng Zheng
Yuexian Zou
Lei Li
DiffM
45
13
0
07 Oct 2024
Image Watermarks are Removable Using Controllable Regeneration from Clean Noise
Image Watermarks are Removable Using Controllable Regeneration from Clean Noise
Yepeng Liu
Yiren Song
Hai Ci
Yu Zhang
Haofan Wang
Mike Zheng Shou
Yuheng Bu
WIGM
64
3
0
07 Oct 2024
Realizing Video Summarization from the Path of Language-based Semantic
  Understanding
Realizing Video Summarization from the Path of Language-based Semantic Understanding
Kuan-Chen Mu
Zhi-Yi Chin
Wei-Chen Chiu
28
0
0
06 Oct 2024
A Reflection on the Impact of Misspecifying Unidentifiable Causal
  Inference Models in Surrogate Endpoint Evaluation
A Reflection on the Impact of Misspecifying Unidentifiable Causal Inference Models in Surrogate Endpoint Evaluation
Gokce Deliorman
Florian Stijven
Wim Van der Elst
Maria del Carmen Pardo
Ariel Alonso
CML
42
0
0
06 Oct 2024
Is What You Ask For What You Get? Investigating Concept Associations in Text-to-Image Models
Is What You Ask For What You Get? Investigating Concept Associations in Text-to-Image Models
Salma Abdel Magid
Weiwei Pan
Simon Warchol
Grace Guo
Junsik Kim
Mahia Rahman
Hanspeter Pfister
97
0
0
06 Oct 2024
AutoLoRA: AutoGuidance Meets Low-Rank Adaptation for Diffusion Models
AutoLoRA: AutoGuidance Meets Low-Rank Adaptation for Diffusion Models
Artur Kasymov
Marcin Sendera
Michał Stypułkowski
Maciej Ziȩba
Przemysław Spurek
33
1
0
04 Oct 2024
Real-World Benchmarks Make Membership Inference Attacks Fail on
  Diffusion Models
Real-World Benchmarks Make Membership Inference Attacks Fail on Diffusion Models
Chumeng Liang
Jiaxuan You
44
0
0
04 Oct 2024
VEDIT: Latent Prediction Architecture For Procedural Video
  Representation Learning
VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning
Han Lin
Tushar Nagarajan
Nicolas Ballas
Mido Assran
Mojtaba Komeili
Joey Tianyi Zhou
Koustuv Sinha
AI4TS
65
3
0
04 Oct 2024
Elucidating the Design Choice of Probability Paths in Flow Matching for Forecasting
Elucidating the Design Choice of Probability Paths in Flow Matching for Forecasting
S. H. Lim
Yijin Wang
Annan Yu
Emma Hart
Michael W. Mahoney
Xiaoye S. Li
N. Benjamin Erichson
AI4TS
52
1
0
04 Oct 2024
Revisit Large-Scale Image-Caption Data in Pre-training Multimodal
  Foundation Models
Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models
Zhengfeng Lai
Vasileios Saveris
Chen Chen
Hong-You Chen
Haotian Zhang
...
Wenze Hu
Zhe Gan
Peter Grasch
Meng Cao
Yinfei Yang
VLM
41
4
0
03 Oct 2024
Eliminating Oversaturation and Artifacts of High Guidance Scales in
  Diffusion Models
Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models
Seyedmorteza Sadat
Otmar Hilliges
Romann M. Weber
DiffM
23
8
0
03 Oct 2024
Stochastic Sampling from Deterministic Flow Models
Stochastic Sampling from Deterministic Flow Models
Saurabh Singh
Ian S. Fischer
41
2
0
03 Oct 2024
Channel-aware Contrastive Conditional Diffusion for Multivariate
  Probabilistic Time Series Forecasting
Channel-aware Contrastive Conditional Diffusion for Multivariate Probabilistic Time Series Forecasting
Siyang Li
Yize Chen
Hui Xiong
DiffM
AI4TS
38
0
0
03 Oct 2024
Selective Attention Improves Transformer
Selective Attention Improves Transformer
Yaniv Leviathan
Matan Kalman
Yossi Matias
51
9
0
03 Oct 2024
Diffusion & Adversarial Schr\"odinger Bridges via Iterative Proportional Markovian Fitting
Diffusion & Adversarial Schr\"odinger Bridges via Iterative Proportional Markovian Fitting
Sergei Kholkin
Grigoriy Ksenofontov
David Li
Nikita Kornilov
Nikita Gushchin
Alexandra Suvorikova
Alexey Kroshnin
Evgeny Burnaev
Alexander Korotin
44
2
0
03 Oct 2024
ControlAR: Controllable Image Generation with Autoregressive Models
ControlAR: Controllable Image Generation with Autoregressive Models
Zongming Li
Tianheng Cheng
Shoufa Chen
Peize Sun
Haocheng Shen
Longjin Ran
Xiaoxin Chen
Wenyu Liu
Xinggang Wang
DiffM
136
15
0
03 Oct 2024
KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models
KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models
Pouyan Navard
Amin Karimi Monsefi
Mengxi Zhou
Wei-Lun Chao
Alper Yilmaz
R. Ramnath
DiffM
54
2
0
02 Oct 2024
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding
Yao Teng
Han Shi
Xian Liu
Xuefei Ning
Guohao Dai
Yu Wang
Zhenguo Li
Xihui Liu
58
10
0
02 Oct 2024
Denoising with a Joint-Embedding Predictive Architecture
Denoising with a Joint-Embedding Predictive Architecture
Dengsheng Chen
Jie Hu
Xiaoming Wei
Enhua Wu
DiffM
52
2
0
02 Oct 2024
Khattat: Enhancing Readability and Concept Representation of Semantic
  Typography
Khattat: Enhancing Readability and Concept Representation of Semantic Typography
Ahmed Hussein
Alaa Elsetohy
Sama Hadhoud
Tameem Bakr
Yasser Rohaim
Badr AlKhamissi
VLM
36
0
0
01 Oct 2024
A Cat Is A Cat (Not A Dog!): Unraveling Information Mix-ups in
  Text-to-Image Encoders through Causal Analysis and Embedding Optimization
A Cat Is A Cat (Not A Dog!): Unraveling Information Mix-ups in Text-to-Image Encoders through Causal Analysis and Embedding Optimization
Chieh-Yun Chen
Chiang Tseng
Li-Wu Tsao
Hong-Han Shuai
30
6
0
01 Oct 2024
ACE: All-round Creator and Editor Following Instructions via Diffusion
  Transformer
ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer
Zhen Han
Zeyinzi Jiang
Yulin Pan
Jingfeng Zhang
Chaojie Mao
Chenwei Xie
Yu Liu
Jingren Zhou
DiffM
35
17
0
30 Sep 2024
Illustrious: an Open Advanced Illustration Model
Illustrious: an Open Advanced Illustration Model
Sang Hyun Park
Jun Young Koh
Junha Lee
Joy Song
Dongha Kim
Hoyeon Moon
Hyunju Lee
Min Song
VLM
46
1
0
30 Sep 2024
Effective Diffusion Transformer Architecture for Image Super-Resolution
Effective Diffusion Transformer Architecture for Image Super-Resolution
Kun Cheng
Lei Yu
Zhijun Tu
Xiao He
Liyu Chen
Yong Guo
Mingrui Zhu
Nannan Wang
Xinbo Gao
Jie Hu
37
0
0
29 Sep 2024
Secure Multiparty Generative AI
Secure Multiparty Generative AI
Manil Shrestha
Yashodha Ravichandran
Edward Kim
20
0
0
27 Sep 2024
Emu3: Next-Token Prediction is All You Need
Emu3: Next-Token Prediction is All You Need
Xinlong Wang
Xiaosong Zhang
Zhengxiong Luo
Quan-Sen Sun
Yufeng Cui
...
Xi Yang
Jingjing Liu
Yonghua Lin
Tiejun Huang
Zhongyuan Wang
MLLM
47
166
0
27 Sep 2024
Token Caching for Diffusion Transformer Acceleration
Token Caching for Diffusion Transformer Acceleration
Jinming Lou
Wenyang Luo
Yufan Liu
Bing Li
Xinmiao Ding
Weiming Hu
Jiajiong Cao
Yuming Li
Chenguang Ma
39
4
0
27 Sep 2024
FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity
  Refiner
FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner
Wenliang Zhao
Minglei Shi
Xumin Yu
Jie Zhou
Jiwen Lu
37
0
0
26 Sep 2024
Previous
123...121314151617
Next