ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.03206
  4. Cited By
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

5 March 2024
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
Harry Saini
Yam Levi
Dominik Lorenz
Axel Sauer
Frederic Boesel
Dustin Podell
Tim Dockhorn
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
    DiffM
ArXivPDFHTML

Papers citing "Scaling Rectified Flow Transformers for High-Resolution Image Synthesis"

50 / 829 papers shown
Title
TextDestroyer: A Training- and Annotation-Free Diffusion Method for
  Destroying Anomal Text from Images
TextDestroyer: A Training- and Annotation-Free Diffusion Method for Destroying Anomal Text from Images
Mengcheng Li
Mingbao Lin
Rongrong Ji
Chia-Wen Lin
Rongrong Ji
DiffM
58
0
0
01 Nov 2024
Constant Acceleration Flow
Constant Acceleration Flow
Dogyun Park
Sojin Lee
S. Kim
Taehoon Lee
Youngjoon Hong
Hyunwoo J. Kim
65
2
0
01 Nov 2024
Scaling Concept With Text-Guided Diffusion Models
Scaling Concept With Text-Guided Diffusion Models
Chao Huang
Susan Liang
Yunlong Tang
Yapeng Tian
Anurag Kumar
Chenliang Xu
DiffM
59
6
0
31 Oct 2024
Redefining <Creative> in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation
Redefining <Creative> in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation
Fu Feng
Yucheng Xie
Xu Yang
Jing Wang
Xin Geng
DiffM
38
0
0
31 Oct 2024
Controlling Language and Diffusion Models by Transporting Activations
Controlling Language and Diffusion Models by Transporting Activations
P. Rodríguez
Arno Blaas
Michal Klein
Luca Zappella
N. Apostoloff
Marco Cuturi
Xavier Suau
LLMSV
42
5
0
30 Oct 2024
FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with
  Arbitrary Resolution
FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution
Shuai Wang
Zexian Li
Tianhui Song
Xubin Li
Tiezheng Ge
Bo Zheng
Liwen Wang
37
1
0
30 Oct 2024
Consistency Diffusion Bridge Models
Consistency Diffusion Bridge Models
Guande He
Kaiwen Zheng
Jianfei Chen
Fan Bao
Jun-Jie Zhu
DiffM
72
3
0
30 Oct 2024
Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models
Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models
Arash Marioriyad
Parham Rezaei
M. Baghshah
M. Rohban
CoGe
219
0
0
30 Oct 2024
ET-Flow: Equivariant Flow-Matching for Molecular Conformer Generation
ET-Flow: Equivariant Flow-Matching for Molecular Conformer Generation
Majdi Hassan
Nikhil Shenoy
Jungyoon Lee
Hannes Stärk
Stephan Thaler
Dominique Beaini
44
6
0
29 Oct 2024
GRADE: Quantifying Sample Diversity in Text-to-Image Models
GRADE: Quantifying Sample Diversity in Text-to-Image Models
Royi Rassin
Aviv Slobodkin
Shauli Ravfogel
Yanai Elazar
Yoav Goldberg
171
1
0
29 Oct 2024
AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models
AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models
Yaopei Zeng
Yuanpu Cao
Bochuan Cao
Yurui Chang
Jinghui Chen
Lu Lin
DiffM
43
3
0
28 Oct 2024
Shallow Diffuse: Robust and Invisible Watermarking through
  Low-Dimensional Subspaces in Diffusion Models
Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models
Wenda Li
Huijie Zhang
Qing Qu
WIGM
49
2
0
28 Oct 2024
Diff-Instruct*: Towards Human-Preferred One-step Text-to-image
  Generative Models
Diff-Instruct*: Towards Human-Preferred One-step Text-to-image Generative Models
Weijian Luo
C. Zhang
Debing Zhang
Zhengyang Geng
30
4
0
28 Oct 2024
GrounDiT: Grounding Diffusion Transformers via Noisy Patch
  Transplantation
GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation
Phillip Y. Lee
Taehoon Yoon
Minhyuk Sung
62
4
1
27 Oct 2024
An Efficient Watermarking Method for Latent Diffusion Models via
  Low-Rank Adaptation
An Efficient Watermarking Method for Latent Diffusion Models via Low-Rank Adaptation
Dongdong Lin
Yue Li
B. Tondi
Bin Li
Mauro Barni
WIGM
38
1
0
26 Oct 2024
Flow Generator Matching
Flow Generator Matching
Zemin Huang
Zhengyang Geng
Weijian Luo
Guo-jun Qi
49
9
0
25 Oct 2024
Towards Visual Text Design Transfer Across Languages
Towards Visual Text Design Transfer Across Languages
Yejin Choi
Jiwan Chung
Sumin Shim
Giyeong Oh
Youngjae Yu
VLM
DiffM
48
1
0
24 Oct 2024
Rectified Diffusion Guidance for Conditional Generation
Rectified Diffusion Guidance for Conditional Generation
Mengfei Xia
Nan Xue
Yujun Shen
Ran Yi
Tieliang Gong
Yu Liu
DiffM
41
3
0
24 Oct 2024
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe
  Dataset Curation
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
Yuang Ai
Xiaoqiang Zhou
Huaibo Huang
Xiaotian Han
Zhengyu Chen
Quanzeng You
Hongxia Yang
53
9
0
24 Oct 2024
FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling
FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling
Zhengqiang Zhang
Ruihuang Li
Lei Zhang
43
2
0
24 Oct 2024
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
Shilin Lu
Zihan Zhou
Jiayou Lu
Yuanzhi Zhu
A. Kong
WIGM
97
11
0
24 Oct 2024
Scalable Ranked Preference Optimization for Text-to-Image Generation
Scalable Ranked Preference Optimization for Text-to-Image Generation
Shyamgopal Karthik
Huseyin Coskun
Zeynep Akata
Sergey Tulyakov
J. Ren
Anil Kag
EGVM
57
6
0
23 Oct 2024
Training Free Guided Flow Matching with Optimal Control
Training Free Guided Flow Matching with Optimal Control
Luran Wang
Chaoran Cheng
Yizhen Liao
Yanru Qu
Ge Liu
43
1
0
23 Oct 2024
VistaDream: Sampling multiview consistent images for single-view scene
  reconstruction
VistaDream: Sampling multiview consistent images for single-view scene reconstruction
Haiping Wang
Yuan Liu
Ziwei Liu
Wenping Wang
Z. Dong
Bisheng Yang
53
11
0
22 Oct 2024
One-Step Diffusion Distillation through Score Implicit Matching
One-Step Diffusion Distillation through Score Implicit Matching
Weijian Luo
Zemin Huang
Zhengyang Geng
J. Zico Kolter
Guo-jun Qi
DiffM
39
15
0
22 Oct 2024
Efficient Antibody Structure Refinement Using Energy-Guided SE(3) Flow
  Matching
Efficient Antibody Structure Refinement Using Energy-Guided SE(3) Flow Matching
Jiying Zhang
Zijing Liu
Shengyuan Bai
He Cao
Yu Li
Lei Zhang
DiffM
44
1
0
22 Oct 2024
Progressive Compositionality in Text-to-Image Generative Models
Progressive Compositionality in Text-to-Image Generative Models
Xu Han
Linghao Jin
Xiaofeng Liu
Paul Pu Liang
CoGe
106
2
0
22 Oct 2024
Elucidating the design space of language models for image generation
Elucidating the design space of language models for image generation
Xuantong Liu
Shaozhe Hao
Xianbiao Qi
Tianyang Hu
Jun Wang
Rong Xiao
Yuan Yao
VLM
37
3
0
21 Oct 2024
Warped Diffusion: Solving Video Inverse Problems with Image Diffusion
  Models
Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models
Giannis Daras
Weili Nie
Karsten Kreis
A. Dimakis
Morteza Mardani
Nikola B. Kovachki
Arash Vahdat
DiffM
35
6
0
21 Oct 2024
Residual vector quantization for KV cache compression in large language
  model
Residual vector quantization for KV cache compression in large language model
Ankur Kumar
MQ
34
0
0
21 Oct 2024
Allegro: Open the Black Box of Commercial-Level Video Generation Model
Allegro: Open the Black Box of Commercial-Level Video Generation Model
Yuan Zhou
Qiuyue Wang
Yuxuan Cai
Huan Yang
VGen
VLM
88
26
0
20 Oct 2024
Group Diffusion Transformers are Unsupervised Multitask Learners
Group Diffusion Transformers are Unsupervised Multitask Learners
Lianghua Huang
Wei Wang
Zhi-Fan Wu
Huanzhang Dou
Yupeng Shi
Yutong Feng
C. Liang
Yu Liu
Jingren Zhou
VLM
52
12
0
19 Oct 2024
Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation
Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation
Seulbi Lee
J. Kim
Sangheum Hwang
LRM
202
0
0
19 Oct 2024
Fluid: Scaling Autoregressive Text-to-image Generative Models with
  Continuous Tokens
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens
Lijie Fan
Tianhong Li
Siyang Qin
Yuanzhen Li
Chen Sun
Michael Rubinstein
Deqing Sun
Kaiming He
Yonglong Tian
VLM
DiffM
48
42
0
17 Oct 2024
FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion
  Model
FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model
ZiDong Wang
Zeyu Lu
Di Huang
Cai Zhou
Wanli Ouyang
and Lei Bai
76
3
0
17 Oct 2024
On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite Flow
On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite Flow
Tonghan Wang
Heng Dong
Yanchen Jiang
David C. Parkes
Milind Tambe
DiffM
50
2
0
17 Oct 2024
One Step Diffusion via Shortcut Models
One Step Diffusion via Shortcut Models
Kevin Frans
Danijar Hafner
Sergey Levine
Pieter Abbeel
VLM
DiffM
39
26
0
16 Oct 2024
FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio
  Generation
FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation
Huadai Liu
Jialei Wang
Rongjie Huang
Yang Liu
H. Lu
Wei Xue
Zhou Zhao
13
3
0
16 Oct 2024
Preference Optimization with Multi-Sample Comparisons
Preference Optimization with Multi-Sample Comparisons
Chaoqi Wang
Zhuokai Zhao
Chen Zhu
Karthik Abinav Sankararaman
Michal Valko
...
Zhaorun Chen
Madian Khabsa
Yuxin Chen
Hao Ma
Sinong Wang
74
10
0
16 Oct 2024
CtrlSynth: Controllable Image Text Synthesis for Data-Efficient
  Multimodal Learning
CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning
Qingqing Cao
Mahyar Najibi
Sachin Mehta
CLIP
DiffM
35
1
0
15 Oct 2024
Jigsaw++: Imagining Complete Shape Priors for Object Reassembly
Jigsaw++: Imagining Complete Shape Priors for Object Reassembly
Jiaxin Lu
Gang Hua
Qixing Huang
44
2
0
15 Oct 2024
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Ziyue Li
Dinesh Manocha
MoE
74
6
0
14 Oct 2024
MEV Capture Through Time-Advantaged Arbitrage
MEV Capture Through Time-Advantaged Arbitrage
Robin Fritsch
Maria Ines Silva
A. Mamageishvili
Benjamin Livshits
E. Felten
39
1
0
14 Oct 2024
Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent
  Approach
Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach
Rory Young
Nicolas Pugeault
AAML
65
0
0
14 Oct 2024
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion
  Transformers
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
Enze Xie
Junsong Chen
Junyu Chen
Han Cai
Haotian Tang
...
Zhekai Zhang
Zhekai Zhang
Ligeng Zhu
Yaojie Lu
Song Han
VLM
52
51
0
14 Oct 2024
Customize Your Visual Autoregressive Recipe with Set Autoregressive
  Modeling
Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling
Wenze Liu
Le Zhuo
Yi Xin
Sheng Xia
Peng Gao
Xiangyu Yue
42
6
0
14 Oct 2024
FasterDiT: Towards Faster Diffusion Transformers Training without
  Architecture Modification
FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification
J. Yao
Wang Cheng
Wenyu Liu
Xinggang Wang
48
8
0
14 Oct 2024
The Ingredients for Robotic Diffusion Transformers
The Ingredients for Robotic Diffusion Transformers
Sudeep Dasari
Oier Mees
Sebastian Zhao
Mohan Kumar Srirama
Sergey Levine
59
20
0
14 Oct 2024
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Peng Xia
Siwei Han
Shi Qiu
Yiyang Zhou
Zhaoyang Wang
...
Chenhang Cui
Mingyu Ding
Linjie Li
Lijuan Wang
Huaxiu Yao
67
10
0
14 Oct 2024
Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal Perspective
Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal Perspective
Xiangru Zhu
Penglei Sun
Yaoxian Song
Yanghua Xiao
Zhixu Li
Chengyu Wang
Jun Huang
Bei Yang
Xiaoxiao Xu
EGVM
260
1
0
14 Oct 2024
Previous
123...111213...151617
Next