ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2501.00603
  4. Cited By
DiC: Rethinking Conv3x3 Designs in Diffusion Models
v1v2 (latest)

DiC: Rethinking Conv3x3 Designs in Diffusion Models

Computer Vision and Pattern Recognition (CVPR), 2024
31 December 2024
Yuchuan Tian
Jing Han
Chengcheng Wang
Yuchen Liang
Chao Xu
Hanting Chen
    DiffM
ArXiv (abs)PDFHTML

Papers citing "DiC: Rethinking Conv3x3 Designs in Diffusion Models"

20 / 20 papers shown
Title
Rectifying Magnitude Neglect in Linear Attention
Rectifying Magnitude Neglect in Linear Attention
Qihang Fan
Huaibo Huang
Yuang Ai
Xiao-Yu Zhang
251
4
0
01 Jul 2025
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling
Yuang Ai
Qihang Fan
Xuefeng Hu
Zhenheng Yang
Xiao-Yu Zhang
Huaibo Huang
DiffM
282
1
0
16 May 2025
U-REPA: Aligning Diffusion U-Nets to ViTs
U-REPA: Aligning Diffusion U-Nets to ViTs
Yuchuan Tian
Hanting Chen
Mengyu Zheng
Yuchen Liang
Chao Xu
Yunhe Wang
255
5
0
24 Mar 2025
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion
  Transformers
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
Enze Xie
Junsong Chen
Junyu Chen
Han Cai
Haotian Tang
...
Zhekai Zhang
Zhekai Zhang
Ligeng Zhu
Yaojie Lu
Song Han
VLM
243
172
0
14 Oct 2024
Wavelet Convolutions for Large Receptive Fields
Wavelet Convolutions for Large Receptive Fields
Shahaf E. Finder
Roy Amoyal
Eran Treister
Oren Freifeld
ViTMDE
379
261
0
08 Jul 2024
DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis
DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis
Yao Teng
Yue Wu
Han Shi
Xuefei Ning
Guohao Dai
Yu Wang
Zhenguo Li
Xihui Liu
Mamba
230
62
0
23 May 2024
U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers
U-DiTs: Downsample Tokens in U-Shaped Diffusion TransformersNeural Information Processing Systems (NeurIPS), 2024
Yuchuan Tian
Zhijun Tu
Hanting Chen
Jie Hu
Chao Xu
Yunhe Wang
170
32
0
04 May 2024
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K
  Text-to-Image Generation
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image GenerationEuropean Conference on Computer Vision (ECCV), 2024
Junsong Chen
Chongjian Ge
Enze Xie
Yue Wu
Lewei Yao
Xiaozhe Ren
Zhongdao Wang
Ping Luo
Huchuan Lu
Zhenguo Li
568
200
0
07 Mar 2024
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
...
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
DiffM
995
2,470
0
05 Mar 2024
SiT: Exploring Flow and Diffusion-based Generative Models with Scalable
  Interpolant Transformers
SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant TransformersEuropean Conference on Computer Vision (ECCV), 2024
Nanye Ma
Mark Goldstein
M. S. Albergo
Nicholas M. Boffi
Eric Vanden-Eijnden
Saining Xie
DiffM
296
384
0
16 Jan 2024
PIXART-δ: Fast and Controllable Image Generation with Latent
  Consistency Models
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models
Junsong Chen
Yue Wu
Simian Luo
Enze Xie
Sayak Paul
Ping Luo
Hang Zhao
Zhenguo Li
VLM
181
114
0
10 Jan 2024
DiffiT: Diffusion Vision Transformers for Image Generation
DiffiT: Diffusion Vision Transformers for Image GenerationEuropean Conference on Computer Vision (ECCV), 2023
Ali Hatamizadeh
Jiaming Song
Guilin Liu
Jan Kautz
Arash Vahdat
275
111
0
04 Dec 2023
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Albert Gu
Tri Dao
Mamba
458
4,784
0
01 Dec 2023
Diffusion Models Without Attention
Diffusion Models Without AttentionComputer Vision and Pattern Recognition (CVPR), 2023
Jing Nathan Yan
Jiatao Gu
Alexander M. Rush
264
89
0
30 Nov 2023
PixArt-$α$: Fast Training of Diffusion Transformer for
  Photorealistic Text-to-Image Synthesis
PixArt-ααα: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisInternational Conference on Learning Representations (ICLR), 2023
Junsong Chen
Jincheng Yu
Chongjian Ge
Lewei Yao
Enze Xie
...
Zhongdao Wang
James T. Kwok
Ping Luo
Huchuan Lu
Zhenguo Li
DiffM
441
626
0
30 Sep 2023
FlashAttention-2: Faster Attention with Better Parallelism and Work
  Partitioning
FlashAttention-2: Faster Attention with Better Parallelism and Work PartitioningInternational Conference on Learning Representations (ICLR), 2023
Tri Dao
LRM
325
1,938
0
17 Jul 2023
Scalable Diffusion Models with Transformers
Scalable Diffusion Models with TransformersIEEE International Conference on Computer Vision (ICCV), 2022
William S. Peebles
Saining Xie
GNN
974
3,850
0
19 Dec 2022
InternImage: Exploring Large-Scale Vision Foundation Models with
  Deformable Convolutions
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable ConvolutionsComputer Vision and Pattern Recognition (CVPR), 2022
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
...
Tong Lu
Lewei Lu
Jiaming Song
Xiaogang Wang
Yu Qiao
VLM
411
924
0
10 Nov 2022
All are Worth Words: A ViT Backbone for Diffusion Models
All are Worth Words: A ViT Backbone for Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2022
Fan Bao
Shen Nie
Kaiwen Xue
Yue Cao
Chongxuan Li
Hang Su
Jun Zhu
VLM
425
478
0
25 Sep 2022
FlashAttention: Fast and Memory-Efficient Exact Attention with
  IO-Awareness
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-AwarenessNeural Information Processing Systems (NeurIPS), 2022
Tri Dao
Daniel Y. Fu
Stefano Ermon
Atri Rudra
Christopher Ré
VLM
731
3,122
0
27 May 2022
1