ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.10855
  4. Cited By
TextDiffuser: Diffusion Models as Text Painters

TextDiffuser: Diffusion Models as Text Painters

18 May 2023
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
ArXivPDFHTML

Papers citing "TextDiffuser: Diffusion Models as Text Painters"

50 / 100 papers shown
Title
The Devil is in Fine-tuning and Long-tailed Problems:A New Benchmark for Scene Text Detection
The Devil is in Fine-tuning and Long-tailed Problems:A New Benchmark for Scene Text Detection
Tianjiao Cao
Jiahao Lyu
Weichao Zeng
Weimin Mu
Yu Zhou
7
0
0
21 May 2025
LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?
LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?
Maoyuan Ye
Jing Zhang
Juhua Liu
Bo Du
Dacheng Tao
LRM
4
0
0
18 May 2025
Towards Self-Improvement of Diffusion Models via Group Preference Optimization
Towards Self-Improvement of Diffusion Models via Group Preference Optimization
Renjie Chen
Wenfeng Lin
Yichen Zhang
Jiangchuan Wei
Boyuan Liu
Chao Feng
Jiao Ran
Mingyu Guo
17
0
0
16 May 2025
HDGlyph: A Hierarchical Disentangled Glyph-Based Framework for Long-Tail Text Rendering in Diffusion Models
HDGlyph: A Hierarchical Disentangled Glyph-Based Framework for Long-Tail Text Rendering in Diffusion Models
Shuhan Zhuang
Mengqi Huang
Fengyi Fu
Nan Chen
Bohan Lei
Zhendong Mao
DiffM
40
0
0
10 May 2025
Flow-GRPO: Training Flow Matching Models via Online RL
Flow-GRPO: Training Flow Matching Models via Online RL
Jie Liu
Gongye Liu
Jiajun Liang
Yongqian Li
Jiaheng Liu
Xihuai Wang
Pengfei Wan
Di Zhang
Wanli Ouyang
AI4CE
75
0
0
08 May 2025
GlyphMastero: A Glyph Encoder for High-Fidelity Scene Text Editing
GlyphMastero: A Glyph Encoder for High-Fidelity Scene Text Editing
Tong Wang
Ting Liu
Xiaochao Qu
Chengjing Wu
Luoqi Liu
Xiaolin Hu
DiffM
62
0
0
08 May 2025
T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models
T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models
Xuyang Guo
Jiayan Huo
Zhenmei Shi
Zhao Song
Jiahao Zhang
Jiale Zhao
VGen
237
0
0
08 May 2025
FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing
FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing
Rui Lan
Y. Bai
Xu Duan
Mingxing Li
Lei Sun
Xiaowen Chu
DiffM
206
0
0
06 May 2025
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
Jiahui Geng
Jintao Guo
Shanshan Zhao
Minghao Fu
Lunhao Duan
Guo-Hua Wang
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
DiffM
74
0
0
05 May 2025
Visual Text Processing: A Comprehensive Review and Unified Evaluation
Visual Text Processing: A Comprehensive Review and Unified Evaluation
Yan Shu
Weichao Zeng
Fangmin Zhao
Zeyu Chen
Feiyu Xiong
...
Paolo Rota
Xiang Bai
Lianwen Jin
Xu-Cheng Yin
N. Sebe
CoGe
66
0
0
30 Apr 2025
RepText: Rendering Visual Text via Replicating
RepText: Rendering Visual Text via Replicating
Haozhao Wang
Yongjun Xu
Yongqian Li
Jiajun Li
Chaowei Zhang
Jingchao Wang
Kejia Yang
Z. Chen
VLM
66
0
0
28 Apr 2025
Point-Driven Interactive Text and Image Layer Editing Using Diffusion Models
Point-Driven Interactive Text and Image Layer Editing Using Diffusion Models
Zhenyu Yu
Mohd Yamani Idna Idris
Pei Wang
Yuelong Xia
DiffM
31
0
0
18 Apr 2025
ViMo: A Generative Visual GUI World Model for App Agents
ViMo: A Generative Visual GUI World Model for App Agents
Dezhao Luo
Bohan Tang
Kang Li
Georgios Papoudakis
Jifei Song
S. Gong
Haifeng Zhang
Jun Wang
Kun Shao
LM&Ro
VGen
53
0
0
15 Apr 2025
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes
Nikai Du
Zhennan Chen
Zheyu Chen
Shan Gao
Xi Chen
Zhengkai Jiang
Jian Yang
Ying Tai
DiffM
51
0
0
30 Mar 2025
LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis
LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis
Jike Zhong
Qilong Wu
Xinyue Li
Bo Zhang
Ming Li
...
Houqiang Li
Yu Qiao
Peng Gao
Bin Fu
Zhen Li
EGVM
50
0
0
27 Mar 2025
Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models
Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models
Alex Jinpeng Wang
Linjie Li
Zheng Yang
Lijuan Wang
Min Li
DiffM
73
0
0
26 Mar 2025
BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation
BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation
Yuyang Peng
Shishi Xiao
Keming Wu
Qisheng Liao
Bohan Chen
Kevin Lin
Danqing Huang
Ji Li
Yuhui Yuan
DiffM
82
1
0
26 Mar 2025
From Fragment to One Piece: A Survey on AI-Driven Graphic Design
From Fragment to One Piece: A Survey on AI-Driven Graphic Design
Xingxing Zou
Wen Zhang
Nanxuan Zhao
59
0
0
24 Mar 2025
TextInVision: Text and Prompt Complexity Driven Visual Text Generation Benchmark
TextInVision: Text and Prompt Complexity Driven Visual Text Generation Benchmark
Forouzan Fallah
Maitreya Patel
Agneet Chatterjee
Vlad I. Morariu
Chitta Baral
Yezhou Yang
CoGe
67
0
0
17 Mar 2025
DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models
Zhendong Wang
Jianmin Bao
Shuyang Gu
Dong Chen
Wengang Zhou
Houqiang Li
DiffM
53
0
0
03 Mar 2025
M2-omni: Advancing Omni-MLLM for Comprehensive Modality Support with Competitive Performance
M2-omni: Advancing Omni-MLLM for Comprehensive Modality Support with Competitive Performance
Qingpei Guo
Kaiyou Song
Zipeng Feng
Ziping Ma
Qinglong Zhang
...
Yunxiao Sun
Tai-WeiChang
Jingdong Chen
Ming Yang
Jun Zhou
MLLM
VLM
90
3
0
26 Feb 2025
Precise Parameter Localization for Textual Generation in Diffusion Models
Precise Parameter Localization for Textual Generation in Diffusion Models
Łukasz Staniszewski
Bartosz Cywiñski
Franziska Boenisch
Kamil Deja
Adam Dziedzic
DiffM
247
0
0
17 Feb 2025
ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations
ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations
Bowen Jiang
Yuan Yuan
Xinyi Bai
Zhuoqun Hao
Alyson Yin
Yaojie Hu
Wenyu Liao
Lyle Ungar
Camillo J Taylor
DiffM
58
1
0
16 Feb 2025
Assessing the use of Diffusion models for motion artifact correction in brain MRI
Assessing the use of Diffusion models for motion artifact correction in brain MRI
Paolo Angella
Vito Paolo Pastore
Matteo Santacesaria
MedIm
DiffM
67
1
0
03 Feb 2025
Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation
Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation
Minxing Luo
Zixun Xia
L. Chen
Zhenhang Li
Weichao Zeng
Jize Wang
Wentao Cheng
Yaxing Wang
Yu Zhou
Jian Yang
DiffM
57
1
0
10 Jan 2025
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Jiawei Liu
Yuanzhi Zhu
Feiyu Gao
Zheng Yang
P. Wang
Junyang Lin
Xihuai Wang
Wenyu Liu
DiffM
45
0
0
08 Jan 2025
Hierarchical Vision-Language Alignment for Text-to-Image Generation via Diffusion Models
Emily Johnson
Noah Wilson
VLM
62
0
0
03 Jan 2025
CharGen: High Accurate Character-Level Visual Text Generation Model with
  MultiModal Encoder
CharGen: High Accurate Character-Level Visual Text Generation Model with MultiModal Encoder
Lichen Ma
Tiezhu Yue
Pei Fu
Yujie Zhong
Kai Zhou
Xiaoming Wei
Jie Hu
DiffM
83
2
0
23 Dec 2024
TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition
TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition
Xingsong Ye
Yongkun Du
Yunbo Tao
Z. Chen
DiffM
129
0
0
02 Dec 2024
Type-R: Automatically Retouching Typos for Text-to-Image Generation
Type-R: Automatically Retouching Typos for Text-to-Image Generation
Wataru Shimoda
Naoto Inoue
Daichi Haraguchi
Hayato Mitani
S. Uchida
Kota Yamaguchi
DiffM
116
0
0
27 Nov 2024
GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration
GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration
Sudarshan Rajagopalan
Nithin Gopalakrishnan Nair
Jay N. Paranjape
Vishal M. Patel
DiffM
98
0
0
26 Nov 2024
AnyText2: Visual Text Generation and Editing With Customizable
  Attributes
AnyText2: Visual Text Generation and Editing With Customizable Attributes
Yuxiang Tuo
Yifeng Geng
Liefeng Bo
VLM
93
6
0
22 Nov 2024
GLDesigner: Leveraging Multi-Modal LLMs as Designer for Enhanced Aesthetic Text Glyph Layouts
Junwen He
Yifan Wang
Lijun Wang
Huchuan Lu
Jun-Yan He
Chong Li
Hanyuan Chen
Jin-Peng Lan
Bin Luo
Yifeng Geng
74
1
0
18 Nov 2024
Text2CAD: Text to 3D CAD Generation via Technical Drawings
Text2CAD: Text to 3D CAD Generation via Technical Drawings
Mohsen Yavartanoo
S. Hong
Reyhaneh Neshatavar
Kyoung Mu Lee
25
1
0
09 Nov 2024
TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Georgia Gabriela Sampaio
Ruixiang Zhang
Shuangfei Zhai
Jiatao Gu
J. Susskind
Navdeep Jaitly
Yizhe Zhang
DiffM
CLIP
40
0
0
02 Nov 2024
TextDestroyer: A Training- and Annotation-Free Diffusion Method for
  Destroying Anomal Text from Images
TextDestroyer: A Training- and Annotation-Free Diffusion Method for Destroying Anomal Text from Images
Mengcheng Li
Mingbao Lin
Rongrong Ji
Chia-Wen Lin
Rongrong Ji
DiffM
58
0
0
01 Nov 2024
Towards Visual Text Design Transfer Across Languages
Towards Visual Text Design Transfer Across Languages
Yejin Choi
Jiwan Chung
Sumin Shim
Giyeong Oh
Youngjae Yu
VLM
DiffM
48
1
0
24 Oct 2024
Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling
Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling
Guiyu Zhang
Huan-ang Gao
Zijian Jiang
Hao Zhao
Zhedong Zheng
EGVM
57
6
0
15 Oct 2024
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
Weichao Zeng
Yan Shu
Zhenhang Li
Dongbao Yang
Yu Zhou
DiffM
29
7
0
14 Oct 2024
TextMaster: Universal Controllable Text Edit
TextMaster: Universal Controllable Text Edit
Aoqiang Wang
Yufei Guo
Zhenyu Yan
Wenxiang Shang
Ran Lin
Zhao Zhang
DiffM
28
2
0
13 Oct 2024
TextLap: Customizing Language Models for Text-to-Layout Planning
TextLap: Customizing Language Models for Text-to-Layout Planning
Jian Chen
Ruiyi Zhang
Yufan Zhou
Jennifer Healey
J. Gu
Zhiqiang Xu
Cen Chen
VLM
44
3
0
09 Oct 2024
A Reflection on the Impact of Misspecifying Unidentifiable Causal
  Inference Models in Surrogate Endpoint Evaluation
A Reflection on the Impact of Misspecifying Unidentifiable Causal Inference Models in Surrogate Endpoint Evaluation
Gokce Deliorman
Florian Stijven
Wim Van der Elst
Maria del Carmen Pardo
Ariel Alonso
CML
42
0
0
06 Oct 2024
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive
  Transformer for Efficient Finegrained Image Generation
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation
Liang Chen
Sinan Tan
Zefan Cai
Weichu Xie
Haozhe Zhao
Yichi Zhang
Junyang Lin
Jinze Bai
Tianyu Liu
Baobao Chang
ViT
58
3
0
02 Oct 2024
Text Image Generation for Low-Resource Languages with Dual Translation
  Learning
Text Image Generation for Low-Resource Languages with Dual Translation Learning
Chihiro Noguchi
Shun Fukuda
Shoichiro Mihara
Masao Yamanaka
DiffM
36
0
0
26 Sep 2024
AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status
AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status
Jinghao Zhang
Wen Qian
Hao Luo
Fan Wang
Feng Zhao
DiffM
32
0
0
26 Sep 2024
Neural Contrast: Leveraging Generative Editing for Graphic Design
  Recommendations
Neural Contrast: Leveraging Generative Editing for Graphic Design Recommendations
Marian Lupascu
Ionut Mironica
Mihai-Sorin Stupariu
DiffM
33
0
0
26 Sep 2024
DiffusionPen: Towards Controlling the Style of Handwritten Text
  Generation
DiffusionPen: Towards Controlling the Style of Handwritten Text Generation
Konstantina Nikolaidou
George Retsinas
Giorgos Sfikas
Marcus Liwicki
DiffM
45
3
0
09 Sep 2024
Elucidating Optimal Reward-Diversity Tradeoffs in Text-to-Image
  Diffusion Models
Elucidating Optimal Reward-Diversity Tradeoffs in Text-to-Image Diffusion Models
Rohit Jena
Ali Taghibakhshi
Sahil Jain
Gerald Shen
Nima Tajbakhsh
Arash Vahdat
44
3
0
09 Sep 2024
Forward KL Regularized Preference Optimization for Aligning Diffusion
  Policies
Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
Zhao Shan
Chenyou Fan
Shuang Qiu
Jiyuan Shi
Chenjia Bai
45
4
0
09 Sep 2024
Zero-Shot Paragraph-level Handwriting Imitation with Latent Diffusion
  Models
Zero-Shot Paragraph-level Handwriting Imitation with Latent Diffusion Models
Martin Mayr
Marcel Dreier
Florian Kordon
Mathias Seuret
Jochen Zöllner
Fei Wu
Andreas Maier
Vincent Christlein
DiffM
59
1
0
01 Sep 2024
12
Next