ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.07556
  4. Cited By
SketchFlex: Facilitating Spatial-Semantic Coherence in Text-to-Image Generation with Region-Based Sketches

SketchFlex: Facilitating Spatial-Semantic Coherence in Text-to-Image Generation with Region-Based Sketches

11 February 2025
Haichuan Lin
Yilin Ye
Jiazhi Xia
Wei Zeng
    DiffM
ArXivPDFHTML

Papers citing "SketchFlex: Facilitating Spatial-Semantic Coherence in Text-to-Image Generation with Region-Based Sketches"

11 / 11 papers shown
Title
On Discrete Prompt Optimization for Diffusion Models
On Discrete Prompt Optimization for Diffusion Models
Ruochen Wang
Ting Liu
Cho-Jui Hsieh
Boqing Gong
DiffM
66
7
0
27 Jun 2024
DOCCI: Descriptions of Connected and Contrasting Images
DOCCI: Descriptions of Connected and Contrasting Images
Yasumasa Onoe
Sunayana Rane
Zachary Berger
Yonatan Bitton
Jaemin Cho
...
Zarana Parekh
Jordi Pont-Tuset
Garrett Tanzer
Su Wang
Jason Baldridge
59
57
0
30 Apr 2024
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and
  Generating with Multimodal LLMs
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
Ling Yang
Zhaochen Yu
Chenlin Meng
Minkai Xu
Stefano Ermon
Tengjiao Wang
CoGe
DiffM
70
127
0
22 Jan 2024
StreamDiffusion: A Pipeline-level Solution for Real-time Interactive
  Generation
StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation
Akio Kodaira
Chenfeng Xu
Toshiki Hazama
Takanori Yoshimoto
Kohei Ohno
Shogo Mitsuhori
Soichi Sugano
Hanying Cho
Zhijian Liu
Kurt Keutzer
52
34
0
19 Dec 2023
BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained
  Diffusion
BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
Jinheng Xie
Yuexiang Li
Yawen Huang
Haozhe Liu
Wentian Zhang
Yefeng Zheng
Mike Zheng Shou
DiffM
102
200
0
20 Jul 2023
Picture that Sketch: Photorealistic Image Generation from Abstract
  Sketches
Picture that Sketch: Photorealistic Image Generation from Abstract Sketches
Subhadeep Koley
A. Bhunia
Aneeshan Sain
Pinaki Nath Chowdhury
Tao Xiang
Yi-Zhe Song
3DH
53
32
0
20 Mar 2023
3DALL-E: Integrating Text-to-Image AI in 3D Design Workflows
3DALL-E: Integrating Text-to-Image AI in 3D Design Workflows
Vivian Liu
Jo Vermeulen
G. Fitzmaurice
Justin Matejka
HAI
52
119
0
20 Oct 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
294
6,768
0
13 Apr 2022
Design Guidelines for Prompt Engineering Text-to-Image Generative Models
Design Guidelines for Prompt Engineering Text-to-Image Generative Models
Vivian Liu
Lydia B. Chilton
42
482
0
14 Sep 2021
CLIPScore: A Reference-free Evaluation Metric for Image Captioning
CLIPScore: A Reference-free Evaluation Metric for Image Captioning
Jack Hessel
Ari Holtzman
Maxwell Forbes
Ronan Le Bras
Yejin Choi
CLIP
84
1,512
0
18 Apr 2021
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot
  Cross-dataset Transfer
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer
René Ranftl
Katrin Lasinger
David Hafner
Konrad Schindler
V. Koltun
MDE
173
1,774
0
02 Jul 2019
1