2306.09344
Cited By
DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data
15 June 2023
Stephanie Fu
Netanel Y. Tamir
Shobhita Sundaram
Lucy Chai
Richard Y. Zhang
Tali Dekel
Phillip Isola
EGVM
Papers citing "DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data"
50 / 81 papers shown
Image Interpolation with Score-based Riemannian Metrics of Diffusion Models
Shinnosuke Saito
Takashi Matsubara
DiffM
82
1
0
28 Apr 2025
HepatoGEN: Generating Hepatobiliary Phase MRI with Perceptual and Adversarial Models
Jens Hooge
Gerard Sanroma-Guell
Faidra Stavropoulou
Alexander Ullmann
Gesine Knobloch
Mark Klemens
Carola Schmidt
Sabine Weckbach
Andreas Bolz
DiffM
MedIm
97
0
0
25 Apr 2025
Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation
Shivam Duggal
Yushi Hu
Oscar Michel
Aniruddha Kembhavi
William T. Freeman
Noah A. Smith
Ranjay Krishna
Antonio Torralba
Ali Farhadi
Wei-Chiu Ma
EGVM
ELM
77
0
0
25 Apr 2025
Augmenting Perceptual Super-Resolution via Image Quality Predictors
Fengjia Zhang
Samrudhdhi B. Rangrej
Tristan Aumentado-Armstrong
Afsaneh Fazly
Alex Levinshtein
SupR
72
0
0
25 Apr 2025
LayoutCoT: Unleashing the Deep Reasoning Potential of Large Language Models for Layout Generation
Hengyu Shi
Junhao Su
Huansheng Ning
Xiaoming Wei
Jialin Gao
3DV
AI4TS
LRM
57
0
0
15 Apr 2025
Storybooth: Training-free Multi-Subject Consistency for Improved Visual Storytelling
Jaskirat Singh
Junshen Kevin Chen
Jonas Kohler
Michael Cohen
DiffM
VGen
43
0
0
08 Apr 2025
Dynamic Objective MPC for Motion Planning of Seamless Docking Maneuvers
Oliver Schumann
Michael Buchholz
Klaus C. J. Dietmayer
40
0
0
04 Apr 2025
Deep Reinforcement Learning via Object-Centric Attention
Jannis Blüml
Cedric Derstroff
Bjarne Gregori
Elisabeth Dillies
Quentin Delfosse
Kristian Kersting
OCL
49
0
0
03 Apr 2025
Object Isolated Attention for Consistent Story Visualization
Xiangyang Luo
Junhao Cheng
Yifan Xie
Xin Zhang
Tao Feng
Ziqiang Liu
Fei Ma
Fei Richard Yu
DiffM
50
1
0
30 Mar 2025
Evaluating Text-to-Image Synthesis with a Conditional Fréchet Distance
Jaywon Koo
J. Hernandez
Moayed Haji-Ali
Ziyan Yang
Vicente Ordonez
EGVM
72
0
0
27 Mar 2025
What's Producible May Not Be Reachable: Measuring the Steerability of Generative Models
Keyon Vafa
Sarah Bentley
Jon M. Kleinberg
S. Mullainathan
43
0
0
21 Mar 2025
ObjectMover: Generative Object Movement with Video Prior
Xin Yu
Tianyu Wang
Seunggeun Kim
Paul Guerrero
Xi Chen
Qing Liu
Zhe Lin
Xiaojuan Qi
DiffM
VGen
OCL
81
0
0
11 Mar 2025
MTReD: 3D Reconstruction Dataset for Fly-over Videos of Maritime Domain
Rui Yi Yong
Samuel Picosson
Arnold Wiliem
37
0
0
02 Mar 2025
Seeing Eye to AI? Applying Deep-Feature-Based Similarity Metrics to Information Visualization
Sheng Long
Angelos Chatzimparmpas
Emma Alexander
Matthew Kay
Jessica Hullman
34
0
0
28 Feb 2025
Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation
H. Seo
Wongi Jeong
Jae-sun Seo
Se Young Chun
62
0
0
12 Feb 2025
SliderSpace: Decomposing the Visual Capabilities of Diffusion Models
Rohit Gandikota
Zongze Wu
Richard Zhang
David Bau
Eli Shechtman
Nick Kolkin
DiffM
53
1
0
03 Feb 2025
The in-context inductive biases of vision-language models differ across modalities
Kelsey Allen
Ishita Dasgupta
Eliza Kosoy
Andrew Kyle Lampinen
70
0
0
03 Feb 2025
CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models
Kuan-Hung Liu
Cheng-Kun Yang
Min-Hung Chen
Yu-Lun Liu
Y. Lin
DiffM
33
1
0
04 Jan 2025
VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks
Ziyan Jiang
Rui Meng
Xinyi Yang
Semih Yavuz
Yingbo Zhou
Wenhu Chen
MLLM
VLM
51
20
0
03 Jan 2025
Similarity Trajectories: Linking Sampling Process to Artifacts in Diffusion-Generated Images
Dennis Menn
Feng Liang
Hung-Yueh Chiang
Diana Marculescu
DiffM
77
0
0
22 Dec 2024
GME: Improving Universal Multimodal Retrieval by Multimodal LLMs
Xin Zhang
Yanzhao Zhang
Wen Xie
Mingxin Li
Ziqi Dai
Dingkun Long
Pengjun Xie
Meishan Zhang
Wenjie Li
Hao Fei
116
8
0
22 Dec 2024
Navigation World Models
Amir Bar
G. Zhou
Danny Tran
Trevor Darrell
Yann LeCun
VGen
EgoV
82
14
0
04 Dec 2024
Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors
Anisha Pal
Julia Kruk
Mansi Phute
Manognya Bhattaram
Diyi Yang
Duen Horng Chau
Judy Hoffman
AAML
47
2
0
12 Nov 2024
BrainBits: How Much of the Brain are Generative Reconstruction Methods Using?
David Mayo
Christopher Wang
Asa Harbin
Abdulrahman Alabdulkareem
Albert Eaton Shaw
Boris Katz
Andrei Barbu
DiffM
49
0
0
05 Nov 2024
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs
Sheng-Chieh Lin
Chankyu Lee
M. Shoeybi
Jimmy J. Lin
Bryan Catanzaro
Ming-Yu Liu
67
12
0
04 Nov 2024
Unbounded: A Generative Infinite Game of Character Life Simulation
Jialu Li
Yuanzhen Li
Neal Wadhwa
Yael Pritch
David E. Jacobs
Michael Rubinstein
Joey Tianyi Zhou
Nataniel Ruiz
VGen
AI4CE
36
4
0
24 Oct 2024
FairQueue: Rethinking Prompt Learning for Fair Text-to-Image Generation
Christopher T. H. Teo
Milad Abdollahzadeh
Xinda Ma
Ngai-man Cheung
DiffM
26
1
0
24 Oct 2024
FrameBridge: Improving Image-to-Video Generation with Bridge Models
Yuji Wang
Zehua Chen
Xiaoyu Chen
Jun-Jie Zhu
Jianfei Chen
DiffM
VGen
176
1
0
20 Oct 2024
MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models
Donghao Zhou
Jiancheng Huang
J. Bai
Jiaze Wang
Hao Chen
Guangyong Chen
Xiaowei Hu
Pheng Ann Heng
47
5
0
17 Oct 2024
LVD-2M: A Long-take Video Dataset with Temporally Dense Captions
Tianwei Xiong
Yuqing Wang
Daquan Zhou
Zhijie Lin
Jiashi Feng
Xihui Liu
VGen
33
7
0
14 Oct 2024
DreamStruct: Understanding Slides and User Interfaces via Synthetic Data Generation
Yi-Hao Peng
Faria Huq
Yue Jiang
Jason Wu
Amanda Li
Jeffrey P. Bigham
Amy Pavel
DiffM
35
4
0
30 Sep 2024
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
Haotian Zhang
Mingfei Gao
Zhe Gan
Philipp Dufter
Nina Wenzel
...
Haoxuan You
Zirui Wang
Afshin Dehghan
Peter Grasch
Yinfei Yang
VLM
MLLM
40
32
1
30 Sep 2024
Foundation Models Boost Low-Level Perceptual Similarity Metrics
Abhijay Ghildyal
Nabajeet Barman
Saman Zadtootaghaj
42
3
0
11 Sep 2024
Thinking Outside the BBox: Unconstrained Generative Object Compositing
Gemma Canet Tarrés
Zhe Lin
Zhifei Zhang
Jianming Zhang
Yizhi Song
Dan Ruta
Andrew Gilbert
John Collomosse
Soo Ye Kim
DiffM
35
9
0
06 Sep 2024
A General Albedo Recovery Approach for Aerial Photogrammetric Images through Inverse Rendering
Shuang Song
R. Qin
44
0
0
04 Sep 2024
Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion
Adi Haviv
Shahar Sarfaty
Uri Y. Hacohen
N. Elkin-Koren
Roi Livni
Amit H. Bermano
37
2
0
15 Aug 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
DiffM
VGen
62
12
0
17 Jul 2024
Geospecific View Generation -- Geometry-Context Aware High-resolution Ground View Inference from Satellite Views
Ningli Xu
R. Qin
72
5
0
10 Jul 2024
DRAGON: Drone and Ground Gaussian Splatting for 3D Building Reconstruction
Yujin Ham
Mateusz Michalkiewicz
Guha Balakrishnan
63
1
0
01 Jul 2024
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
Yuang Peng
Yuxin Cui
Haomiao Tang
Zekun Qi
Runpei Dong
Jing Bai
Chunrui Han
Zheng Ge
Xiangyu Zhang
Shu-Tao Xia
EGVM
75
31
0
24 Jun 2024
Consistency-diversity-realism Pareto fronts of conditional image generative models
Pietro Astolfi
Marlene Careil
Melissa Hall
Oscar Manas
Matthew Muckley
Jakob Verbeek
Adriana Romero Soriano
M. Drozdzal
51
10
0
14 Jun 2024
Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation
Raphael Tang
Xinyu Crystina Zhang
Lixinyu Xu
Yao Lu
Wenyan Li
Pontus Stenetorp
Jimmy Lin
Ferhan Ture
34
0
0
12 Jun 2024
FaithFill: Faithful Inpainting for Object Completion Using a Single Reference Image
Rupayan Mallick
Amr Abdalla
Sarah Adel Bargal
DiffM
26
0
0
12 Jun 2024
GenAI Arena: An Open Evaluation Platform for Generative Models
Dongfu Jiang
Max W.F. Ku
Tianle Li
Yuansheng Ni
Shizhuo Sun
Rongqi Fan
Wenhu Chen
EGVM
41
20
0
06 Jun 2024
fruit-SALAD: A Style Aligned Artwork Dataset to reveal similarity perception in image embeddings
Tillmann Ohm
Andres Karjus
Mikhail Tamm
Maximilian Schich
38
1
0
03 Jun 2024
URDFormer: A Pipeline for Constructing Articulated Simulation Environments from Real-World Images
Zoey Chen
Aaron Walsman
Marius Memmel
Kaichun Mo
Alex Fang
Karthikeya Vemuri
Alan Wu
Dieter Fox
Abhishek Gupta
AI4CE
VGen
65
26
0
19 May 2024
AquaLoRA: Toward White-box Protection for Customized Stable Diffusion Models via Watermark LoRA
Weitao Feng
Wenbo Zhou
Jiyan He
Jie Zhang
Tianyi Wei
Guanlin Li
Tianwei Zhang
Weiming Zhang
Neng H. Yu
31
18
0
18 May 2024
Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark Study
Farnaz Khun Jush
Steffen Vogler
Tuan Truong
Matthias Lenga
42
2
0
15 May 2024
MANTIS: Interleaved Multi-Image Instruction Tuning
Dongfu Jiang
Xuan He
Huaye Zeng
Cong Wei
Max W.F. Ku
Qian Liu
Wenhu Chen
VLM
MLLM
33
103
0
02 May 2024
DOCCI: Descriptions of Connected and Contrasting Images
Yasumasa Onoe
Sunayana Rane
Zachary Berger
Yonatan Bitton
Jaemin Cho
...
Zarana Parekh
Jordi Pont-Tuset
Garrett Tanzer
Su Wang
Jason Baldridge
41
48
0
30 Apr 2024