Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2209.00647
Cited By
Visual Prompting via Image Inpainting
Neural Information Processing Systems (NeurIPS), 2022
1 September 2022
Amir Bar
Yossi Gandelsman
Trevor Darrell
Amir Globerson
Alexei A. Efros
VLM
VPVLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"Visual Prompting via Image Inpainting"
50 / 180 papers shown
Title
Rethinking Visual Intelligence: Insights from Video Pretraining
Pablo Acuaviva
A. Davtyan
Mariam Hassan
Sebastian Stapf
Ahmad Rahimi
Alexandre Alahi
Paolo Favaro
VLM
LRM
62
0
0
28 Oct 2025
Point Prompting: Counterfactual Tracking with Video Diffusion Models
Ayush Shrivastava
Sanyam Mehta
Daniel Geng
Andrew Owens
DiffM
VGen
36
0
0
13 Oct 2025
Towards Reliable and Holistic Visual In-Context Learning Prompt Selection
Wenxiao Wu
Jing-Hao Xue
C. Xu
Chen Liu
Xinwei Sun
Changxin Gao
Nong Sang
Yanwei Fu
65
0
0
30 Sep 2025
Personalized Vision via Visual In-Context Learning
Yuxin Jiang
Yuchao Gu
Yiren Song
Ivor Tsang
Mike Zheng Shou
VLM
58
2
0
29 Sep 2025
PANICL: Mitigating Over-Reliance on Single Prompt in Visual In-Context Learning
Jiahao Zhang
Bowen Wang
Hong Liu
Yuta Nakashima
Hajime Nagahara
MLLM
VLM
66
1
0
26 Sep 2025
UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models
Lan Chen
Yuchao Gu
Qi Mao
VGen
28
0
0
26 Sep 2025
History-Aware Visuomotor Policy Learning via Point Tracking
Jingjing Chen
Hongjie Fang
Chenxi Wang
Shiquan Wang
Cewu Lu
64
0
0
21 Sep 2025
Enhancing Video Large Language Models with Structured Multi-Video Collaborative Reasoning (early version)
Zhihao He
Tianyao He
Tieyuan Chen
Yun Xu
Huabin Liu
Chaofan Gan
Gui Zou
W. Lin
60
0
0
16 Sep 2025
RoboChemist: Long-Horizon and Safety-Compliant Robotic Chemical Experimentation
Z. Zhang
Chenghao Yue
Haobo Xu
Minwen Liao
Xianglin Qi
Huan-ang Gao
Ziwei Wang
Hao Zhao
56
1
0
10 Sep 2025
Human-in-Context: Unified Cross-Domain 3D Human Motion Modeling via In-Context Learning
Mengyuan Liu
Xinshun Wang
Zhongbin Fang
Deheng Ye
Xia Li
Tao Tang
Songtao Wu
Xiangtai Li
Ming-Hsuan Yang
3DH
71
0
0
14 Aug 2025
Stable Diffusion Models are Secretly Good at Visual In-Context Learning
Trevine Oorloff
Vishwanath Sindagi
Wele Gedara Chaminda Bandara
Ali Shafahi
Amin Ghiasi
Charan Prakash
R. Ardekani
DiffM
VLM
67
1
0
13 Aug 2025
Exploring Scalable Unified Modeling for General Low-Level Vision
Xiangyu Chen
Kaiwen Zhu
Yuandong Pu
Shuo Cao
Xiaohui Li
Wenlong Zhang
Yihao Liu
Yu Qiao
Jiantao Zhou
Chao Dong
58
2
0
20 Jul 2025
DIP: Unsupervised Dense In-Context Post-training of Visual Representations
Sophia Sirko-Galouchenko
Spyros Gidaris
Antonín Vobecký
Andrei Bursuc
Nicolas Thome
141
0
0
23 Jun 2025
Visual-Instructed Degradation Diffusion for All-in-One Image Restoration
Computer Vision and Pattern Recognition (CVPR), 2025
Wenyang Luo
Haina Qin
Zewen Chen
L. xilinx Wang
Dandan Zheng
Yuming Li
Yufan Liu
B. Li
Weiming Hu
117
3
0
20 Jun 2025
Conquering the Retina: Bringing Visual in-Context Learning to OCT
Alessio Negrini
Simon Reiß
102
0
0
18 Jun 2025
Vision Generalist Model: A Survey
International Journal of Computer Vision (IJCV), 2025
Ziyi Wang
Yongming Rao
Shuofeng Sun
Xinrun Liu
Yi Wei
...
Zuyan Liu
Yanbo Wang
Hongmin Liu
Jie Zhou
Jiwen Lu
185
0
0
11 Jun 2025
PairEdit: Learning Semantic Variations for Exemplar-based Image Editing
Haoguang Lu
Jiacheng Chen
Zhenguo Yang
Aurele Tohokantche Gnanha
Fu Lee Wang
Li Qing
Xudong Mao
DiffM
183
0
0
09 Jun 2025
Neural Network Reprogrammability: A Unified Theme on Model Reprogramming, Prompt Tuning, and Prompt Instruction
Zesheng Ye
C. Cai
Ruijiang Dong
Jianzhong Qi
Bingquan Shen
Pin-Yu Chen
Feng Liu
395
1
0
05 Jun 2025
A Generative Adaptive Replay Continual Learning Model for Temporal Knowledge Graph Reasoning
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Zhiyu Zhang
Wei Chen
Youfang Lin
Huaiyu Wan
OffRL
CLL
259
0
0
04 Jun 2025
gen2seg: Generative Models Enable Generalizable Instance Segmentation
Om Khangaonkar
Hamed Pirsiavash
DiffM
VLM
256
0
0
21 May 2025
Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning
Computer Vision and Pattern Recognition (CVPR), 2025
Jiadong Wang
Tianci Luo
Yaohua Zha
Yan Feng
Ruisheng Luo
Bin Chen
Tao Dai
Long Chen
Yaowei Wang
Shu-Tao Xia
VLM
188
0
0
30 Apr 2025
E-InMeMo: Enhanced Prompting for Visual In-Context Learning
Journal of Imaging (JI), 2025
Jiahao Zhang
Bowen Wang
Hong Liu
Liangzhi Li
Yuta Nakashima
Hajime Nagahara
VLM
278
1
0
25 Apr 2025
Visual Prompting for One-shot Controllable Video Editing without Inversion
Computer Vision and Pattern Recognition (CVPR), 2025
Zitao Gao
Yuxi Zhou
Duo Peng
Joo-Hwee Lim
Zhigang Tu
De Wen Soh
Lin Geng Foo
DiffM
213
3
0
19 Apr 2025
RefComp: A Reference-guided Unified Framework for Unpaired Point Cloud Completion
IEEE transactions on multimedia (TMM), 2025
Yixuan Yang
Jinyu Yang
Zixiang Zhao
Victor Sanchez
Feng Zheng
140
0
0
18 Apr 2025
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency
Mengshi Qi
Pengfei Zhu
Xianrui Li
Xiaoyang Bi
Lu Qi
Huadong Ma
Ming-Hsuan Yang
VOS
VLM
253
0
0
16 Apr 2025
DSM: Constructing a Diverse Semantic Map for 3D Visual Grounding
Qinghongbing Xie
Zijian Liang
Fuhao Li
Long Zeng
163
0
0
11 Apr 2025
VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning
Zhong-Yu Li
Ruoyi Du
Juncheng Yan
Le Zhuo
Zhen Li
Peng Gao
Zhanyu Ma
Ming-Ming Cheng
VLM
237
18
0
10 Apr 2025
Test-Time Visual In-Context Tuning
Computer Vision and Pattern Recognition (CVPR), 2025
Jiahao Xie
A. Tonioni
N. Rauschmayr
F. Tombari
Bernt Schiele
OOD
VLM
143
3
0
27 Mar 2025
PAVE: Patching and Adapting Video Large Language Models
Computer Vision and Pattern Recognition (CVPR), 2025
Zhuoming Liu
Yiquan Li
Khoi Duc Nguyen
Yiwu Zhong
Yin Li
KELM
LRM
232
1
0
25 Mar 2025
MotionDiff: Training-free Zero-shot Interactive Motion Editing via Flow-assisted Multi-view Diffusion
Yikun Ma
Yiqing Li
Jiawei Wu
Xing Luo
Zhi Jin
DiffM
VGen
408
1
0
22 Mar 2025
A Recipe for Generating 3D Worlds From a Single Image
Katja Schwarz
Denys Rozumnyi
Samuel Rota Buló
Lorenzo Porzi
Peter Kontschieder
VGen
195
5
0
20 Mar 2025
CAM-Seg: A Continuous-valued Embedding Approach for Semantic Image Generation
Masud Ahmed
Zahid Hasan
Syed Arefinul Haque
A. Faridee
S. Purushotham
Suya You
Nirmalya Roy
305
0
0
19 Mar 2025
HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model
Tao Wang
Changxu Cheng
Lingfeng Wang
Senda Chen
Wuyue Zhao
VLM
209
6
0
17 Mar 2025
Edit Transfer: Learning Image Editing via Vision In-Context Relations
Lan Chen
Qi Mao
Yuchao Gu
Mike Zheng Shou
243
6
0
17 Mar 2025
KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation
IEEE International Conference on Robotics and Automation (ICRA), 2025
Zixian Liu
Mingtong Zhang
Yunzhu Li
156
5
0
13 Mar 2025
Towards Ambiguity-Free Spatial Foundation Model: Rethinking and Decoupling Depth Ambiguity
Xiaohao Xu
Feng Xue
Xianrui Li
Haowei Li
Steve Yang
Tianze Zhang
Matthew Johnson-Roberson
Xiaonan Huang
3DV
178
0
0
08 Mar 2025
Synthetic data enables context-aware bioacoustic sound event detection
Benjamin Hoffman
David Robinson
Marius Miron
V. Baglione
D. Canestrari
...
Eva Trapote
Olivier Pietquin
M. Cusimano
Masato Hagiwara
Olivier Pietquin
272
2
0
01 Mar 2025
Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration
International Conference on Learning Representations (ICLR), 2024
Kang Liao
Zongsheng Yue
Zhouxia Wang
Chen Change Loy
258
9
0
20 Feb 2025
Audio Texture Manipulation by Exemplar-Based Analogy
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Kan Jen Cheng
Tingle Li
Gopala Anumanchipalli
DiffM
93
1
0
21 Jan 2025
Gaussian Masked Autoencoders
Jathushan Rajasegaran
Xinlei Chen
Rulilong Li
Christoph Feichtenhofer
Jitendra Malik
Shiry Ginosar
3DGS
140
2
0
06 Jan 2025
Differentiable Prompt Learning for Vision Language Models
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Zhenhan Huang
Tejaswini Pedapati
Pin-Yu Chen
Jianxi Gao
VLM
162
0
0
03 Jan 2025
SAMIC: Segment Anything with In-Context Spatial Prompt Engineering
S. Nagendra
Kashif Rashid
Chaopeng Shen
Daniel Kifer
VLM
211
4
0
16 Dec 2024
Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
Computer Vision and Pattern Recognition (CVPR), 2024
Bolin Lai
F. Xu
Miao Liu
Xiaoliang Dai
Nikhil Mehta
...
Zeyi Huang
James M. Rehg
Sangmin Lee
Ning Zhang
Tong Xiao
239
6
0
02 Dec 2024
Enhancing Parameter-Efficient Fine-Tuning of Vision Transformers through Frequency-Based Adaptation
S. Ly
Hien Nguyen
239
4
0
28 Nov 2024
LoRA of Change: Learning to Generate LoRA for the Editing Instruction from A Single Before-After Image Pair
Xue Song
Jiequan Cui
Haiqi Zhang
Jiaxin Shi
Jingjing Chen
Chi Zhang
Yu-Gang Jiang
301
1
0
28 Nov 2024
Seeing the Undefined: Chain-of-Action for Generative Semantic Labels
Meng Wei
Zhongnian Li
Peng Ying
Xinzheng Xu
VLM
201
0
0
26 Nov 2024
MICAS: Multi-grained In-Context Adaptive Sampling for 3D Point Cloud Processing
Computer Vision and Pattern Recognition (CVPR), 2024
Feifei Shao
Ping Liu
Zhao Wang
Yawei Luo
Hongwei Wang
Jun Xiao
3DPC
248
1
0
25 Nov 2024
Med-PerSAM: One-Shot Visual Prompt Tuning for Personalized Segment Anything Model in Medical Domain
Hangyul Yoon
Doohyuk Jang
JungEun Kim
Eunho Yang
VLM
MedIm
174
1
0
25 Nov 2024
There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks
Miguel Espinosa
Chenhongyi Yang
Linus Ericsson
Jingyu Sun
Elliot J. Crowley
VLM
190
3
0
22 Nov 2024
LaVin-DiT: Large Vision Diffusion Transformer
Computer Vision and Pattern Recognition (CVPR), 2024
Zhaoqing Wang
Xiaobo Xia
Runnan Chen
Dongdong Yu
Changhu Wang
Mingming Gong
Tongliang Liu
417
15
0
18 Nov 2024
1
2
3
4
Next