2209.00647
Visual Prompting via Image Inpainting
1 September 2022
Amir Bar
Yossi Gandelsman
Trevor Darrell
Amir Globerson
Alexei A. Efros
VLM
VPVLM
Papers citing "Visual Prompting via Image Inpainting"
50 / 166 papers shown
Title
Flexible visual prompts for in-context learning in computer vision
Thomas Foster
Ioana Croitoru
Robert Dorfman
Christoffer Edlund
Thomas Varsavsky
Jon Almazán
VLM
VOS
77
0
0
11 Dec 2023
From Static to Dynamic: Adapting Landmark-Aware Image Models for Facial Expression Recognition in Videos
Yin Chen
Jia Li
Shiguang Shan
Meng Wang
Richang Hong
93
35
0
09 Dec 2023
NeRFiller: Completing Scenes via Generative 3D Inpainting
Ethan Weber
Aleksander Hołyński
Varun Jampani
Saurabh Saxena
Noah Snavely
Abhishek Kar
Angjoo Kanazawa
114
32
0
07 Dec 2023
UPOCR: Towards Unified Pixel-Level OCR Interface
Dezhi Peng
Zhenhua Yang
Jiaxin Zhang
Chongyu Liu
Yongxin Shi
Kai Ding
Fengjun Guo
Lianwen Jin
129
12
0
05 Dec 2023
Towards More Unified In-context Visual Understanding
Dianmo Sheng
DongDong Chen
Zhentao Tan
Qiankun Liu
Qi Chu
Jianmin Bao
Tao Gong
Bin Liu
Shengwei Xu
Nenghai Yu
MLLM
VLM
98
10
0
05 Dec 2023
IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
Jiarui Xu
Yossi Gandelsman
Amir Bar
Jianwei Yang
Jianfeng Gao
Trevor Darrell
Xiaolong Wang
VLM
60
3
0
04 Dec 2023
Retrieval-augmented Multi-modal Chain-of-Thoughts Reasoning for Large Language Models
Bingshuai Liu
Chenyang Lyu
Zijun Min
Zhanyu Wang
Jinsong Su
Longyue Wang
LRM
108
8
0
04 Dec 2023
Regressor-Segmenter Mutual Prompt Learning for Crowd Counting
Mingyue Guo
Li Yuan
Zhaoyi Yan
Binghui Chen
Yaowei Wang
QiXiang Ye
120
6
0
04 Dec 2023
Improving In-Context Learning in Diffusion Models with Visual Context-Modulated Prompts
Tianqi Chen
Yongfei Liu
Zhendong Wang
Jianbo Yuan
Quanzeng You
Hongxia Yang
Mingyuan Zhou
VLM
79
6
0
03 Dec 2023
Sequential Modeling Enables Scalable Learning for Large Vision Models
Yutong Bai
Xinyang Geng
K. Mangalam
Amir Bar
Alan Yuille
Trevor Darrell
Jitendra Malik
Alexei A. Efros
MLLM
VLM
117
169
0
01 Dec 2023
InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation
Rongyao Fang
Shilin Yan
Zhaoyang Huang
Jingqiu Zhou
Hao Tian
Jifeng Dai
Hongsheng Li
MLLM
108
14
0
30 Nov 2023
SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation
Lingchen Meng
Shiyi Lan
Hengduo Li
Jose M. Alvarez
Zuxuan Wu
Yu-Gang Jiang
VLM
ISeg
MLLM
79
9
0
24 Nov 2023
MetaCloak: Preventing Unauthorized Subject-driven Text-to-image Diffusion-based Synthesis via Meta-learning
Yixin Liu
Chenrui Fan
Yutong Dai
Xun Chen
Pan Zhou
Lichao Sun
DiffM
113
23
0
22 Nov 2023
Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models
Yeongbin Kim
Gautam Singh
Junyeong Park
Çağlar Gülçehre
Sungjin Ahn
OCL
VLM
122
5
0
15 Nov 2023
EviPrompt: A Training-Free Evidential Prompt Generation Method for Segment Anything Model in Medical Images
Yinsong Xu
Jiaqi Tang
Aidong Men
Qingchao Chen
VLM
MedIm
100
7
0
10 Nov 2023
Instruct Me More! Random Prompting for Visual In-Context Learning
Jiahao Zhang
Bowen Wang
Liangzhi Li
Yuta Nakashima
Hajime Nagahara
VLM
79
19
0
07 Nov 2023
ExPT: Synthetic Pretraining for Few-Shot Experimental Design
Tung Nguyen
Sudhanshu Agrawal
Aditya Grover
128
17
0
30 Oct 2023
Quantifying Privacy Risks of Prompts in Visual Prompt Learning
Yixin Wu
Rui Wen
Michael Backes
Pascal Berrang
Mathias Humbert
Yun Shen
Yang Zhang
AAML
VPVLM
117
10
0
18 Oct 2023
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V
Jianwei Yang
Hao Zhang
Feng Li
Xueyan Zou
Chun-yue Li
Jianfeng Gao
MLLM
VLM
150
188
0
17 Oct 2023
Context-Aware Meta-Learning
Christopher Fifty
Dennis Duan
Ronald G. Junkins
Ehsan Amid
Jurij Leskovec
Christopher Ré
Sebastian Thrun
LRM
VLM
MLLM
88
12
0
17 Oct 2023
Unifying Image Processing as Visual Prompting Question Answering
Yihao Liu
Xiangyu Chen
Xianzheng Ma
Xintao Wang
Jiantao Zhou
Yu Qiao
Chao Dong
MLLM
100
20
0
16 Oct 2023
SAIR: Learning Semantic-aware Implicit Representation
Canyu Zhang
Xiaoguang Li
Qing Guo
Song Wang
76
4
0
13 Oct 2023
AutoVP: An Automated Visual Prompting Framework and Benchmark
Hsi-Ai Tsao
Lei Hsiung
Pin-Yu Chen
Sijia Liu
Tsung-Yi Ho
VLM
97
22
0
12 Oct 2023
InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists
Yulu Gan
Sungwoo Park
Alexander Schubert
Anthony Philippakis
Ahmed Alaa
VLM
121
25
0
30 Sep 2023
Visual In-Context Learning for Few-Shot Eczema Segmentation
Monitirtha Dey
S. K. Bhandari
Venugopal Vasudevan
39
2
0
28 Sep 2023
SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels
Henry Hengyuan Zhao
Pichao Wang
Yuyang Zhao
Hao Luo
F. Wang
Mike Zheng Shou
ViT
129
14
0
15 Sep 2023
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
Zigang Geng
Binxin Yang
Tiankai Hang
Chen Li
Shuyang Gu
...
Jianmin Bao
Zheng Zhang
Han Hu
DongDong Chen
Baining Guo
DiffM
VLM
131
107
0
07 Sep 2023
Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models
Baoshuo Kan
Teng Wang
Wenpeng Lu
Xiantong Zhen
Weili Guan
Feng Zheng
VPVLM
VLM
108
26
0
22 Aug 2023
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting
Qidong Huang
Xiaoyi Dong
DongDong Chen
Yinpeng Chen
Lu Yuan
Gang Hua
Weiming Zhang
Neng H. Yu
AAML
116
10
0
20 Aug 2023
EventBind: Learning a Unified Representation to Bind Them All for Event-based Open-world Understanding
Jiazhou Zhou
Xueye Zheng
Yuanhuiyi Lyu
Lin Wang
VLM
90
17
0
06 Aug 2023
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation
Yasheng Sun
Yifan Yang
Houwen Peng
Yifei Shen
Yuqing Yang
Hang-Rui Hu
Lili Qiu
Hideki Koike
DiffM
LM&Ro
87
39
0
02 Aug 2023
Visual Instruction Inversion: Image Editing via Visual Prompting
Thao Nguyen
Yuheng Li
Utkarsh Ojha
Yong Jae Lee
DiffM
49
24
0
26 Jul 2023
ProRes: Exploring Degradation-aware Visual Prompt for Universal Image Restoration
Jiaqi Ma
Tianheng Cheng
Guoli Wang
Qian Zhang
Xinggang Wang
Lefei Zhang
DiffM
VLM
86
48
0
23 Jun 2023
Explore In-Context Learning for 3D Point Cloud Understanding
Zhongbin Fang
Xiangtai Li
Xia Li
J. M. Buhmann
Chen Change Loy
Mengyuan Liu
3DPC
102
27
0
14 Jun 2023
Paste, Inpaint and Harmonize via Denoising: Subject-Driven Image Editing with Pre-Trained Diffusion Model
Xinyu Zhang
Jiaxian Guo
Paul D. Yoo
Yutaka Matsuo
Yusuke Iwasawa
DiffM
118
22
0
13 Jun 2023
Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding
Mu Cai
Zeyi Huang
Yuheng Li
Utkarsh Ojha
Haohan Wang
Yong Jae Lee
VLM
49
2
0
09 Jun 2023
Fine-Grained Visual Prompting
Lingfeng Yang
Yueze Wang
Xiang Li
Xinlong Wang
Jian Yang
ObjD
VLM
126
69
0
07 Jun 2023
Towards In-context Scene Understanding
Ivana Balazevic
David Steiner
Nikhil Parthasarathy
Relja Arandjelović
Olivier J. Hénaff
105
31
0
02 Jun 2023
StyleGAN knows Normal, Depth, Albedo, and More
Anand Bhattad
Daniel McKee
Derek Hoiem
David A. Forsyth
GAN
79
37
0
01 Jun 2023
Explicit Visual Prompting for Universal Foreground Segmentations
Weihuang Liu
Xi Shen
Chi-Man Pun
Xiaodong Cun
VPVLM
VLM
83
14
0
29 May 2023
Im-Promptu: In-Context Composition from Image Prompts
Bhishma Dedhia
Michael Chang
Jake C. Snell
Thomas Griffiths
N. Jha
LRM
MLLM
111
2
0
26 May 2023
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
Wen Wang
Zhe Chen
Xiaokang Chen
Jiannan Wu
Xizhou Zhu
...
Ping Luo
Tong Lu
Jie Zhou
Yu Qiao
Jifeng Dai
MLLM
VLM
123
495
0
18 May 2023
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild
Can Qin
Shu Zhen Zhang
Ning Yu
Yihao Feng
Xinyi Yang
...
Caiming Xiong
Silvio Savarese
Stefano Ermon
Yun Fu
Ran Xu
145
136
0
18 May 2023
Personalize Segment Anything Model with One Shot
Renrui Zhang
Zhengkai Jiang
Ziyu Guo
Shilin Yan
Junting Pan
Xianzheng Ma
Hao Dong
Peng Gao
Hongsheng Li
MLLM
VLM
124
219
0
04 May 2023
In-Context Learning Unlocked for Diffusion Models
Zhendong Wang
Yi Ding
Yadong Lu
Yelong Shen
Pengcheng He
Weizhu Chen
Zhangyang Wang
Mingyuan Zhou
VLM
DiffM
152
78
0
01 May 2023
Instruction-ViT: Multi-Modal Prompts for Instruction Learning in ViT
Zhe Xiao
Yuzhong Chen
Lu Zhang
Jun Yao
Zihao Wu
...
Yixuan Yuan
Dinggang Shen
Dajiang Zhu
Tianming Liu
Xi Jiang
VLM
MLLM
154
17
0
29 Apr 2023
Putting People in Their Place: Affordance-Aware Human Insertion into Scenes
Sumith Kulal
Tim Brooks
A. Aiken
Jiajun Wu
Jimei Yang
Jingwan Lu
Alexei A. Efros
Krishna Kumar Singh
DiffM
118
44
0
27 Apr 2023
Analogy-Forming Transformers for Few-Shot 3D Parsing
N. Gkanatsios
M. Singh
Zhaoyuan Fang
Shubham Tulsiani
Katerina Fragkiadaki
3DPC
3DV
74
2
0
27 Apr 2023
Segment Anything in Non-Euclidean Domains: Challenges and Opportunities
Yongcheng Jing
Xinchao Wang
Dacheng Tao
100
22
0
23 Apr 2023
What does CLIP know about a red circle? Visual prompt engineering for VLMs
Aleksandar Shtedritski
Christian Rupprecht
Andrea Vedaldi
VLM
MLLM
127
162
0
13 Apr 2023