2209.00647
Cited By
Visual Prompting via Image Inpainting
Neural Information Processing Systems (NeurIPS), 2022
1 September 2022
Amir Bar
Yossi Gandelsman
Trevor Darrell
Amir Globerson
Alexei A. Efros
VLM
VPVLM
Papers citing "Visual Prompting via Image Inpainting"
50 / 180 papers shown
AllRestorer: All-in-One Transformer for Image Restoration under Composite Degradations
J. Mao
Yue Yang
Xuesong Yin
Ling Shao
Hao Tang
144
1
0
16 Nov 2024
All-in-one Weather-degraded Image Restoration via Adaptive Degradation-aware Self-prompting Model
IEEE transactions on multimedia (IEEE TMM), 2024
Yuanbo Wen
Tao Gao
Ziqi Li
Jing Zhang
Kaihao Zhang
Ting Chen
VLM
DiffM
185
8
0
12 Nov 2024
Autoregressive Models in Vision: A Survey
Jing Xiong
Gongye Liu
Lun Huang
Chengyue Wu
Taiqiang Wu
...
Hao Fei
Guillermo Sapiro
Jiebo Luo
Ping Luo
Ngai Wong
VGen
354
31
0
08 Nov 2024
ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Ashutosh Srivastava
Tarun Ram Menta
Abhinav Java
Avadhoot Jadhav
Silky Singh
Surgan Jandial
Balaji Krishnamurthy
DiffM
154
2
0
06 Nov 2024
A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Junjun Jiang
Zengyuan Zuo
Gang Wu
Kui Jiang
Xianming Liu
417
23
0
19 Oct 2024
A Simple Image Segmentation Framework via In-Context Examples
Neural Information Processing Systems (NeurIPS), 2024
Yang Liu
Chenchen Jing
Hengtao Li
Huanyi Zheng
Hao Chen
Xinlong Wang
Chunhua Shen
121
11
0
07 Oct 2024
Perceive-IR: Learning to Perceive Degradation Better for All-in-One Image Restoration
IEEE Transactions on Image Processing (TIP), 2024
Xu Zhang
Jiaqi Ma
Guoli Wang
Qian Zhang
Huan Zhang
Lefei Zhang
VLM
352
24
0
28 Aug 2024
Image Segmentation in Foundation Model Era: A Survey
Tianfei Zhou
Fei Zhang
Boyu Chang
Wenguan Wang
Ye Yuan
E. Konukoglu
Daniel Cremers
VLM
247
23
0
23 Aug 2024
Learning A Low-Level Vision Generalist via Visual Task Prompt
ACM Multimedia (MM), 2024
Xiangyu Chen
Yihao Liu
Yuandong Pu
Wenlong Zhang
Jiantao Zhou
Yu Qiao
Chao Dong
VLM
182
11
0
16 Aug 2024
Cropper: Vision-Language Model for Image Cropping through In-Context Learning
Computer Vision and Pattern Recognition (CVPR), 2024
Seung Hyun Lee
Junjie Ke
Yinxiao Li
Junfeng He
Steven Hickson
...
Sangpil Kim
Ming-Hsuan Yang
Irfan Essa
Feng Yang
VLM
170
3
0
14 Aug 2024
AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description
Junyu Xie
Tengda Han
Max Bain
Arsha Nagrani
Gül Varol
Weidi Xie
Andrew Zisserman
VGen
130
17
0
22 Jul 2024
Text2Place: Affordance-aware Text Guided Human Placement
Rishubh Parihar
Harsh Gupta
VS Sachidanand
R. V. Babu
DiffM
146
8
0
22 Jul 2024
EarthMarker: Visual Prompt Learning for Region-level and Point-level Remote Sensing Imagery Comprehension
Wei Zhang
Miaoxin Cai
Tong Zhang
Jun Li
Zhuang Yin
Xuerui Mao
270
29
0
18 Jul 2024
GPT Sonograpy: Hand Gesture Decoding from Forearm Ultrasound Images via VLM
Keshav Bimbraw
Ye Wang
Jing Liu
T. Koike-Akino
VLM
MedIm
LM&MA
144
4
0
15 Jul 2024
Visual Prompt Selection for In-Context Learning Segmentation
Wei Suo
Lanqing Lai
Mengyang Sun
Hanwang Zhang
Peng Wang
Yanning Zhang
VLM
160
10
0
14 Jul 2024
DG-PIC: Domain Generalized Point-In-Context Learning for Point Cloud Understanding
Jincen Jiang
Qianyu Zhou
Yuhang Li
Xuequan Lu
Meili Wang
Lizhuang Ma
Jian Chang
Jian Jun Zhang
OOD
147
15
0
11 Jul 2024
Video In-context Learning: Autoregressive Transformers are Zero-Shot Video Imitators
Wentao Zhang
Junliang Guo
Tianyu He
Li Zhao
Linli Xu
Jiang Bian
235
7
0
10 Jul 2024
Toward a Diffusion-Based Generalist for Dense Vision Tasks
Yue Fan
Yongqin Xian
Xiaohua Zhai
Alexander Kolesnikov
Muhammad Ferjad Naeem
Bernt Schiele
Federico Tombari
VLM
MDE
DiffM
96
2
0
29 Jun 2024
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding
Tao Zhang
Xiangtai Li
Hao Fei
Haobo Yuan
Shengqiong Wu
Shunping Ji
Chen Change Loy
Shuicheng Yan
LRM
MLLM
VLM
211
112
0
27 Jun 2024
ConStyle v2: A Strong Prompter for All-in-One Image Restoration
Dongqi Fan
Junhao Zhang
Liang Chang
VLM
131
2
0
26 Jun 2024
In-Context Symmetries: Self-Supervised Learning through Contextual World Models
Sharut Gupta
Chenyu Wang
Yifei Wang
Tommi Jaakkola
Stefanie Jegelka
147
5
0
28 May 2024
ARC: A Generalist Graph Anomaly Detector with In-Context Learning
Yixin Liu
Shiyuan Li
Yu Zheng
Qingfeng Chen
Chengqi Zhang
Shirui Pan
138
27
0
27 May 2024
Unsupervised Meta-Learning via In-Context Learning
Anna Vettoruzzo
Lorenzo Braccaioli
Joaquin Vanschoren
M. Nowaczyk
SSL
197
3
0
25 May 2024
Towards Global Optimal Visual In-Context Learning Prompt Selection
Chengming Xu
Chen Liu
Yikai Wang
Yanwei Fu
81
11
0
24 May 2024
Are You Copying My Prompt? Protecting the Copyright of Vision Prompt for VPaaS via Watermark
Huali Ren
Anli Yan
Chong-zhi Gao
Hongyang Yan
Zhenxin Zhang
Jin Li
VLM
AAML
137
6
0
24 May 2024
Adapting Large Multimodal Models to Distribution Shifts: The Role of In-Context Learning
Guanglin Zhou
Zhongyi Han
Shiming Chen
Erdun Gao
Liming Zhu
Salman Khan
Xin Gao
Lina Yao
VLM
174
6
0
20 May 2024
Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers
Shengyuan Yang
Jiawang Bai
Kuofeng Gao
Yong-Liang Yang
Yiming Li
Shu-Tao Xia
AAML
SILM
174
5
0
17 May 2024
Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model
ACM Transactions on Graphics (TOG), 2024
Zheng Gu
Shiyuan Yang
Jing Liao
Jing Huo
Yang Gao
VLM
DiffM
122
13
0
16 May 2024
DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
Jiaxin Zhang
Dezhi Peng
Chongyu Liu
Peirong Zhang
Lianwen Jin
VLM
125
22
0
07 May 2024
Customizing Text-to-Image Models with a Single Image Pair
ACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH Asia), 2024
Maxwell Jones
Sheng-Yu Wang
Nupur Kumari
David Bau
Jun-Yan Zhu
DiffM
198
31
0
02 May 2024
DesignProbe: A Graphic Design Benchmark for Multimodal Large Language Models
Jieru Lin
Danqing Huang
Tiejun Zhao
Dechen Zhan
Chin-Yew Lin
VLM
MLLM
120
4
0
23 Apr 2024
In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation
Han Xue
Qianru Sun
Li Song
Wenjun Zhang
Zhiwu Huang
MLLM
101
0
0
15 Apr 2024
PM2: A New Prompting Multi-modal Model Paradigm for Few-shot Medical Image Classification
Zhenwei Wang
Qiule Sun
Bingbing Zhang
Pengfei Wang
Jianxin Zhang
Qiang Zhang
VLM
175
4
0
13 Apr 2024
Finding Visual Task Vectors
Alberto Hojel
Yutong Bai
Trevor Darrell
Amir Globerson
Amir Bar
168
12
0
08 Apr 2024
Many-to-many Image Generation with Auto-regressive Diffusion Models
Ying Shen
Yizhe Zhang
Shuangfei Zhai
Lifu Huang
J. Susskind
Jiatao Gu
202
6
0
03 Apr 2024
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction
Neural Information Processing Systems (NeurIPS), 2024
Keyu Tian
Yi Jiang
Zehuan Yuan
Bingyue Peng
Liwei Wang
VGen
292
603
0
03 Apr 2024
Roadside Monocular 3D Detection Prompted by 2D Detection
Yechi Ma
Shuoquan Wei
Churun Zhang
Wei Hua
223
0
0
01 Apr 2024
InstructBrush: Learning Attention-based Instruction Optimization for Image Editing
Ruoyu Zhao
Qingnan Fan
Fei Kou
Shuai Qin
Hong Gu
Wei Wu
Pengcheng Xu
Mingrui Zhu
Nannan Wang
Xinbo Gao
121
8
0
27 Mar 2024
Middle Fusion and Multi-Stage, Multi-Form Prompts for Robust RGB-T Tracking
Qiming Wang
Yongqiang Bai
Hongxing Song
170
11
0
27 Mar 2024
In-Context Matting
Computer Vision and Pattern Recognition (CVPR), 2024
He Guo
Zixuan Ye
Zhiguo Cao
Hao Lu
VOS
108
5
0
23 Mar 2024
Few-Shot Adversarial Prompt Learning on Vision-Language Models
Yiwei Zhou
Xiaobo Xia
Zhiwei Lin
Bo Han
Tongliang Liu
VLM
162
27
0
21 Mar 2024
VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning
International Conference on Learning Representations (ICLR), 2024
Yongshuo Zong
Ondrej Bohdal
Timothy M. Hospedales
204
14
0
19 Mar 2024
OSTAF: A One-Shot Tuning Method for Improved Attribute-Focused T2I Personalization
Ye Wang
Zili Yi
Rui Ma
DiffM
124
0
0
17 Mar 2024
Borrowing Treasures from Neighbors: In-Context Learning for Multimodal Learning with Missing Modalities and Data Scarcity
Zhuo Zhi
Ziquan Liu
M. Elbadawi
Adam Daneshmend
Mine Orlu
Abdul Basit
Andreas Demosthenous
Miguel R. D. Rodrigues
208
4
0
14 Mar 2024
Explore In-Context Segmentation via Latent Diffusion Models
AAAI Conference on Artificial Intelligence (AAAI), 2024
Chaoyang Wang
Xiangtai Li
Henghui Ding
Lu Qi
Jiangning Zhang
Yunhai Tong
Chen Change Loy
Shuicheng Yan
DiffM
235
11
0
14 Mar 2024
Toward Generalist Anomaly Detection via In-context Residual Learning with Few-shot Sample Prompts
Computer Vision and Pattern Recognition (CVPR), 2024
Jiawen Zhu
Guansong Pang
VLM
232
73
0
11 Mar 2024
InstructGIE: Towards Generalizable Image Editing
European Conference on Computer Vision (ECCV), 2024
Zichong Meng
Changdi Yang
Jun Liu
Hao Tang
Pu Zhao
Yanzhi Wang
DiffM
148
12
0
08 Mar 2024
Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training
David Wan
Jaemin Cho
Elias Stengel-Eskin
Mohit Bansal
VLM
ObjD
219
48
0
04 Mar 2024
Grounding Language Models for Visual Entity Recognition
Zilin Xiao
Ming Gong
Paola Cascante-Bonilla
Xingyao Zhang
Jie Wu
Vicente Ordonez
VLM
166
13
0
28 Feb 2024
Video as the New Language for Real-World Decision Making
Sherry Yang
Jacob Walker
Jack Parker-Holder
Yilun Du
Jake Bruce
Andre Barreto
Pieter Abbeel
Dale Schuurmans
VGen
195
74
0
27 Feb 2024