2303.13043
Top-Down Visual Attention from Analysis by Synthesis
23 March 2023
Baifeng Shi
Trevor Darrell
Xin Eric Wang
Papers citing "Top-Down Visual Attention from Analysis by Synthesis"
21 / 21 papers shown
Title
Scaling Vision Pre-Training to 4K Resolution
Baifeng Shi
Boyi Li
Han Cai
Y. Lu
Sifei Liu
...
Jan Kautz
Song Han
Trevor Darrell
Pavlo Molchanov
Hongxu Yin
CLIP
133
0
0
25 Mar 2025
CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement
Gaifan Zhang
Yi Zhou
Danushka Bollegala
144
0
0
21 Mar 2025
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
Meng Lou
Yizhou Yu
115
1
0
27 Feb 2025
Bootstrapping Top-down Information for Self-modulating Slot Attention
Dongwon Kim
Seoyeon Kim
Suha Kwak
OCL
ObjD
32
0
0
04 Nov 2024
Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance
Jiwan Hur
Dong-Jae Lee
Gyojin Han
Jaehyun Choi
Yunho Jeon
Junmo Kim
DiffM
30
0
0
17 Oct 2024
Conditional Language Learning with Context
X. Zhang
Miao Li
Ji Wu
49
3
0
04 Jun 2024
Learning from Observer Gaze: Zero-Shot Attention Prediction Oriented by Human-Object Interaction Recognition
Yuchen Zhou
Linkai Liu
Chao Gou
32
3
0
16 May 2024
When Do We Not Need Larger Vision Models?
Baifeng Shi
Ziyang Wu
Maolin Mao
Xin Wang
Trevor Darrell
VLM
LRM
54
40
0
19 Mar 2024
CI w/o TN: Context Injection without Task Name for Procedure Planning
Xinjie Li
31
0
0
23 Feb 2024
Frozen Transformers in Language Models Are Effective Visual Encoder Layers
Ziqi Pang
Ziyang Xie
Yunze Man
Yu-xiong Wang
47
25
0
19 Oct 2023
MindGPT: Interpreting What You See with Non-invasive Brain Recordings
Jiaxuan Chen
Yu Qi
Yueming Wang
Gang Pan
35
6
0
27 Sep 2023
Masking Strategies for Background Bias Removal in Computer Vision Models
Ananthu Aniraj
C. Dantas
Dino Ienco
Diego Marcos
24
5
0
23 Aug 2023
Towards Top-Down Stereo Image Quality Assessment via Stereo Attention
Huilin Zhang
Sumei Li
Haoxiang Chang
Peiming Lin
15
1
0
08 Aug 2023
TOAST: Transfer Learning via Attention Steering
Baifeng Shi
Siyu Gai
Trevor Darrell
Xin Wang
25
9
0
24 May 2023
Unsupervised Learning of Structured Representations via Closed-Loop Transcription
Shengbang Tong
Xili Dai
Yubei Chen
Mingyang Li
Zengyi Li
Brent Yi
Yann LeCun
Y. Ma
SSL
DRL
26
7
0
30 Oct 2022
Learning Hierarchical Image Segmentation For Recognition and By Recognition
Tsung-Wei Ke
Sangwoo Mo
Stella X. Yu
VLM
29
9
0
01 Oct 2022
GroupViT: Semantic Segmentation Emerges from Text Supervision
Jiarui Xu
Shalini De Mello
Sifei Liu
Wonmin Byeon
Thomas Breuel
Jan Kautz
X. Wang
ViT
VLM
189
499
0
22 Feb 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
305
7,434
0
11 Nov 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
314
5,775
0
29 Apr 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
277
3,622
0
24 Feb 2021
Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning
Jiasen Lu
Caiming Xiong
Devi Parikh
R. Socher
85
1,442
0
06 Dec 2016