Top-Down Visual Attention from Analysis by Synthesis

23 March 2023
Baifeng Shi
Trevor Darrell
Xin Eric Wang
arXiv: 2303.13043

Papers citing "Top-Down Visual Attention from Analysis by Synthesis"

21 / 21 papers shown
Scaling Vision Pre-Training to 4K Resolution
Baifeng Shi
Boyi Li
Han Cai
Y. Lu
Sifei Liu
...
Jan Kautz
Song Han
Trevor Darrell
Pavlo Molchanov
Hongxu Yin
CLIP
133
0
0
25 Mar 2025
CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement
Gaifan Zhang
Yi Zhou
Danushka Bollegala
144
0
0
21 Mar 2025
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
Meng Lou
Yizhou Yu
115
1
0
27 Feb 2025
Bootstrapping Top-down Information for Self-modulating Slot Attention
Dongwon Kim
Seoyeon Kim
Suha Kwak
OCL
ObjD
32
0
0
04 Nov 2024
Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance
Jiwan Hur
Dong-Jae Lee
Gyojin Han
Jaehyun Choi
Yunho Jeon
Junmo Kim
DiffM
30
0
0
17 Oct 2024
Conditional Language Learning with Context
X. Zhang
Miao Li
Ji Wu
49
3
0
04 Jun 2024
Learning from Observer Gaze: Zero-Shot Attention Prediction Oriented by Human-Object Interaction Recognition
Yuchen Zhou
Linkai Liu
Chao Gou
32
3
0
16 May 2024
When Do We Not Need Larger Vision Models?
Baifeng Shi
Ziyang Wu
Maolin Mao
Xin Wang
Trevor Darrell
VLM
LRM
54
40
0
19 Mar 2024
CI w/o TN: Context Injection without Task Name for Procedure Planning
Xinjie Li
31
0
0
23 Feb 2024
Frozen Transformers in Language Models Are Effective Visual Encoder Layers
Ziqi Pang
Ziyang Xie
Yunze Man
Yu-Xiong Wang
47
25
0
19 Oct 2023
MindGPT: Interpreting What You See with Non-invasive Brain Recordings
Jiaxuan Chen
Yu Qi
Yueming Wang
Gang Pan
35
6
0
27 Sep 2023
Masking Strategies for Background Bias Removal in Computer Vision Models
Ananthu Aniraj
C. Dantas
Dino Ienco
Diego Marcos
24
5
0
23 Aug 2023
Towards Top-Down Stereo Image Quality Assessment via Stereo Attention
Huilin Zhang
Sumei Li
Haoxiang Chang
Peiming Lin
15
1
0
08 Aug 2023
TOAST: Transfer Learning via Attention Steering
Baifeng Shi
Siyu Gai
Trevor Darrell
Xin Wang
25
9
0
24 May 2023
Unsupervised Learning of Structured Representations via Closed-Loop Transcription
Shengbang Tong
Xili Dai
Yubei Chen
Mingyang Li
Zengyi Li
Brent Yi
Yann LeCun
Y. Ma
SSL
DRL
26
7
0
30 Oct 2022
Learning Hierarchical Image Segmentation For Recognition and By Recognition
Tsung-Wei Ke
Sangwoo Mo
Stella X. Yu
VLM
29
9
0
01 Oct 2022
GroupViT: Semantic Segmentation Emerges from Text Supervision
Jiarui Xu
Shalini De Mello
Sifei Liu
Wonmin Byeon
Thomas Breuel
Jan Kautz
X. Wang
ViT
VLM
189
499
0
22 Feb 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
305
7,434
0
11 Nov 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
314
5,775
0
29 Apr 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
277
3,622
0
24 Feb 2021
Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning
Jiasen Lu
Caiming Xiong
Devi Parikh
R. Socher
85
1,442
0
06 Dec 2016