2306.00989
Cited By
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
International Conference on Machine Learning (ICML), 2023
1 June 2023
Chaitanya K. Ryali
Yuan-Ting Hu
Daniel Bolya
Chen Wei
Haoqi Fan
Po-Yao (Bernie) Huang
Vaibhav Aggarwal
Arkabandhu Chowdhury
Omid Poursaeed
Judy Hoffman
Jitendra Malik
Yanghao Li
Christoph Feichtenhofer
3DH
Papers citing
"Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles"
21 / 171 papers shown
A Simple Baseline for Efficient Hand Mesh Reconstruction
Zhishan Zhou
Shihao Zhou
Zhi Lv
Minqiang Zou
Yao Tang
Jiajun Liang
3DH
201
26
0
04 Mar 2024
VideoMAC: Video Masked Autoencoders Meet ConvNets
Gensheng Pei
Tao Chen
XiRuo Jiang
Huafeng Liu
Zeren Sun
Yazhou Yao
VGen
201
18
0
29 Feb 2024
Revisiting Feature Prediction for Learning Visual Representations from Video
Adrien Bardes
Q. Garrido
Jean Ponce
Xinlei Chen
Michael G. Rabbat
Yann LeCun
Mahmoud Assran
Nicolas Ballas
MDE
VLM
260
153
0
15 Feb 2024
Towards Privacy-Aware Sign Language Translation at Scale
Phillip Rust
Bowen Shi
Skyler Wang
Necati Cihan Camgöz
Jean Maillard
SLR
215
33
0
14 Feb 2024
Memory Consolidation Enables Long-Context Video Understanding
Ivana Balažević
Yuge Shi
Pinelopi Papalampidi
Rahma Chaabouni
Skanda Koppula
Olivier J. Hénaff
391
45
0
08 Feb 2024
Computer Vision for Primate Behavior Analysis in the Wild
Richard Vogg
Timo Lüddecke
Jonathan Henrich
Sharmita Dey
Matthias Nuske
...
Alexander Gail
Stefan Treue
H. Scherberger
Florentin Wörgötter
Alexander S. Ecker
368
13
0
29 Jan 2024
Enhancing Small Object Encoding in Deep Neural Networks: Introducing Fast&Focused-Net with Volume-wise Dot Product Layer
Tofik Ali
Partha Pratim Roy
ObjD
167
2
0
18 Jan 2024
Motion Guided Token Compression for Efficient Masked Video Modeling
Yukun Feng
Yangming Shi
Fengze Liu
Tan Yan
204
0
0
10 Jan 2024
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond
Siyuan Li
Luyuan Zhang
Zedong Wang
Di Wu
Lirong Wu
...
Jun Xia
Cheng Tan
Yang Liu
Baigui Sun
Stan Z. Li
SSL
219
26
0
31 Dec 2023
Multiscale Vision Transformers meet Bipartite Matching for efficient single-stage Action Localization
Computer Vision and Pattern Recognition (CVPR), 2023
Ioanna Ntinou
Enrique Sanchez
Georgios Tzimiropoulos
190
6
0
29 Dec 2023
A Comprehensive Study of Vision Transformers in Image Classification Tasks
Mahmoud Khalil
Ahmad Khalil
A. Ngom
ViT
181
13
0
02 Dec 2023
Window Attention is Bugged: How not to Interpolate Position Embeddings
International Conference on Learning Representations (ICLR), 2023
Daniel Bolya
Chaitanya K. Ryali
Judy Hoffman
Christoph Feichtenhofer
160
15
0
09 Nov 2023
CHAMMI: A benchmark for channel-adaptive models in microscopy imaging
Neural Information Processing Systems (NeurIPS), 2023
Zitong S. Chen
Chau Pham
Siqi Wang
Michael Doron
Nikita Moshkov
Bryan A. Plummer
Juan C. Caicedo
174
12
0
30 Oct 2023
1st Place Solution of Egocentric 3D Hand Pose Estimation Challenge 2023 Technical Report: A Concise Pipeline for Egocentric Hand Pose Reconstruction
Zhishan Zhou
Zhi Lv
Shihao Zhou
Minqiang Zou
Tong Wu
Mochen Yu
Yao Tang
Jiajun Liang
179
4
0
07 Oct 2023
RT-GAN: Recurrent Temporal GAN for Adding Lightweight Temporal Consistency to Frame-Based Domain Translation Approaches
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2023
Shawn Mathew
Saad Nadeem
Alvin C. Goh
Arie Kaufman
MedIm
271
0
0
02 Oct 2023
Frequency-Aware Masked Autoencoders for Multimodal Pretraining on Biosignals
Ran Liu
Ellen L. Zippi
Hadi Pouransari
Chris Sandino
Jingping Nie
Hanlin Goh
Erdrin Azemi
Ali Moin
229
16
0
12 Sep 2023
TurboViT: Generating Fast Vision Transformers via Generative Architecture Search
Alexander Wong
Saad Abbasi
Saeejith Nair
ViT
127
2
0
22 Aug 2023
Efficient Large-Scale Visual Representation Learning And Evaluation
Eden Dolev
A. Awad
Denisa Roberts
Zahra Ebrahimzadeh
Marcin Mejran
Vaibhav Malpani
Mahir Yavuz
250
2
0
22 May 2023
Generating images of rare concepts using pre-trained diffusion models
AAAI Conference on Artificial Intelligence (AAAI), 2023
Dvir Samuel
Rami Ben-Ari
Simon Raviv
N. Darshan
Gal Chechik
379
62
0
27 Apr 2023
On the Benefits of 3D Pose and Tracking for Human Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2023
Jathushan Rajasegaran
Georgios Pavlakos
Angjoo Kanazawa
Christoph Feichtenhofer
Jitendra Malik
342
44
0
03 Apr 2023
Self-Supervised and Interpretable Anomaly Detection using Network Transformers
IEEE Transactions on Industrial Informatics (IEEE TII), 2022
Daniel L. Marino
Chathurika S. Wickramasinghe
C. Rieger
Milos Manic
131
14
0
25 Feb 2022