Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2201.08383
Cited By
MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
20 January 2022
Chao-Yuan Wu
Yanghao Li
K. Mangalam
Haoqi Fan
Bo Xiong
Jitendra Malik
Christoph Feichtenhofer
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition"
5 / 55 papers shown
Title
Video Transformer Network
Daniel Neimark
Omri Bar
Maya Zohar
Dotan Asselmann
ViT
204
422
0
01 Feb 2021
Memory Enhanced Global-Local Aggregation for Video Object Detection
Yihong Chen
Yue Cao
Han Hu
Liwei Wang
112
261
0
26 Mar 2020
Equalization Loss for Long-Tailed Object Recognition
Jingru Tan
Changbao Wang
Buyu Li
Quanquan Li
Wanli Ouyang
Changqing Yin
Junjie Yan
260
457
0
11 Mar 2020
ECO: Efficient Convolutional Network for Online Video Understanding
Mohammadreza Zolfaghari
Kamaljeet Singh
Thomas Brox
130
496
0
24 Apr 2018
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
297
10,220
0
16 Nov 2016
Previous
1
2