ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.03982
  4. Cited By
SlowFast Networks for Video Recognition

SlowFast Networks for Video Recognition

10 December 2018
Christoph Feichtenhofer
Haoqi Fan
Jitendra Malik
Kaiming He
ArXivPDFHTML

Papers citing "SlowFast Networks for Video Recognition"

50 / 610 papers shown
Title
Object-based (yet Class-agnostic) Video Domain Adaptation
Object-based (yet Class-agnostic) Video Domain Adaptation
Dantong Niu
Amir Bar
Roei Herzig
Trevor Darrell
Anna Rohrbach
40
1
0
29 Nov 2023
End-to-End Temporal Action Detection with 1B Parameters Across 1000
  Frames
End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
Shuming Liu
Chen-Da Liu-Zhang
Chen Zhao
Guohao Li
38
25
0
28 Nov 2023
Towards Weakly Supervised End-to-end Learning for Long-video Action
  Recognition
Towards Weakly Supervised End-to-end Learning for Long-video Action Recognition
Jiaming Zhou
Hanjun Li
Kun-Yu Lin
Junwei Liang
29
1
0
28 Nov 2023
Modality Mixer Exploiting Complementary Information for Multi-modal
  Action Recognition
Modality Mixer Exploiting Complementary Information for Multi-modal Action Recognition
Sumin Lee
Sangmin Woo
Muhammad Adi Nugroho
Changick Kim
30
0
0
21 Nov 2023
Unearthing Common Inconsistency for Generalisable Deepfake Detection
Unearthing Common Inconsistency for Generalisable Deepfake Detection
Beilin Chu
Xuan Xu
Weike You
Linna Zhou
32
0
0
20 Nov 2023
Automated Sperm Assessment Framework and Neural Network Specialized for
  Sperm Video Recognition
Automated Sperm Assessment Framework and Neural Network Specialized for Sperm Video Recognition
T. Fujii
Hayato Nakagawa
T. Takeshima
Y. Yumura
T. Hamagami
30
3
0
10 Nov 2023
MM-VID: Advancing Video Understanding with GPT-4V(ision)
MM-VID: Advancing Video Understanding with GPT-4V(ision)
Kevin Qinghong Lin
Faisal Ahmed
Linjie Li
Chung-Ching Lin
E. Azarnasab
...
Lin Liang
Zicheng Liu
Yumao Lu
Ce Liu
Lijuan Wang
MLLM
28
63
0
30 Oct 2023
S3Aug: Segmentation, Sampling, and Shift for Action Recognition
S3Aug: Segmentation, Sampling, and Shift for Action Recognition
Taiki Sugiura
Toru Tamaki
AI4TS
29
2
0
23 Oct 2023
Boundary Discretization and Reliable Classification Network for Temporal
  Action Detection
Boundary Discretization and Reliable Classification Network for Temporal Action Detection
Zhenying Fang
Jun Yu
Richang Hong
28
0
0
10 Oct 2023
Training a Large Video Model on a Single Machine in a Day
Training a Large Video Model on a Single Machine in a Day
Yue Zhao
Philipp Krahenbuhl
VLM
34
15
0
28 Sep 2023
SlowFast Network for Continuous Sign Language Recognition
SlowFast Network for Continuous Sign Language Recognition
Junseok Ahn
Youngjoon Jang
Joon Son Chung
SLR
38
10
0
21 Sep 2023
CPR-Coach: Recognizing Composite Error Actions based on Single-class
  Training
CPR-Coach: Recognizing Composite Error Actions based on Single-class Training
Shunli Wang
Qing Yu
Shuai Wang
Dingkang Yang
Liuzhen Su
Xiao Zhao
Haopeng Kuang
Pei Zhang
Peng Zhai
Lihua Zhang
41
3
0
21 Sep 2023
SkeleTR: Towrads Skeleton-based Action Recognition in the Wild
SkeleTR: Towrads Skeleton-based Action Recognition in the Wild
Haodong Duan
Mingze Xu
Bing Shuai
Davide Modolo
Zhuowen Tu
Joseph Tighe
Alessandro Bergamo
ViT
35
1
0
20 Sep 2023
Differentiable Resolution Compression and Alignment for Efficient Video
  Classification and Retrieval
Differentiable Resolution Compression and Alignment for Efficient Video Classification and Retrieval
Rui Deng
Qian Wu
Yuke Li
Haoran Fu
26
2
0
15 Sep 2023
EgoPCA: A New Framework for Egocentric Hand-Object Interaction
  Understanding
EgoPCA: A New Framework for Egocentric Hand-Object Interaction Understanding
Yue Xu
Yong-Lu Li
Zhemin Huang
Michael Xu Liu
Cewu Lu
Yu-Wing Tai
Chi-Keung Tang
EgoV
25
9
0
05 Sep 2023
UnLoc: A Unified Framework for Video Localization Tasks
UnLoc: A Unified Framework for Video Localization Tasks
Shengjia Yan
Xuehan Xiong
Arsha Nagrani
Anurag Arnab
Zhonghao Wang
Weina Ge
David A. Ross
Cordelia Schmid
33
53
0
21 Aug 2023
MGMAE: Motion Guided Masking for Video Masked Autoencoding
MGMAE: Motion Guided Masking for Video Masked Autoencoding
Bingkun Huang
Zhiyu Zhao
Guozhen Zhang
Yu Qiao
Limin Wang
39
30
0
21 Aug 2023
Inherent Redundancy in Spiking Neural Networks
Inherent Redundancy in Spiking Neural Networks
Man Yao
J. Hu
Guangshe Zhao
Yaoyuan Wang
Ziyang Zhang
Boxing Xu
Guoqi Li
30
15
0
16 Aug 2023
ARGUS: Visualization of AI-Assisted Task Guidance in AR
ARGUS: Visualization of AI-Assisted Task Guidance in AR
Sonia Castelo
Joao Rulff
Erin McGowan
Bea Steers
Guande Wu
...
Qinghong Sun
Huy Q. Vo
J. P. Bello
M. Krone
Claudio Silva
34
18
0
11 Aug 2023
Constructing Holistic Spatio-Temporal Scene Graph for Video Semantic
  Role Labeling
Constructing Holistic Spatio-Temporal Scene Graph for Video Semantic Role Labeling
Yu Zhao
Hao Fei
Yixin Cao
Bobo Li
Meishan Zhang
Jianguo Wei
Hao Fei
Tat-Seng Chua
19
13
0
09 Aug 2023
View while Moving: Efficient Video Recognition in Long-untrimmed Videos
View while Moving: Efficient Video Recognition in Long-untrimmed Videos
Ye Tian
Meng Yang
Lanshan Zhang
Zhizhen Zhang
Yang Liu
Xiao-Zhu Xie
Xirong Que
Wendong Wang
24
7
0
09 Aug 2023
Long-Distance Gesture Recognition using Dynamic Neural Networks
Long-Distance Gesture Recognition using Dynamic Neural Networks
Shubhang Bhatnagar
S. Gopal
Narendra Ahuja
Liu Ren
34
3
0
09 Aug 2023
SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition
SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition
Tianlin Li
Zong-Yao Wu
Yao Rong
Lin Zhu
Bowei Jiang
Jin Tang
Yonghong Tian
ViT
77
18
0
08 Aug 2023
M$^3$Net: Multi-view Encoding, Matching, and Fusion for Few-shot
  Fine-grained Action Recognition
M3^33Net: Multi-view Encoding, Matching, and Fusion for Few-shot Fine-grained Action Recognition
Hao Tang
Jun Liu
Shuanglin Yan
Rui Yan
Zechao Li
Jinhui Tang
23
38
0
06 Aug 2023
A Survey on Deep Learning-based Spatio-temporal Action Detection
A Survey on Deep Learning-based Spatio-temporal Action Detection
Peng Wang
Fanwei Zeng
Yu Qian
34
5
0
03 Aug 2023
Scene Separation & Data Selection: Temporal Segmentation Algorithm for
  Real-Time Video Stream Analysis
Scene Separation & Data Selection: Temporal Segmentation Algorithm for Real-Time Video Stream Analysis
Yuelin Xin
Zihan Zhou
Yuxuan Xia
15
2
0
01 Aug 2023
AntGPT: Can Large Language Models Help Long-term Action Anticipation
  from Videos?
AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?
Qi Zhao
Shijie Wang
Ce Zhang
Changcheng Fu
Minh Quan Do
Nakul Agarwal
Kwonjoon Lee
Chen Sun
LM&Ro
56
49
0
31 Jul 2023
Robotic Vision for Human-Robot Interaction and Collaboration: A Survey
  and Systematic Review
Robotic Vision for Human-Robot Interaction and Collaboration: A Survey and Systematic Review
Nicole L. Robinson
Brendan Tidd
Dylan Campbell
Dana Kulić
Peter Corke
46
55
0
28 Jul 2023
Sample Less, Learn More: Efficient Action Recognition via Frame Feature
  Restoration
Sample Less, Learn More: Efficient Action Recognition via Frame Feature Restoration
Harry Cheng
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Mohan S. Kankanhalli
37
7
0
27 Jul 2023
Revisiting Event-based Video Frame Interpolation
Revisiting Event-based Video Frame Interpolation
Jiaben Chen
Yi‐Wen Zhu
Dongze Lian
Jiaqi Yang
Yifu Wang
Renrui Zhang
Xinhang Liu
Shenhan Qian
L. Kneip
Shenghua Gao
34
2
0
24 Jul 2023
GLSFormer: Gated - Long, Short Sequence Transformer for Step Recognition
  in Surgical Videos
GLSFormer: Gated - Long, Short Sequence Transformer for Step Recognition in Surgical Videos
Nisarg A. Shah
S. Sikder
S. Vedula
Vishal M. Patel
ViT
MedIm
19
7
0
20 Jul 2023
NTIRE 2023 Quality Assessment of Video Enhancement Challenge
NTIRE 2023 Quality Assessment of Video Enhancement Challenge
Xiaohong Liu
Xiongkuo Min
Wei Sun
Yulun Zhang
Peng Sun
...
Te Shi
Azadeh Mansouri
Hossein Motamednia
Amirhossein Bakhtiari
Ahmad Mahmoudi-Aznaveh
33
18
0
19 Jul 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
What Can Simple Arithmetic Operations Do for Temporal Modeling?
Wenhao Wu
Yuxin Song
Zhun Sun
Jingdong Wang
Chang Xu
Wanli Ouyang
40
8
0
18 Jul 2023
Atlas-Based Interpretable Age Prediction In Whole-Body MR Images
Atlas-Based Interpretable Age Prediction In Whole-Body MR Images
Sophie Starck
Yadunandan Vivekanand Kini
J. Ritter
R. Braren
Daniel Rueckert
Tamara T. Mueller
27
1
0
14 Jul 2023
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action
  Recognition
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition
Syed Talal Wasim
Muhammad Uzair Khattak
Muzammal Naseer
Salman Khan
M. Shah
Fahad Shahbaz Khan
ViT
54
19
0
13 Jul 2023
A Study on Differentiable Logic and LLMs for EPIC-KITCHENS-100
  Unsupervised Domain Adaptation Challenge for Action Recognition 2023
A Study on Differentiable Logic and LLMs for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2023
Yi Cheng
Ziwei Xu
Fen Fang
Dongyun Lin
Hehe Fan
Yongkang Wong
Ying Sun
Mohan S. Kankanhalli
29
0
0
13 Jul 2023
Make A Long Image Short: Adaptive Token Length for Vision Transformers
Make A Long Image Short: Adaptive Token Length for Vision Transformers
Yuqin Zhu
Yichen Zhu
ViT
72
17
0
05 Jul 2023
How can objects help action recognition?
How can objects help action recognition?
Xingyi Zhou
Anurag Arnab
Chen Sun
Cordelia Schmid
42
14
0
20 Jun 2023
Learning Fine-grained View-Invariant Representations from Unpaired
  Ego-Exo Videos via Temporal Alignment
Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment
Zihui Xue
Kristen Grauman
EgoV
43
31
0
08 Jun 2023
Atrial Septal Defect Detection in Children Based on Ultrasound Video
  Using Multiple Instances Learning
Atrial Septal Defect Detection in Children Based on Ultrasound Video Using Multiple Instances Learning
Yiman Liu
Qingming Huang
Xiaoxiang Han
Tongtong Liang
Zhi-fang Zhang
...
Angelos Stefanidis
Jionglong Su
Jiangang Chen
Qingli Li
Yuqi Zhang
23
7
0
06 Jun 2023
Human-Object Interaction Prediction in Videos through Gaze Following
Human-Object Interaction Prediction in Videos through Gaze Following
Zhifan Ni
Esteve Valls Mascaro
Hyemin Ahn
Dongheui Lee
30
10
0
06 Jun 2023
VR.net: A Real-world Dataset for Virtual Reality Motion Sickness
  Research
VR.net: A Real-world Dataset for Virtual Reality Motion Sickness Research
Elliott Wen
Chitralekha Gupta
P. Sasikumar
Mark Billinghurst
James P Wilmott
Emily Skow
Arindam Dey
Suranga Nanayakkara
27
11
0
06 Jun 2023
VIPriors 3: Visual Inductive Priors for Data-Efficient Deep Learning
  Challenges
VIPriors 3: Visual Inductive Priors for Data-Efficient Deep Learning Challenges
Robert-Jan Bruintjes
A. Lengyel
Marcos Baptista-Rios
O. Kayhan
Davide Zambrano
Nergis Tomen
Jan van Gemert
25
9
0
31 May 2023
TranSFormer: Slow-Fast Transformer for Machine Translation
TranSFormer: Slow-Fast Transformer for Machine Translation
Bei Li
Yi Jing
Xu Tan
Zhen Xing
Tong Xiao
Jingbo Zhu
49
7
0
26 May 2023
CVB: A Video Dataset of Cattle Visual Behaviors
CVB: A Video Dataset of Cattle Visual Behaviors
Ali Zia
Renuka Sharma
Reza Arablouei
G. Bishop-Hurley
Jody McNally
N. Bagnall
V. Rolland
Brano Kusy
L. Petersson
A. Ingham
34
2
0
26 May 2023
Deep Neural Networks in Video Human Action Recognition: A Review
Deep Neural Networks in Video Human Action Recognition: A Review
Zihan Wang
Yang Yang
Zhi Liu
Y. Zheng
59
4
0
25 May 2023
Audio-Visual Dataset and Method for Anomaly Detection in Traffic Videos
Audio-Visual Dataset and Method for Anomaly Detection in Traffic Videos
Błażej Leporowski
Arian Bakhtiarnia
Nicole Bonnici
A. Muscat
Luca Zanella
Yiming Wang
Alexandros Iosifidis
35
1
0
24 May 2023
Flexible and Inherently Comprehensible Knowledge Representation for
  Data-Efficient Learning and Trustworthy Human-Machine Teaming in
  Manufacturing Environments
Flexible and Inherently Comprehensible Knowledge Representation for Data-Efficient Learning and Trustworthy Human-Machine Teaming in Manufacturing Environments
Vedran Galetić
Alistair Nottle
30
1
0
19 May 2023
ReasonNet: End-to-End Driving with Temporal and Global Reasoning
ReasonNet: End-to-End Driving with Temporal and Global Reasoning
Hao Shao
Letian Wang
Ruobing Chen
Steven L. Waslander
Hongsheng Li
Y. Liu
LRM
41
71
0
17 May 2023
Learning Higher-order Object Interactions for Keypoint-based Video
  Understanding
Learning Higher-order Object Interactions for Keypoint-based Video Understanding
Yi Huang
Asim Kadav
Farley Lai
Deep Patel
H. Graf
20
1
0
16 May 2023
Previous
123456...111213
Next