ResearchTrend.AI

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017
João Carreira
Andrew Zisserman
arXiv: 1705.07750

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 3,647 papers shown
Class Semantics-based Attention for Action Detection
Deepak Sridhar
N. Quader
S. Muralidharan
Yaoxin Li
Peng Dai
Juwei Lu
68
67
0
06 Sep 2021
Temporal Shift Reinforcement Learning
Deep Thomas
Tichakorn Wongpiromsarn
Ali Jannesari
OffRL
55
0
0
05 Sep 2021
Efficient Action Recognition Using Confidence Distillation
Shervin Manzuri Shalmani
Fei Chiang
Ronghuo Zheng
114
6
0
05 Sep 2021
Spatiotemporal Inconsistency Learning for DeepFake Video Detection
Zhihao Gu
Yang Chen
Taiping Yao
Shouhong Ding
Jilin Li
Feiyue Huang
Lizhuang Ma
124
156
0
04 Sep 2021
Revisiting 3D ResNets for Video Recognition
Xianzhi Du
Yeqing Li
Huayu Chen
Rui Qian
Jing Li
Irwan Bello
160
17
0
03 Sep 2021
Hierarchical 3D Feature Learning for Pancreas Segmentation
Federica Proietto Salanitri
Giovanni Bellitto
Ismail Irmakci
S. Palazzo
Ulas Bagci
C. Spampinato
MedIm
44
10
0
03 Sep 2021
Video Pose Distillation for Few-Shot, Fine-Grained Sports Action Recognition
James Hong
Matthew Fisher
Michael Gharbi
Kayvon Fatahalian
3DH
109
41
0
03 Sep 2021
TrouSPI-Net: Spatio-temporal attention on parallel atrous convolutions and U-GRUs for skeletal pedestrian crossing prediction
Joseph Gesnouin
Steve Pechberti
B. Stanciulescu
Fabien Moutarde
94
24
0
02 Sep 2021
SlowFast Rolling-Unrolling LSTMs for Action Anticipation in Egocentric Videos
Nada Osman
Guglielmo Camporese
Pasquale Coscia
Lamberto Ballan
EgoV
108
21
0
02 Sep 2021
Spatio-Temporal Perturbations for Video Attribution
Zhenqiang Li
Weimin Wang
Zuoyue Li
Yifei Huang
Yoichi Sato
60
6
0
01 Sep 2021
LIGAR: Lightweight General-purpose Action Recognition
Evgeny Izutov
48
3
0
30 Aug 2021
Efficient Visual Recognition with Deep Neural Networks: A Survey on Recent Advances and New Directions
Yang Wu
Dingheng Wang
Xiaotong Lu
Fan Yang
Guoqi Li
W. Dong
Jianbo Shi
108
18
0
30 Aug 2021
Searching for Two-Stream Models in Multivariate Space for Video Recognition
Xinyu Gong
Heng Wang
Zheng Shou
Matt Feiszli
Zhangyang Wang
Zhicheng Yan
87
9
0
30 Aug 2021
Zero-shot Natural Language Video Localization
Jinwoo Nam
Daechul Ahn
Dongyeop Kang
S. Ha
Jonghyun Choi
191
43
0
29 Aug 2021
GroupFormer: Group Activity Recognition with Clustered Spatial-Temporal Transformer
Shuaicheng Li
Qianggang Cao
Lingbo Liu
Kunlin Yang
Shinan Liu
Jun Hou
Shuai Yi
ViT
99
106
0
28 Aug 2021
Learning Cross-modal Contrastive Features for Video Domain Adaptation
Donghyun Kim
Yi-Hsuan Tsai
Bingbing Zhuang
Xiang Yu
Stan Sclaroff
Kate Saenko
Manmohan Chandraker
92
73
0
26 Aug 2021
Shifted Chunk Transformer for Spatio-Temporal Representational Learning
Xuefan Zha
Wentao Zhu
Tingxun Lv
Sen Yang
Ji Liu
AI4TS
ViT
92
27
0
26 Aug 2021
A Unified Taxonomy and Multimodal Dataset for Events in Invasion Games
Henrik Biermann
Jonas Theiner
Manuel Bassek
Dominik Raabe
D. Memmert
Ralph Ewerth
64
11
0
25 Aug 2021
Spatio-Temporal Self-Attention Network for Video Saliency Prediction
Ziqiang Wang
Zhi Liu
Gongyang Li
Yang Wang
Tianhong Zhang
Lihua Xu
Jijun Wang
3DPC
106
47
0
24 Aug 2021
Support-Set Based Cross-Supervision for Video Grounding
Xinpeng Ding
N. Wang
Shiwei Zhang
De Cheng
Xiaomeng Li
Ziyuan Huang
Mingqian Tang
Xinbo Gao
86
42
0
24 Aug 2021
ParamCrop: Parametric Cubic Cropping for Video Contrastive Learning
Zhiwu Qing
Ziyuan Huang
Shiwei Zhang
Mingqian Tang
Changxin Gao
M. Ang
Ronglei Ji
Nong Sang
83
3
0
24 Aug 2021
Dynamic Network Quantization for Efficient Video Inference
Ximeng Sun
Yikang Shen
Chun-Fu Chen
A. Oliva
Rogerio Feris
Kate Saenko
93
46
0
23 Aug 2021
TACo: Token-aware Cascade Contrastive Learning for Video-Text Alignment
Jianwei Yang
Yonatan Bisk
Jianfeng Gao
123
140
0
23 Aug 2021
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition
Jiawei Chen
C. Ho
ViT
101
78
0
20 Aug 2021
Weakly-supervised Joint Anomaly Detection and Classification
Snehashis Majhi
Srijan Das
Francois Bremond
Ratnakar Dash
Pankaj K. Sa
33
20
0
20 Aug 2021
Few Shot Activity Recognition Using Variational Inference
Neeraj Kumar
Siddhansh Narang
BDL
VLM
45
5
0
20 Aug 2021
Video Relation Detection via Tracklet based Visual Transformer
Kaifeng Gao
Long Chen
Yifeng Huang
Jun Xiao
ViT
83
30
0
19 Aug 2021
Self-Supervised Video Representation Learning with Meta-Contrastive Network
Yuanze Lin
Xun Guo
Yan Lu
SSL
78
41
0
19 Aug 2021
Social Fabric: Tubelet Compositions for Video Relation Detection
Shuo Chen
Zenglin Shi
Pascal Mettes
Cees G. M. Snoek
ViT
83
21
0
18 Aug 2021
The Multi-Modal Video Reasoning and Analyzing Competition
Haoran Peng
He Huang
Li Xu
Tianjiao Li
Jing Liu
...
Yuanzhong Liu
Tao He
Fuwei Zhang
Xianbin Liu
Tao Lin
54
2
0
18 Aug 2021
Target Adaptive Context Aggregation for Video Scene Graph Generation
Yao Teng
Limin Wang
Zhifeng Li
Gangshan Wu
96
64
0
18 Aug 2021
Channel-Temporal Attention for First-Person Video Domain Adaptation
Xianyuan Liu
Shuo Zhou
Tao Lei
Haiping Lu
EgoV
44
0
0
17 Aug 2021
Group-aware Contrastive Regression for Action Quality Assessment
Xumin Yu
Yongming Rao
Wenliang Zhao
Jiwen Lu
Jie Zhou
AI4TS
85
101
0
17 Aug 2021
Look Who's Talking: Active Speaker Detection in the Wild
You Jin Kim
Hee-Soo Heo
Soyeon Choe
Soo-Whan Chung
Yoohwan Kwon
Bong-Jin Lee
Youngki Kwon
Joon Son Chung
113
21
0
17 Aug 2021
Temporal Action Segmentation with High-level Complex Activity Labels
Guodong Ding
Angela Yao
87
18
0
15 Aug 2021
Exploring Temporal Coherence for More General Video Face Forgery Detection
Yinglin Zheng
Jianmin Bao
Dong Chen
Ming Zeng
Fang Wen
CVBM
ViT
84
216
0
15 Aug 2021
Few-Shot Fine-Grained Action Recognition via Bidirectional Attention and Contrastive Meta-Learning
Jiahao Wang
Yunhong Wang
Sheng Liu
Annan Li
73
17
0
15 Aug 2021
Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization
Linjiang Huang
Liang Wang
Hongsheng Li
100
78
0
14 Aug 2021
Conditional Temporal Variational AutoEncoder for Action Video Prediction
Xiaogang Xu
Yi Wang
Liwei Wang
Bei Yu
Jiaya Jia
VGen
83
5
0
12 Aug 2021
Deep Motion Prior for Weakly-Supervised Temporal Action Localization
Meng Cao
Can Zhang
Long Chen
Mike Zheng Shou
Yuexian Zou
85
22
0
12 Aug 2021
Mounting Video Metadata on Transformer-based Language Model for Open-ended Video Question Answering
Donggeon Lee
Seongho Choi
Youwon Jang
Byoung-Tak Zhang
91
2
0
11 Aug 2021
Preventing Catastrophic Forgetting and Distribution Mismatch in Knowledge Distillation via Synthetic Data
Kuluhan Binici
N. Pham
T. Mitra
K. Leman
86
42
0
11 Aug 2021
Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization
Pilhyeon Lee
H. Byun
134
65
0
11 Aug 2021
Learning to Cut by Watching Movies
Alejandro Pardo
Fabian Caba Heilbron
Juan Carlos León Alcázar
Ali K. Thabet
Guohao Li
VGen
125
20
0
09 Aug 2021
AutoVideo: An Automated Video Action Recognition System
Daochen Zha
Zaid Pervaiz Bhat
Yi-Wei Chen
Yicheng Wang
Sirui Ding
...
Mohammad Bhat
AnmollKumar Jain
Alfredo Costilla Reyes
Na Zou
Helen Zhou
118
11
0
09 Aug 2021
Pose is all you need: The pose only group activity recognition system (POGARS)
Haritha Thilakarathne
Aiden Nibali
Zhen He
Stuart Morgan
53
28
0
09 Aug 2021
Discriminative Latent Semantic Graph for Video Captioning
Yang Bai
Junyan Wang
Yang Long
Bingzhang Hu
Yang Song
Maurice Pagnucco
Yu Guan
90
31
0
08 Aug 2021
Skeleton-Contrastive 3D Action Representation Learning
Fida Mohammad Thoker
Hazel Doughty
Cees G. M. Snoek
SSL
83
133
0
08 Aug 2021
Learning an Augmented RGB Representation with Cross-Modal Knowledge Distillation for Action Detection
Rui Dai
Srijan Das
Francois Bremond
86
40
0
08 Aug 2021
Temporal Action Localization Using Gated Recurrent Units
Hassan Keshvari Khojasteh
Hoda Mohammadzade
H. Behroozi
119
3
0
07 Aug 2021