ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.07750
  4. Cited By
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
v1v2v3 (latest)

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017
João Carreira
Andrew Zisserman
ArXiv (abs)PDFHTML

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 3,647 papers shown
Title
PressureVision: Estimating Hand Pressure from a Single RGB Image
PressureVision: Estimating Hand Pressure from a Single RGB Image
Patrick Grady
Chengcheng Tang
Samarth Brahmbhatt
Christopher D. Twigg
Chengde Wan
James Hays
Charles C. Kemp
3DH
87
20
0
19 Mar 2022
DirecFormer: A Directed Attention in Transformer Approach to Robust
  Action Recognition
DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition
Thanh-Dat Truong
Quoc-Huy Bui
C. Duong
Han-Seok Seo
Son Lam Phung
Xin Li
Khoa Luu
ViT
135
51
0
19 Mar 2022
RoVISQ: Reduction of Video Service Quality via Adversarial Attacks on
  Deep Learning-based Video Compression
RoVISQ: Reduction of Video Service Quality via Adversarial Attacks on Deep Learning-based Video Compression
Jung-Woo Chang
Mojan Javaheripi
Seira Hidano
F. Koushanfar
98
8
0
18 Mar 2022
Multi-input segmentation of damaged brain in acute ischemic stroke
  patients using slow fusion with skip connection
Multi-input segmentation of damaged brain in acute ischemic stroke patients using slow fusion with skip connection
Luca Tomasetti
M. Khanmohammadi
K. Engan
Liv Jorunn Høllesli
K. D. Kurz
52
5
0
18 Mar 2022
Local-Global Context Aware Transformer for Language-Guided Video
  Segmentation
Local-Global Context Aware Transformer for Language-Guided Video Segmentation
Chen Liang
Wenguan Wang
Tianfei Zhou
Jiaxu Miao
Yawei Luo
Yi Yang
VOS
100
79
0
18 Mar 2022
Group Contextualization for Video Recognition
Group Contextualization for Video Recognition
Y. Hao
Haotong Zhang
Chong-Wah Ngo
Xiangnan He
62
27
0
18 Mar 2022
FERV39k: A Large-Scale Multi-Scene Dataset for Facial Expression
  Recognition in Videos
FERV39k: A Large-Scale Multi-Scene Dataset for Facial Expression Recognition in Videos
Yan Wang
Yixuan Sun
Yiwen Huang
Zhongying Liu
Shuyong Gao
Wei Zhang
Weifeng Ge
Wenqiang Zhang
83
89
0
17 Mar 2022
Look Outside the Room: Synthesizing A Consistent Long-Term 3D Scene
  Video from A Single Image
Look Outside the Room: Synthesizing A Consistent Long-Term 3D Scene Video from A Single Image
Xuanchi Ren
Xiaolong Wang
VGen
108
58
0
17 Mar 2022
ABN: Agent-Aware Boundary Networks for Temporal Action Proposal
  Generation
ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation
Khoa T. Vo
Kashu Yamazaki
Sang Truong
M. Tran
Akihiro Sugimoto
Ngan Le
EgoV
71
9
0
16 Mar 2022
Gate-Shift-Fuse for Video Action Recognition
Gate-Shift-Fuse for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
90
24
0
16 Mar 2022
Know your sensORs -- A Modality Study For Surgical Action Classification
Know your sensORs -- A Modality Study For Surgical Action Classification
Lennart Bastian
Tobias Czempiel
C. Heiliger
K. Karcz
U. Eck
Benjamin Busam
Nassir Navab
83
5
0
16 Mar 2022
On the Pitfalls of Batch Normalization for End-to-End Video Learning: A
  Study on Surgical Workflow Analysis
On the Pitfalls of Batch Normalization for End-to-End Video Learning: A Study on Surgical Workflow Analysis
Dominik Rivoir
Isabel Funke
Stefanie Speidel
108
19
0
15 Mar 2022
All in One: Exploring Unified Video-Language Pre-training
All in One: Exploring Unified Video-Language Pre-training
Alex Jinpeng Wang
Yixiao Ge
Rui Yan
Yuying Ge
Xudong Lin
Guanyu Cai
Jianping Wu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
131
202
0
14 Mar 2022
RCL: Recurrent Continuous Localization for Temporal Action Detection
RCL: Recurrent Continuous Localization for Temporal Action Detection
Qiang Wang
Yanhao Zhang
Yun Zheng
Pan Pan
ObjD
77
38
0
14 Mar 2022
Active Learning by Feature Mixing
Active Learning by Feature Mixing
Amin Parvaneh
Ehsan Abbasnejad
Damien Teney
Reza Haffari
Anton Van Den Hengel
Javen Qinfeng Shi
81
94
0
14 Mar 2022
Decontextualized I3D ConvNet for ultra-distance runners performance
  analysis at a glance
Decontextualized I3D ConvNet for ultra-distance runners performance analysis at a glance
David Freire-Obregón
J. Lorenzo-Navarro
Modesto Castrillón-Santana
43
5
0
13 Mar 2022
Towards Visual-Prompt Temporal Answering Grounding in Medical
  Instructional Video
Towards Visual-Prompt Temporal Answering Grounding in Medical Instructional Video
Bin Li
Yixuan Weng
Bin Sun
Shutao Li
167
33
0
13 Mar 2022
WLASL-LEX: a Dataset for Recognising Phonological Properties in American
  Sign Language
WLASL-LEX: a Dataset for Recognising Phonological Properties in American Sign Language
Federico Tavella
Viktor Schlegel
Marta Romeo
Aphrodite Galata
Angelo Cangelosi
92
10
0
11 Mar 2022
TFCNet: Temporal Fully Connected Networks for Static Unbiased Temporal
  Reasoning
TFCNet: Temporal Fully Connected Networks for Static Unbiased Temporal Reasoning
Shiwen Zhang
AI4TS
96
12
0
11 Mar 2022
BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for
  Conversational Gestures Synthesis
BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis
Haiyang Liu
Zihao Zhu
Naoya Iwamoto
Yichen Peng
Zhengqing Li
You Zhou
E. Bozkurt
Bo Zheng
SLRCVBM
129
144
0
10 Mar 2022
A Closer Look at Debiased Temporal Sentence Grounding in Videos:
  Dataset, Metric, and Approach
A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach
Xiaohan Lan
Yitian Yuan
Xin Eric Wang
Long Chen
Zhi Wang
Lin Ma
Wenwu Zhu
CML
72
16
0
10 Mar 2022
End-to-End Semantic Video Transformer for Zero-Shot Action Recognition
End-to-End Semantic Video Transformer for Zero-Shot Action Recognition
Keval Doshi
Yasin Yılmaz
ViT
88
2
0
10 Mar 2022
OpenTAL: Towards Open Set Temporal Action Localization
OpenTAL: Towards Open Set Temporal Action Localization
Wentao Bao
Qi Yu
Yu Kong
EDL
74
29
0
10 Mar 2022
Do better ImageNet classifiers assess perceptual similarity better?
Do better ImageNet classifiers assess perceptual similarity better?
Manoj Kumar
N. Houlsby
Nal Kalchbrenner
E. D. Cubuk
113
34
0
09 Mar 2022
Human Gaze Guided Attention for Surgical Activity Recognition
Human Gaze Guided Attention for Surgical Activity Recognition
Abdishakour Awale
Duygu Sarikaya
66
0
0
09 Mar 2022
Source-free Video Domain Adaptation by Learning Temporal Consistency for
  Action Recognition
Source-free Video Domain Adaptation by Learning Temporal Consistency for Action Recognition
Yuecong Xu
Jianfei Yang
Haozhi Cao
Keyu Wu
Min-man Wu
Zhenghua Chen
TTA
97
33
0
09 Mar 2022
A Simple Multi-Modality Transfer Learning Baseline for Sign Language
  Translation
A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation
Yutong Chen
Fangyun Wei
Xiao Sun
Zhirong Wu
Stephen Lin
SLR
88
104
0
08 Mar 2022
End-to-End Semi-Supervised Learning for Video Action Detection
End-to-End Semi-Supervised Learning for Video Action Detection
Akash Kumar
Yogesh S Rawat
77
32
0
08 Mar 2022
Gait Recognition with Mask-based Regularization
Gait Recognition with Mask-based Regularization
Chuanfu Shen
Beibei Lin
Shunli Zhang
George Q. Huang
Shiqi Yu
Xin-cen Yu
CVBM
169
19
0
08 Mar 2022
Universal Prototype Transport for Zero-Shot Action Recognition and
  Localization
Universal Prototype Transport for Zero-Shot Action Recognition and Localization
Pascal Mettes
99
5
0
08 Mar 2022
Live Laparoscopic Video Retrieval with Compressed Uncertainty
Live Laparoscopic Video Retrieval with Compressed Uncertainty
Tong Yu
Pietro Mascagni
J. Verde
J. Marescaux
Didier Mutter
N. Padoy
84
7
0
08 Mar 2022
PAMI-AD: An Activity Detector Exploiting Part-attention and Motion
  Information in Surveillance Videos
PAMI-AD: An Activity Detector Exploiting Part-attention and Motion Information in Surveillance Videos
Yunhao Du
Zhihang Tong
Jun-Jun Wan
Binyu Zhang
Yanyun Zhao
74
3
0
08 Mar 2022
Parallel Training of GRU Networks with a Multi-Grid Solver for Long
  Sequences
Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences
G. Moon
E. Cyr
59
5
0
07 Mar 2022
Behavior Recognition Based on the Integration of Multigranular Motion
  Features
Behavior Recognition Based on the Integration of Multigranular Motion Features
Lizong Zhang
Yiming Wang
Bei Hui
Xiu Zhang
Sijuan Liu
Shuxin Feng
34
0
0
07 Mar 2022
Learnable Irrelevant Modality Dropout for Multimodal Action Recognition
  on Modality-Specific Annotated Videos
Learnable Irrelevant Modality Dropout for Multimodal Action Recognition on Modality-Specific Annotated Videos
Saghir Alfasly
Jian Lu
C. Xu
Yuru Zou
103
20
0
06 Mar 2022
Exploring Optical-Flow-Guided Motion and Detection-Based Appearance for
  Temporal Sentence Grounding
Exploring Optical-Flow-Guided Motion and Detection-Based Appearance for Temporal Sentence Grounding
Daizong Liu
Xiang Fang
Wei Hu
Pan Zhou
102
37
0
06 Mar 2022
Weakly Supervised Temporal Action Localization via Representative
  Snippet Knowledge Propagation
Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation
Linjiang Huang
Liang Wang
Hongsheng Li
AI4TS
137
69
0
06 Mar 2022
Machine Learning Applications in Lung Cancer Diagnosis, Treatment and
  Prognosis
Machine Learning Applications in Lung Cancer Diagnosis, Treatment and Prognosis
Yawei Li
Xin Wu
P. Yang
Guoqian Jiang
Yuan Luo
AI4CE
94
2
0
05 Mar 2022
Audio-visual speech separation based on joint feature representation
  with cross-modal attention
Audio-visual speech separation based on joint feature representation with cross-modal attention
Jun Xiong
Peng Zhang
Lei Xie
Wei Huang
Yufei Zha
Yanni Zhang
50
3
0
05 Mar 2022
A Large-scale Comprehensive Dataset and Copy-overlap Aware Evaluation
  Protocol for Segment-level Video Copy Detection
A Large-scale Comprehensive Dataset and Copy-overlap Aware Evaluation Protocol for Segment-level Video Copy Detection
Sifeng He
Xudong Yang
Chenhan Jiang
Gang Liang
Wei Zhang
...
Kaiming Huang
Yuan Cheng
Feng Qian
Xiaobo Zhang
Lei Yang
60
12
0
05 Mar 2022
HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object
  Interaction
HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction
Yunze Liu
Yun-Hai Liu
Chen Jiang
Kangbo Lyu
Weikang Wan
Hao Shen
Bo-Hua Liang
Zhoujie Fu
He Wang
Li Yi
132
188
0
03 Mar 2022
SegTAD: Precise Temporal Action Detection via Semantic Segmentation
SegTAD: Precise Temporal Action Detection via Semantic Segmentation
Chen Zhao
Merey Ramazanova
Mengmeng Xu
Guohao Li
43
7
0
03 Mar 2022
Audio Self-supervised Learning: A Survey
Audio Self-supervised Learning: A Survey
Shuo Liu
Adria Mallol-Ragolta
Emilia Parada-Cabeleiro
Kun Qian
Xingshuo Jing
Alexander Kathan
Bin Hu
Bjoern W. Schuller
SSL
104
109
0
02 Mar 2022
Colar: Effective and Efficient Online Action Detection by Consulting
  Exemplars
Colar: Effective and Efficient Online Action Detection by Consulting Exemplars
Le Yang
Junwei Han
Dingwen Zhang
90
38
0
02 Mar 2022
TransDARC: Transformer-based Driver Activity Recognition with Latent
  Space Feature Calibration
TransDARC: Transformer-based Driver Activity Recognition with Latent Space Feature Calibration
Kunyu Peng
Alina Roitberg
Kailun Yang
Jiaming Zhang
Rainer Stiefelhagen
ViT
85
34
0
02 Mar 2022
Temporal Perceiver: A General Architecture for Arbitrary Boundary
  Detection
Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
Jing Tan
Yuhong Wang
Gangshan Wu
Limin Wang
105
15
0
01 Mar 2022
Concept Graph Neural Networks for Surgical Video Understanding
Concept Graph Neural Networks for Surgical Video Understanding
Yutong Ban
J. Eckhoff
Thomas M. Ward
Daniel A. Hashimoto
O. Meireles
Daniela Rus
Guy Rosman
NAI
90
18
0
27 Feb 2022
Continuous Human Action Recognition for Human-Machine Interaction: A
  Review
Continuous Human Action Recognition for Human-Machine Interaction: A Review
Harshala Gammulle
David Ahmedt-Aristizabal
Akila Pemasiri
Lachlan Tychsen-Smith
L. Petersson
Clinton Fookes
124
28
0
26 Feb 2022
On Modality Bias Recognition and Reduction
On Modality Bias Recognition and Reduction
Yangyang Guo
Liqiang Nie
Harry Cheng
Zhiyong Cheng
Mohan S. Kankanhalli
A. Bimbo
80
28
0
25 Feb 2022
Motion-driven Visual Tempo Learning for Video-based Action Recognition
Motion-driven Visual Tempo Learning for Video-based Action Recognition
Yuanzhong Liu
Junsong Yuan
Zhigang Tu
90
61
0
24 Feb 2022
Previous
123...373839...717273
Next