ResearchTrend.AI
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017
João Carreira
Andrew Zisserman

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 3,647 papers shown
Title
Medical Imaging and Machine Learning
R. Shad
John P. Cunningham
Euan A. Ashley
C. Langlotz
W. Hiesinger
71
42
0
02 Mar 2021
Improved Techniques for Quantizing Deep Networks with Adaptive Bit-Widths
Ximeng Sun
Yikang Shen
Chun-Fu Chen
Naigang Wang
Bowen Pan
Kailash Gopalakrishnan
A. Oliva
Rogerio Feris
Kate Saenko
MQ
81
4
0
02 Mar 2021
Exploring Complementary Strengths of Invariant and Equivariant Representations for Few-Shot Learning
Mamshad Nayeem Rizve
Salman Khan
Fahad Shahbaz Khan
M. Shah
146
114
0
01 Mar 2021
Coarse-Fine Networks for Temporal Activity Detection in Videos
Kumara Kahatapitiya
Michael S. Ryoo
AI4TS
97
39
0
01 Mar 2021
Medical Image Segmentation with Limited Supervision: A Review of Deep Network Models
Jialin Peng
Ye Wang
VLM
110
60
0
28 Feb 2021
Predicting post-operative right ventricular failure using video-based deep learning
R. Shad
Nicolas Quach
R. Fong
P. Kasinpila
C. Bowles
...
Y. Woo
J. Teuteberg
John P. Cunningham
C. Langlotz
W. Hiesinger
53
48
0
28 Feb 2021
Natural Language Video Localization: A Revisit in Span-based Question Answering Framework
Hao Zhang
Aixin Sun
Wei Jing
Liangli Zhen
Qiufeng Wang
Rick Siow Mong Goh
226
87
0
26 Feb 2021
ACDnet: An action detection network for real-time edge computing based on flow-guided feature approximation and memory aggregation
Yu Liu
Fan Yang
D. Ginhac
94
13
0
26 Feb 2021
Application of Transfer Learning to Sign Language Recognition using an Inflated 3D Deep Convolutional Neural Network
Roman Töngi
SLR
20
10
0
25 Feb 2021
"Train one, Classify one, Teach one" -- Cross-surgery transfer learning for surgical step recognition
Daniel Neimark
Omri Bar
Maya Zohar
Gregory Hager
Dotan Asselmann
55
14
0
24 Feb 2021
ROAD: The ROad event Awareness Dataset for Autonomous Driving
Gurkirt Singh
Stephen Akrigg
Manuele Di Maio
Valentina Fontana
Reza Javanmard Alitappeh
...
Salman Khan
S. Grazioso
Andrew Bradley
G. Gironimo
Fabio Cuzzolin
73
90
0
23 Feb 2021
Phase Space Reconstruction Network for Lane Intrusion Action Recognition
Ruiwen Zhang
Zhidong Deng
Hongsen Lin
Hongchao Lu
34
0
0
22 Feb 2021
Transferable Visual Words: Exploiting the Semantics of Anatomical Patterns for Self-supervised Learning
F. Haghighi
M. Taher
Zongwei Zhou
Michael B. Gotway
Jianming Liang
MedIm
98
107
0
21 Feb 2021
Improving Action Quality Assessment using Weighted Aggregation
Shafkat Farabi
H. Himel
Fakhruddin Gazzali
Md. Bakhtiar Hasan
M. H. Kabir
M. Farazi
61
9
0
21 Feb 2021
Self-Supervised Learning via multi-Transformation Classification for Action Recognition
Duc-Quang Vu
Ngan T. H. Le
Jia-Ching Wang
SSL
70
8
0
20 Feb 2021
Vision-Aided 6G Wireless Communications: Blockage Prediction and Proactive Handoff
Gouranga Charan
Muhammad Alrabeiah
Ahmed Alkhateeb
52
136
0
18 Feb 2021
DeeperForensics Challenge 2020 on Real-World Face Forgery Detection: Methods and Results
Liming Jiang
Z. Guo
Wayne Wu
Zhaoyang Liu
Ziwei Liu
...
Feiyue Huang
Liujuan Cao
Rongrong Ji
Changlei Lu
Ganchao Tan
CVBM
76
11
0
18 Feb 2021
Learning to Recognize Actions on Objects in Egocentric Video with Attention Dictionaries
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
EgoV
65
16
0
16 Feb 2021
VA-RED$^2$: Video Adaptive Redundancy Reduction
Bowen Pan
Yikang Shen
Camilo Luciano Fosco
Chung-Ching Lin
A. Andonian
Yue Meng
Kate Saenko
A. Oliva
Rogerio Feris
84
19
0
15 Feb 2021
RMS-Net: Regression and Masking for Soccer Event Spotting
Matteo Tomei
Lorenzo Baraldi
Simone Calderara
Simone Bronzin
Rita Cucchiara
110
29
0
15 Feb 2021
Win-Fail Action Recognition
Paritosh Parmar
B. Morris
64
5
0
15 Feb 2021
Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition
Heeseung Kwon
Manjin Kim
Suha Kwak
Minsu Cho
TTA
79
42
0
14 Feb 2021
Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling
Jie Lei
Linjie Li
Luowei Zhou
Zhe Gan
Tamara L. Berg
Joey Tianyi Zhou
Jingjing Liu
CLIP
197
666
0
11 Feb 2021
AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition
Yue Meng
Yikang Shen
Chung-Ching Lin
P. Sattigeri
Leonid Karlinsky
Kate Saenko
A. Oliva
Rogerio Feris
167
63
0
10 Feb 2021
Regional Attention with Architecture-Rebuilt 3D Network for RGB-D Gesture Recognition
Benjia Zhou
Yunan Li
Jun Wan
3DH
94
28
0
10 Feb 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
448
2,080
0
09 Feb 2021
Dynamic Neural Networks: A Survey
Yizeng Han
Gao Huang
Shiji Song
Le Yang
Honghui Wang
Yulin Wang
3DH AI4TS AI4CE
151
659
0
09 Feb 2021
RANP: Resource Aware Neuron Pruning at Initialization for 3D CNNs
Zhiwei Xu
Thalaiyasingam Ajanthan
Vibhav Vineet
Leonid Sigal
110
3
0
09 Feb 2021
Privacy-preserving Cloud-based DNN Inference
Shangyu Xie
Bingyu Liu
Yuan Hong
FedML
31
6
0
07 Feb 2021
MOTS R-CNN: Cosine-margin-triplet loss for multi-object tracking
Amit Satish Unde
Renu M. Rameshan
VOT MQ
78
5
0
06 Feb 2021
Semi-Supervised Action Recognition with Temporal Contrastive Learning
Ankit Singh
Omprakash Chakraborty
Ashutosh Varshney
Yikang Shen
Rogerio Feris
Kate Saenko
Abir Das
82
99
0
04 Feb 2021
Relaxed Transformer Decoders for Direct Action Proposal Generation
Jing Tan
Jiaqi Tang
Limin Wang
Gangshan Wu
ViT
156
182
0
03 Feb 2021
GCF-Net: Gated Clip Fusion Network for Video Action Recognition
Jenhao Hsiao
Jiawei Chen
C. Ho
30
5
0
02 Feb 2021
Video Transformer Network
Daniel Neimark
Omri Bar
Maya Zohar
Dotan Asselmann
ViT
313
434
0
01 Feb 2021
Forecasting Action through Contact Representations from First Person Video
Eadom Dessalene
Chinmaya Devaraj
Michael Maynord
Cornelia Fermuller
Yiannis Aloimonos
EgoV
110
61
0
01 Feb 2021
Video Reenactment as Inductive Bias for Content-Motion Disentanglement
Juan Felipe Hernandez Albarracin
Adín Ramirez Rivera
92
3
0
30 Jan 2021
Position, Padding and Predictions: A Deeper Look at Position Information in CNNs
Md. Amirul Islam
M. Kowal
Sen Jia
Konstantinos G. Derpanis
Neil D. B. Bruce
64
58
0
28 Jan 2021
VX2TEXT: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs
Xudong Lin
Gedas Bertasius
Jue Wang
Shih-Fu Chang
Devi Parikh
Lorenzo Torresani
VGen
102
67
0
28 Jan 2021
NTU-X: An Enhanced Large-scale Dataset for Improving Pose-based Recognition of Subtle Human Actions
Neel Trivedi
Anirudh Thatipelli
Ravi Kiran Sarvadevabhatla
99
19
0
27 Jan 2021
A Case Study of Deep Learning Based Multi-Modal Methods for Predicting the Age-Suitability Rating of Movie Trailers
Mahsa Shafaei
C. Smailis
I. Kakadiaris
Thamar Solorio
408
1
0
26 Jan 2021
ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning
Sangho Lee
Jiwan Chung
Youngjae Yu
Gunhee Kim
Thomas Breuel
Gal Chechik
Yale Song
177
47
0
26 Jan 2021
Generic Event Boundary Detection: A Benchmark for Event Segmentation
Mike Zheng Shou
Stan Weixian Lei
Weiyao Wang
Deepti Ghadiyaram
Matt Feiszli
VOS
169
78
0
26 Jan 2021
RGB-D Salient Object Detection via 3D Convolutional Neural Networks
Qian Chen
Ze Liu
Y. Zhang
Keren Fu
Qijun Zhao
H. Du
3DPC
85
155
0
25 Jan 2021
Weakly-supervised Video Anomaly Detection with Robust Temporal Feature Magnitude Learning
Yu Tian
Guansong Pang
Yuanhong Chen
Rajvinder Singh
Johan Verjans
G. Carneiro
AI4TS
115
314
0
25 Jan 2021
Parametric Rectified Power Sigmoid Units: Learning Nonlinear Neural Transfer Analytical Forms
A. Atto
S. Galichet
Dominique Pastor
N. Méger
19
0
0
25 Jan 2021
Weakly Supervised Learning for Facial Behavior Analysis : A Review
G. Praveen
Eric Granger
Patrick Cardinal
CVBM
76
6
0
25 Jan 2021
A Closer Look at Temporal Sentence Grounding in Videos: Dataset and Metric
Yitian Yuan
Xiaohan Lan
Xin Wang
Long Chen
Zhi Wang
Wenwu Zhu
76
54
0
22 Jan 2021
Bridging the gap between Human Action Recognition and Online Action Detection
Alban Main De Boissiere
R. Noumeir
99
0
0
21 Jan 2021
Hierarchical Graph-RNNs for Action Detection of Multiple Activities
Sovan Biswas
Yaser Souri
Juergen Gall
126
2
0
21 Jan 2021
Discovering Multi-Label Actor-Action Association in a Weakly Supervised Setting
Sovan Biswas
Juergen Gall
62
2
0
21 Jan 2021