ResearchTrend.AI
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017
João Carreira
Andrew Zisserman

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 1,478 papers shown
Generic Event Boundary Detection: A Benchmark for Event Segmentation
Mike Zheng Shou
Stan Weixian Lei
Weiyao Wang
Deepti Ghadiyaram
Matt Feiszli
VOS
93
76
0
26 Jan 2021
Weakly Supervised Learning for Facial Behavior Analysis: A Review
G. Praveen
Eric Granger
Patrick Cardinal
CVBM
37
6
0
25 Jan 2021
Bridging the gap between Human Action Recognition and Online Action Detection
Alban Main De Boissiere
R. Noumeir
22
0
0
21 Jan 2021
Video Relation Detection with Trajectory-aware Multi-modal Features
W. Xie
Guanghui Ren
Si Liu
28
21
0
20 Jan 2021
TCLR: Temporal Contrastive Learning for Video Representation
I. Dave
Rohit Gupta
Mamshad Nayeem Rizve
Mubarak Shah
SSL
AI4TS
36
175
0
20 Jan 2021
Coarse Temporal Attention Network (CTA-Net) for Driver's Activity Recognition
Zachary Wharton
Ardhendu Behera
Yonghuai Liu
Nikolaos Bessis
39
35
0
17 Jan 2021
Temporal-Relational CrossTransformers for Few-Shot Action Recognition
Toby Perrett
A. Masullo
T. Burghardt
Majid Mirmehdi
Dima Damen
ViT
31
145
0
15 Jan 2021
Learning from Weakly-labeled Web Videos via Exploring Sub-Concepts
Kunpeng Li
Zizhao Zhang
Guanhang Wu
Xuehan Xiong
Chen-Yu Lee
Zhichao Lu
Y. Fu
Tomas Pfister
34
5
0
11 Jan 2021
Uncertainty-sensitive Activity Recognition: a Reliability Benchmark and the CARING Models
Alina Roitberg
Monica Haurilet
Manuel Martínez
Rainer Stiefelhagen
UQCV
39
6
0
02 Jan 2021
Semantics for Robotic Mapping, Perception and Interaction: A Survey
Sourav Garg
Niko Sünderhauf
Feras Dayoub
D. Morrison
Akansel Cosgun
...
Tat-Jun Chin
Ian Reid
Stephen Gould
Peter Corke
Michael Milford
31
115
0
02 Jan 2021
Refining activation downsampling with SoftPool
Alexandros Stergiou
R. Poppe
Grigorios Kalliatakis
36
159
0
02 Jan 2021
Tensor Representations for Action Recognition
Piotr Koniusz
Lei Wang
A. Cherian
41
69
0
28 Dec 2020
Context-Aware Personality Inference in Dyadic Scenarios: Introducing the UDIVA Dataset
Cristina Palmero
Javier Selva
Sorina Smeureanu
Julio C. S. Jacques Junior
Albert Clapés
...
Zejian Zhang
D. Gallardo-Pujol
G. Guilera
D. Leiva
Sergio Escalera
35
53
0
28 Dec 2020
CNNs for JPEGs: A Study in Computational Cost
Samuel Felipe dos Santos
N. Sebe
Jurandy Almeida
30
2
0
26 Dec 2020
SMART Frame Selection for Action Recognition
Shreyank N. Gowda
Marcus Rohrbach
Laura Sevilla-Lara
31
142
0
19 Dec 2020
TDN: Temporal Difference Networks for Efficient Action Recognition
Limin Wang
Zhan Tong
Bin Ji
Gangshan Wu
28
391
0
18 Dec 2020
Multi-shot Temporal Event Localization: a Benchmark
Xiaolong Liu
Yao Hu
S. Bai
Fei Ding
X. Bai
Philip Torr
51
82
0
17 Dec 2020
GTA: Global Temporal Attention for Video Action Understanding
Bo He
Xitong Yang
Zuxuan Wu
Hao Chen
Ser-Nam Lim
Abhinav Shrivastava
ViT
33
27
0
15 Dec 2020
MSAF: Multimodal Split Attention Fusion
Lang Su
Chuqing Hu
Guofa Li
Dongpu Cao
30
37
0
13 Dec 2020
A Comprehensive Study of Deep Video Action Recognition
Yi Zhu
Xinyu Li
Chunhui Liu
Mohammadreza Zolfaghari
Yuanjun Xiong
Chongruo Wu
Zhi-Li Zhang
Joseph Tighe
R. Manmatha
Mu Li
VLM
AI4TS
38
185
0
11 Dec 2020
D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised Activations
Sanath Narayan
Hisham Cholakkal
Munawar Hayat
Fahad Shahbaz Khan
Ming-Hsuan Yang
Ling Shao
27
54
0
11 Dec 2020
Intrinsic Temporal Regularization for High-resolution Human Video Synthesis
Lingbo Yang
Zhanning Gao
Peiran Ren
Siwei Ma
Wen Gao
3DH
24
1
0
11 Dec 2020
Deep Lesion Tracker: Monitoring Lesions in 4D Longitudinal Imaging Studies
Jinzheng Cai
Youbao Tang
K. Yan
Adam P. Harrison
Jing Xiao
Gigin Lin
Le Lu
MedIm
41
29
0
09 Dec 2020
Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language
Songyang Zhang
Houwen Peng
Jianlong Fu
Yijuan Lu
Jiebo Luo
27
51
0
04 Dec 2020
Spatial-Temporal Alignment Network for Action Recognition and Detection
Junwei Liang
Liangliang Cao
Xuehan Xiong
Ting Yu
Alexander G. Hauptmann
3DPC
18
9
0
04 Dec 2020
Video Self-Stitching Graph Network for Temporal Action Localization
Chen Zhao
Ali K. Thabet
Guohao Li
26
138
0
30 Nov 2020
Depth-Aware Action Recognition: Pose-Motion Encoding through Temporal Heatmaps
Mattia Segu
Federico Pirovano
Gianmario Fumagalli
Amedeo Fabris
23
2
0
26 Nov 2020
SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of Broadcast Soccer Videos
Adrien Deliège
A. Cioppa
Silvio Giancola
M. J. Seikavandi
J. Dueholm
Kamal Nasrollahi
Guohao Li
T. Moeslund
Marc Van Droogenbroeck
18
152
0
26 Nov 2020
t-EVA: Time-Efficient t-SNE Video Annotation
Soroosh Poorgholi
O. Kayhan
Jan van Gemert
16
5
0
26 Nov 2020
Sign language segmentation with temporal convolutional networks
Katrin Renz
N. Stache
Samuel Albanie
Gül Varol
SLR
22
25
0
25 Nov 2020
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks
Humam Alwassel
Silvio Giancola
Guohao Li
33
123
0
23 Nov 2020
Hierarchically Decoupled Spatial-Temporal Contrast for Self-supervised Video Representation Learning
Zehua Zhang
David J. Crandall
AI4TS
SSL
28
23
0
23 Nov 2020
We don't Need Thousand Proposals: Single Shot Actor-Action Detection in Videos
A. J. Rana
Yogesh S Rawat
ViT
13
11
0
22 Nov 2020
Visual Recognition of Great Ape Behaviours in the Wild
Faizaan Sakib
T. Burghardt
22
24
0
21 Nov 2020
Game Plan: What AI can do for Football, and What Football can do for AI
K. Tuyls
Shayegan Omidshafiei
Paul Muller
Zhe Wang
Jerome T. Connor
...
Simon Bouton
Nathalie Beauguerlange
Jackson Broshear
T. Graepel
Demis Hassabis
46
78
0
18 Nov 2020
3D CNNs with Adaptive Temporal Feature Resolutions
Mohsen Fayyaz
Emad Bahrami Rad
Ali Diba
M. Noroozi
Ehsan Adeli
Luc Van Gool
Juergen Gall
3DPC
24
30
0
17 Nov 2020
Semi-Supervised Few-Shot Atomic Action Recognition
Xiaoyuan Ni
Sizhe Song
Yu-Wing Tai
Chi-Keung Tang
19
3
0
17 Nov 2020
Multi-Modal Hybrid Architecture for Pedestrian Action Prediction
Amir Rasouli
Tiffany Yau
Mohsen Rohani
Jun Luo
31
43
0
16 Nov 2020
JOLO-GCN: Mining Joint-Centered Light-Weight Information for Skeleton-Based Action Recognition
Jinmiao Cai
Nianjuan Jiang
Xiaoguang Han
Kui Jia
Jiangbo Lu
24
84
0
16 Nov 2020
Multimodal Pretraining for Dense Video Captioning
Gabriel Huang
Bo Pang
Zhenhai Zhu
Clara E. Rivera
Radu Soricut
21
81
0
10 Nov 2020
Selective Spatio-Temporal Aggregation Based Pose Refinement System: Towards Understanding Human Activities in Real-World Videos
Di Yang
Rui Dai
Yaohui Wang
Rupayan Mallick
Luca Minciullo
Gianpiero Francesca
Francois Bremond
35
16
0
10 Nov 2020
Temporal Stochastic Softmax for 3D CNNs: An Application in Facial Expression Recognition
T. Ayral
M. Pedersoli
Simon L Bacon
Eric Granger
CVBM
3DH
13
11
0
10 Nov 2020
Multi-Temporal Convolutions for Human Action Recognition in Videos
Alexandros Stergiou
R. Poppe
29
1
0
08 Nov 2020
AOT: Appearance Optimal Transport Based Identity Swapping for Forgery Detection
Hao Zhu
Chaoyou Fu
Qianyi Wu
Wayne Wu
Chao Qian
Ran He
41
32
0
05 Nov 2020
Content-based Analysis of the Cultural Differences between TikTok and Douyin
Li-yao Sun
Haoqi Zhang
Songyang Zhang
Jiebo Luo
13
24
0
03 Nov 2020
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
Simon Ging
Mohammadreza Zolfaghari
Hamed Pirsiavash
Thomas Brox
ViT
CLIP
31
169
0
01 Nov 2020
A Survey on Contrastive Self-supervised Learning
Ashish Jaiswal
Ashwin Ramesh Babu
Mohammad Zaki Zadeh
Debapriya Banerjee
F. Makedon
SSL
57
1,361
0
31 Oct 2020
Pretext-Contrastive Learning: Toward Good Practices in Self-supervised Video Representation Leaning
L. Tao
Xueting Wang
T. Yamasaki
VLM
SSL
23
14
0
29 Oct 2020
ElderSim: A Synthetic Data Generation Platform for Human Action Recognition in Eldercare Applications
Hochul Hwang
Cheongjae Jang
Geonwoo Park
Junghyun Cho
Ig-Jae Kim
34
70
0
28 Oct 2020
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition
Chun-Fu Chen
Yikang Shen
K. Ramakrishnan
Rogerio Feris
J. M. Cohn
A. Oliva
Quanfu Fan
23
95
0
22 Oct 2020