ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.07750
  4. Cited By
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017
João Carreira
Andrew Zisserman
ArXivPDFHTML

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 1,478 papers shown
Title
Connecting Language and Vision for Natural Language-Based Vehicle
  Retrieval
Connecting Language and Vision for Natural Language-Based Vehicle Retrieval
Shuai Bai
Zhedong Zheng
Xiaohan Wang
Junyang Lin
Zhu Zhang
Chang Zhou
Yi Yang
Hongxia Yang
24
27
0
31 May 2021
Towards Diverse Paragraph Captioning for Untrimmed Videos
Towards Diverse Paragraph Captioning for Untrimmed Videos
Yuqing Song
Shizhe Chen
Qin Jin
21
37
0
30 May 2021
SSCAP: Self-supervised Co-occurrence Action Parsing for Unsupervised
  Temporal Action Segmentation
SSCAP: Self-supervised Co-occurrence Action Parsing for Unsupervised Temporal Action Segmentation
Zhe Wang
Hao Chen
Xinyu Li
Chunhui Liu
Yuanjun Xiong
Joseph Tighe
Charless C. Fowlkes
30
20
0
29 May 2021
Unsupervised Action Segmentation by Joint Representation Learning and
  Online Clustering
Unsupervised Action Segmentation by Joint Representation Learning and Online Clustering
Sateesh Kumar
S. Haresh
Awais Ahmed
Andrey Konin
M. Zia
Quoc-Huy Tran
SSL
27
47
0
27 May 2021
Detecting Biological Locomotion in Video: A Computational Approach
Detecting Biological Locomotion in Video: A Computational Approach
Soo-Min Kang
Richard P. Wildes
17
0
0
26 May 2021
Improving Sign Language Translation with Monolingual Data by Sign
  Back-Translation
Improving Sign Language Translation with Monolingual Data by Sign Back-Translation
Hao Zhou
Wen-gang Zhou
Weizhen Qi
Junfu Pu
Houqiang Li
SLR
35
182
0
26 May 2021
DSANet: Dynamic Segment Aggregation Network for Video-Level
  Representation Learning
DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning
Wenhao Wu
Yuxiang Zhao
Yanwu Xu
Xiao Tan
Dongliang He
...
Jinxing Ye
Yingying Li
Mingde Yao
Zichao Dong
Yifeng Shi
AI4TS
30
27
0
25 May 2021
Temporal Action Proposal Generation with Transformers
Temporal Action Proposal Generation with Transformers
Lining Wang
Haosen Yang
Wenhao Wu
Huanjin Yao
Hujie Huang
ViT
38
27
0
25 May 2021
ST-HOI: A Spatial-Temporal Baseline for Human-Object Interaction
  Detection in Videos
ST-HOI: A Spatial-Temporal Baseline for Human-Object Interaction Detection in Videos
Meng-Jiun Chiou
Chun-Yu Liao
Li-Wei Wang
Roger Zimmermann
Jiashi Feng
43
24
0
25 May 2021
FineAction: A Fine-Grained Video Dataset for Temporal Action
  Localization
FineAction: A Fine-Grained Video Dataset for Temporal Action Localization
Yi Liu
Limin Wang
Yali Wang
Xiao Ma
Yu Qiao
24
56
0
24 May 2021
Coarse to Fine Multi-Resolution Temporal Convolutional Network
Coarse to Fine Multi-Resolution Temporal Convolutional Network
Dipika Singhania
R. Rahaman
Angela Yao
AI4TS
24
55
0
23 May 2021
Sharing Pain: Using Pain Domain Transfer for Video Recognition of Low
  Grade Orthopedic Pain in Horses
Sharing Pain: Using Pain Domain Transfer for Video Recognition of Low Grade Orthopedic Pain in Horses
Sofia Broomé
K. Ask
Maheen Rashid-Engström
Pia Haubro Andersen
Hedvig Kjellström
21
12
0
21 May 2021
Parallel Attention Network with Sequence Matching for Video Grounding
Parallel Attention Network with Sequence Matching for Video Grounding
Hao Zhang
Aixin Sun
Wei Jing
Liangli Zhen
Qiufeng Wang
Rick Siow Mong Goh
23
40
0
18 May 2021
NExT-QA:Next Phase of Question-Answering to Explaining Temporal Actions
NExT-QA:Next Phase of Question-Answering to Explaining Temporal Actions
Junbin Xiao
Xindi Shang
Angela Yao
Tat-Seng Chua
45
448
0
18 May 2021
VPN++: Rethinking Video-Pose embeddings for understanding Activities of
  Daily Living
VPN++: Rethinking Video-Pose embeddings for understanding Activities of Daily Living
Srijan Das
Rui Dai
Di Yang
Francois Bremond
ViT
48
67
0
17 May 2021
MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized
  Sports Actions
MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions
Yixuan Li
Lei Chen
Runyu He
Zhenzhi Wang
Gangshan Wu
Limin Wang
27
97
0
16 May 2021
Cross-Modal Progressive Comprehension for Referring Segmentation
Cross-Modal Progressive Comprehension for Referring Segmentation
Si Liu
Tianrui Hui
Shaofei Huang
Yunchao Wei
Bo Li
Guanbin Li
EgoV
VOS
28
124
0
15 May 2021
MutualNet: Adaptive ConvNet via Mutual Learning from Different Model
  Configurations
MutualNet: Adaptive ConvNet via Mutual Learning from Different Model Configurations
Taojiannan Yang
Sijie Zhu
Matías Mendieta
Pu Wang
Ravikumar Balakrishnan
Minwoo Lee
T. Han
M. Shah
Chong Chen
3DH
OOD
30
23
0
14 May 2021
Video Corpus Moment Retrieval with Contrastive Learning
Video Corpus Moment Retrieval with Contrastive Learning
Hao Zhang
Aixin Sun
Wei Jing
Guoshun Nan
Liangli Zhen
Qiufeng Wang
Rick Siow Mong Goh
46
81
0
13 May 2021
Representation Learning via Global Temporal Alignment and
  Cycle-Consistency
Representation Learning via Global Temporal Alignment and Cycle-Consistency
Isma Hadji
Konstantinos G. Derpanis
Allan D. Jepson
AI4TS
35
54
0
11 May 2021
AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition
AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition
Yikang Shen
Chun-Fu Chen
Quanfu Fan
Ximeng Sun
Kate Saenko
A. Oliva
Rogerio Feris
38
47
0
11 May 2021
Coupling Intent and Action for Pedestrian Crossing Behavior Prediction
Coupling Intent and Action for Pedestrian Crossing Behavior Prediction
Yu Yao
E. Atkins
Matthew Johnson-Roberson
Ram Vasudevan
Xiaoxiao Du
21
33
0
10 May 2021
Adaptive Focus for Efficient Video Recognition
Adaptive Focus for Efficient Video Recognition
Yulin Wang
Zhaoxi Chen
Haojun Jiang
Shiji Song
Yizeng Han
Gao Huang
45
98
0
07 May 2021
Human Object Interaction Detection using Two-Direction Spatial
  Enhancement and Exclusive Object Prior
Human Object Interaction Detection using Two-Direction Spatial Enhancement and Exclusive Object Prior
Lu Liu
R. Tan
31
9
0
07 May 2021
Unsupervised Discriminative Embedding for Sub-Action Learning in Complex
  Activities
Unsupervised Discriminative Embedding for Sub-Action Learning in Complex Activities
S. Swetha
Hilde Kuehne
Yogesh S Rawat
M. Shah
29
16
0
30 Apr 2021
BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video
  Person Re-Identification
BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video Person Re-Identification
Rui Hou
Hong Chang
Bingpeng Ma
Rui Huang
Shiguang Shan
32
85
0
30 Apr 2021
Action Unit Memory Network for Weakly Supervised Temporal Action
  Localization
Action Unit Memory Network for Weakly Supervised Temporal Action Localization
Wang Luo
Tianzhu Zhang
Wenfei Yang
Jingen Liu
Tao Mei
Feng Wu
Yongdong Zhang
24
79
0
29 Apr 2021
Learning Synergistic Attention for Light Field Salient Object Detection
Learning Synergistic Attention for Light Field Salient Object Detection
Y. Zhang
Geng Chen
Qian Chen
Yujia Sun
Yong Xia
Olivier Déforges
W. Hamidouche
Lu Zhang
43
23
0
28 Apr 2021
Sign Segmentation with Changepoint-Modulated Pseudo-Labelling
Sign Segmentation with Changepoint-Modulated Pseudo-Labelling
Katrin Renz
N. Stache
Neil Fox
Gül Varol
Samuel Albanie
45
18
0
28 Apr 2021
Medical Transformer: Universal Brain Encoder for 3D MRI Analysis
Medical Transformer: Universal Brain Encoder for 3D MRI Analysis
E. Jun
Seungwoo Jeong
Da-Woon Heo
Heung-Il Suk
ViT
MedIm
49
42
0
28 Apr 2021
Multimodal Clustering Networks for Self-supervised Learning from
  Unlabeled Videos
Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos
Brian Chen
Andrew Rouditchenko
Kevin Duarte
Hilde Kuehne
Samuel Thomas
...
Rogerio Feris
David Harwath
James R. Glass
M. Picheny
Shih-Fu Chang
SSL
36
89
0
26 Apr 2021
VidTr: Video Transformer Without Convolutions
VidTr: Video Transformer Without Convolutions
Yanyi Zhang
Xinyu Li
Chunhui Liu
Bing Shuai
Yi Zhu
Biagio Brattoli
Hao Chen
I. Marsic
Joseph Tighe
ViT
148
193
0
23 Apr 2021
Supervised Video Summarization via Multiple Feature Sets with Parallel
  Attention
Supervised Video Summarization via Multiple Feature Sets with Parallel Attention
J. Ghauri
Sherzod Hakimov
Ralph Ewerth
21
45
0
23 Apr 2021
Modeling long-term interactions to enhance action recognition
Modeling long-term interactions to enhance action recognition
Alejandro Cartas
Petia Radeva
Mariella Dimiccoli
EgoV
27
6
0
23 Apr 2021
SportsCap: Monocular 3D Human Motion Capture and Fine-grained
  Understanding in Challenging Sports Videos
SportsCap: Monocular 3D Human Motion Capture and Fine-grained Understanding in Challenging Sports Videos
Xin Chen
Anqi Pang
Wei Yang
Yuexin Ma
Lan Xu
Jingyi Yu
149
56
0
23 Apr 2021
Multiscale Vision Transformers
Multiscale Vision Transformers
Haoqi Fan
Bo Xiong
K. Mangalam
Yanghao Li
Zhicheng Yan
Jitendra Malik
Christoph Feichtenhofer
ViT
63
1,226
0
22 Apr 2021
Evaluating the Immediate Applicability of Pose Estimation for Sign
  Language Recognition
Evaluating the Immediate Applicability of Pose Estimation for Sign Language Recognition
Amit Moryossef
Ioannis Tsochantaridis
Joe Dinn
Necati Cihan Camgöz
Richard Bowden
Tao Jiang
Annette Rios Gonzales
Mathias Müller
Sarah Ebling
SLR
14
51
0
20 Apr 2021
MGSampler: An Explainable Sampling Strategy for Video Action Recognition
MGSampler: An Explainable Sampling Strategy for Video Action Recognition
Yuan Zhi
Zhan Tong
Limin Wang
Gangshan Wu
TTA
19
72
0
20 Apr 2021
Camera Calibration and Player Localization in SoccerNet-v2 and
  Investigation of their Representations for Action Spotting
Camera Calibration and Player Localization in SoccerNet-v2 and Investigation of their Representations for Action Spotting
A. Cioppa
Adrien Deliège
Floriane Magera
Silvio Giancola
Olivier Barnich
Guohao Li
Marc Van Droogenbroeck
40
56
0
19 Apr 2021
Higher Order Recurrent Space-Time Transformer for Video Action
  Prediction
Higher Order Recurrent Space-Time Transformer for Video Action Prediction
Tsung-Ming Tai
G. Fiameni
Cheng-Kuang Lee
Oswald Lanz
41
9
0
17 Apr 2021
VGNMN: Video-grounded Neural Module Network to Video-Grounded Language
  Tasks
VGNMN: Video-grounded Neural Module Network to Video-Grounded Language Tasks
Hung Le
Nancy F. Chen
Guosheng Lin
MLLM
30
19
0
16 Apr 2021
Temporally-Aware Feature Pooling for Action Spotting in Soccer
  Broadcasts
Temporally-Aware Feature Pooling for Action Spotting in Soccer Broadcasts
Silvio Giancola
Guohao Li
35
45
0
14 Apr 2021
Tensor Processing Primitives: A Programming Abstraction for Efficiency
  and Portability in Deep Learning & HPC Workloads
Tensor Processing Primitives: A Programming Abstraction for Efficiency and Portability in Deep Learning & HPC Workloads
E. Georganas
Dhiraj D. Kalamkar
Sasikanth Avancha
Menachem Adelman
Deepti Aggarwal
...
Ramanarayan Mohanty
Hans Pabst
Brian Retford
Barukh Ziv
A. Heinecke
42
17
0
12 Apr 2021
Object Priors for Classifying and Localizing Unseen Actions
Object Priors for Classifying and Localizing Unseen Actions
Pascal Mettes
William Thong
Cees G. M. Snoek
32
20
0
10 Apr 2021
Unidentified Video Objects: A Benchmark for Dense, Open-World
  Segmentation
Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation
Weiyao Wang
Matt Feiszli
Heng Wang
Du Tran
VOS
15
123
0
10 Apr 2021
Few-Shot Action Recognition with Compromised Metric via Optimal
  Transport
Few-Shot Action Recognition with Compromised Metric via Optimal Transport
Su Lu
Han-Jia Ye
De-Chuan Zhan
31
18
0
08 Apr 2021
Progressive Temporal Feature Alignment Network for Video Inpainting
Progressive Temporal Feature Alignment Network for Video Inpainting
Xueyan Zou
Linjie Yang
Ding Liu
Yong Jae Lee
19
56
0
08 Apr 2021
Self-Supervised Learning for Semi-Supervised Temporal Action Proposal
Self-Supervised Learning for Semi-Supervised Temporal Action Proposal
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Yuanjie Shao
Changxin Gao
Nong Sang
33
68
0
07 Apr 2021
The SARAS Endoscopic Surgeon Action Detection (ESAD) dataset: Challenges
  and methods
The SARAS Endoscopic Surgeon Action Detection (ESAD) dataset: Challenges and methods
V. Bawa
Gurkirt Singh
Francis KapingA
I. Skarga-Bandurova
Elettra Oleari
...
Li Li
Armando Stabile
Francesco Setti
R. Muradore
Fabio Cuzzolin
25
36
0
07 Apr 2021
The Multi-Agent Behavior Dataset: Mouse Dyadic Social Interactions
The Multi-Agent Behavior Dataset: Mouse Dyadic Social Interactions
Jennifer J. Sun
Tomomi Karigo
Dipam Chakraborty
Sharada Mohanty
Benjamin Wild
...
Chen Chen
D. Anderson
Pietro Perona
Yisong Yue
Ann Kennedy
39
48
0
06 Apr 2021
Previous
123...181920...282930
Next