1705.07750
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
22 May 2017
João Carreira
Andrew Zisserman
Papers citing
"Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"
50 / 3,647 papers shown
Title
Feature-Supervised Action Modality Transfer
Fida Mohammad Thoker
Cees G. M. Snoek
40
2
0
06 Aug 2021
Interpretable Visual Understanding with Cognitive Attention Network
Xuejiao Tang
Wenbin Zhang
Yi Yu
Kea Turner
Hanyu Wang
Mengyu Wang
Eirini Ntoutsi
136
12
0
06 Aug 2021
Elaborative Rehearsal for Zero-shot Action Recognition
Shizhe Chen
Dong Huang
VLM
97
96
0
05 Aug 2021
Token Shift Transformer for Video Classification
Hao Zhang
Y. Hao
Chong-Wah Ngo
ViT
87
119
0
05 Aug 2021
Hybrid Reasoning Network for Video-based Commonsense Captioning
Weijiang Yu
Jian Liang
Lei Ji
Lu Li
Yuejian Fang
Nong Xiao
Nan Duan
69
10
0
05 Aug 2021
Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization
Rui Qian
Yuxi Li
Huabin Liu
John See
Shuangrui Ding
Xian Liu
Dian Li
Weiyao Lin
84
42
0
04 Aug 2021
Optimizing Latency for Online Video Captioning Using Audio-Visual Transformers
Chiori Hori
Takaaki Hori
Jonathan Le Roux
56
4
0
04 Aug 2021
Skeleton Cloud Colorization for Unsupervised 3D Action Representation Learning
Siyuan Yang
Jun Liu
Shijian Lu
Meng Hwa Er
Alex C. Kot
3DH
3DPC
117
95
0
04 Aug 2021
OncoNet: Weakly Supervised Siamese Network to automate cancer treatment response assessment between longitudinal FDG PET/CT examinations
Anirudh Joshi
Sabri Eyuboglu
Shih-Cheng Huang
Jared A. Dunnmon
Arjun Soin
G. Davidzon
Akshay S. Chaudhari
M. Lungren
23
3
0
03 Aug 2021
Domain Adaptor Networks for Hyperspectral Image Recognition
Gustavo Pérez
Subhransu Maji
31
0
0
03 Aug 2021
MTVR: Multilingual Moment Retrieval in Videos
Jie Lei
Tamara L. Berg
Joey Tianyi Zhou
77
11
0
30 Jul 2021
Recognizing Emotions evoked by Movies using Multitask Learning
Hassan Hayat
Carles Ventura
Àgata Lapedriza
25
4
0
30 Jul 2021
The interpretation of endobronchial ultrasound image using 3D convolutional neural network for differentiating malignant and benign mediastinal lesions
Ching-Kai Lin
Shaojie Wu
Jerry S Chang
Yun-Chien Cheng
14
3
0
29 Jul 2021
Video Generation from Text Employing Latent Path Construction for Temporal Modeling
Amir Mazaheri
M. Shah
75
8
0
29 Jul 2021
Spot What Matters: Learning Context Using Graph Convolutional Networks for Weakly-Supervised Action Detection
Michail Tsiaousis
Gertjan J. Burghouts
Fieke Hillerstrom
P. V. D. Putten
68
0
0
28 Jul 2021
Insights from Generative Modeling for Neural Video Compression
Ruihan Yang
Yibo Yang
Joseph Marino
Stephan Mandt
VGen
113
16
0
28 Jul 2021
A New Split for Evaluating True Zero-Shot Action Recognition
Shreyank N. Gowda
Laura Sevilla-Lara
Kiyoon Kim
Frank Keller
Marcus Rohrbach
VLM
77
25
0
27 Jul 2021
Enriching Local and Global Contexts for Temporal Action Localization
Zixin Zhu
Wei Tang
Le Wang
N. Zheng
G. Hua
99
112
0
27 Jul 2021
Vision-Guided Forecasting -- Visual Context for Multi-Horizon Time Series Forecasting
Eitan Kosman
Dotan Di Castro
AI4TS
52
1
0
27 Jul 2021
PiSLTRc: Position-informed Sign Language Transformer with Content-aware Convolution
Pan Xie
Mengyi Zhao
Xiaohui Hu
ViT
SLR
99
35
0
27 Jul 2021
Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization
Fa-Ting Hong
Jialuo Feng
Dan Xu
Ying Shan
Weishi Zheng
117
89
0
27 Jul 2021
Towards Efficient Tensor Decomposition-Based DNN Model Compression with Optimization Framework
Miao Yin
Yang Sui
Siyu Liao
Bo Yuan
60
81
0
26 Jul 2021
Spatial-Temporal Transformer for Dynamic Scene Graph Generation
Yuren Cong
Wentong Liao
H. Ackermann
Bodo Rosenhahn
M. Yang
ViT
72
129
0
26 Jul 2021
HANet: Hierarchical Alignment Networks for Video-Text Retrieval
Peng Wu
Xiangteng He
Mingqian Tang
Yiliang Lv
Jing Liu
103
56
0
26 Jul 2021
Temporal Alignment Prediction for Few-Shot Video Classification
Fei Pan
Chunlei Xu
Jie Guo
Yanwen Guo
AI4TS
58
1
0
26 Jul 2021
Spatio-Temporal Representation Factorization for Video-based Person Re-Identification
Abhishek Aich
Meng Zheng
Srikrishna Karanam
Terrence Chen
Amit K. Roy-Chowdhury
Ziyan Wu
128
72
0
25 Jul 2021
Transcript to Video: Efficient Clip Sequencing from Texts
Yu Xiong
Fabian Caba Heilbron
Dahua Lin
CLIP
62
10
0
25 Jul 2021
Adaptive Recursive Circle Framework for Fine-grained Action Recognition
Hanxi Lin
Xinxiao Wu
Jiebo Luo
65
2
0
25 Jul 2021
Self-Conditioned Probabilistic Learning of Video Rescaling
Yuan Tian
Guo Lu
Xiongkuo Min
Zhaohui Che
Guangtao Zhai
G. Guo
Zhiyong Gao
41
26
0
24 Jul 2021
TinyAction Challenge: Recognizing Real-world Low-resolution Activities in Videos
Praveen Tirupattur
A. J. Rana
Tushar Sangam
Shruti Vyas
Yogesh S Rawat
M. Shah
42
6
0
24 Jul 2021
EAN: Event Adaptive Network for Enhanced Action Recognition
Yuan Tian
Yichao Yan
Guangtao Zhai
G. Guo
Zhiyong Gao
84
42
0
22 Jul 2021
Evidential Deep Learning for Open Set Action Recognition
Wentao Bao
Qi Yu
Yu Kong
CML
EDL
116
141
0
21 Jul 2021
Multi-modal Residual Perceptron Network for Audio-Video Emotion Recognition
Xin Chang
W. Skarbek
67
20
0
21 Jul 2021
Looking for the Signs: Identifying Isolated Sign Instances in Continuous Video Footage
Tao Jiang
Necati Cihan Camgöz
Richard Bowden
52
13
0
21 Jul 2021
UNIK: A Unified Framework for Real-world Skeleton-based Action Recognition
Di Yang
Yaohui Wang
A. Dantcheva
Lorenzo Garattoni
Gianpiero Francesca
Francois Bremond
86
49
0
19 Jul 2021
Action Forecasting with Feature-wise Self-Attention
Yan Bin Ng
Basura Fernando
EgoV
28
0
0
19 Jul 2021
Federated Action Recognition on Heterogeneous Embedded Devices
Pranjali Jain
Shreyas Goenka
S. Bagchi
Biplab Banerjee
Somali Chaterji
FedML
81
8
0
18 Jul 2021
CCVS: Context-aware Controllable Video Synthesis
G. L. Moing
Jean Ponce
Cordelia Schmid
105
81
0
16 Jul 2021
Is attention to bounding boxes all you need for pedestrian action prediction?
Lina Achaji
Julien Moreau
Thibault Fouqueray
François Aioun
François Charpillet
82
34
0
16 Jul 2021
Training for temporal sparsity in deep neural networks, application in video processing
Amirreza Yousefzadeh
Manolis Sifalakis
73
3
0
15 Jul 2021
What and When to Look?: Temporal Span Proposal Network for Video Relation Detection
Sangmin Woo
Junhyug Noh
Kangil Kim
54
2
0
15 Jul 2021
Developmental Stage Classification of Embryos Using Two-Stream Neural Network with Linear-Chain Conditional Random Field
Stanislav Lukyanenko
Won-Dong Jang
D. Wei
R. Struyven
Yoon Kim
...
Helen Y Yang
Alexander M. Rush
D. Ben-Yosef
D. Needleman
Hanspeter Pfister
53
9
0
13 Jul 2021
End-to-end Multi-modal Video Temporal Grounding
Yi-Wen Chen
Yi-Hsuan Tsai
Ming-Hsuan Yang
78
51
0
12 Jul 2021
Let's Play for Action: Recognizing Activities of Daily Living by Learning from Life Simulation Video Games
Alina Roitberg
David Schneider
Aulia Djamal
C. Seibold
Simon Reiß
Rainer Stiefelhagen
91
31
0
12 Jul 2021
Delta Sampling R-BERT for limited data and low-light action recognition
Sanchit Hira
Ritwik Das
Abhinav Modi
D. Pakhomov
111
17
0
12 Jul 2021
Review of Video Predictive Understanding: Early Action Recognition and Future Action Prediction
He Zhao
Richard P. Wildes
77
10
0
11 Jul 2021
Interpretable Deep Feature Propagation for Early Action Recognition
He Zhao
Richard P. Wildes
FAtt
63
8
0
11 Jul 2021
Aligning Correlation Information for Domain Adaptation in Action Recognition
Yuecong Xu
Jianfei Yang
Haozhi Cao
K. Mao
Jianxiong Yin
Simon See
89
39
0
11 Jul 2021
COVID Detection in Chest CTs: Improving the Baseline on COV19-CT-DB
R. Miron
Cosmin Moisii
Sergiu-Andrei Dinu
Mihaela Breaban
38
6
0
10 Jul 2021
TA2N: Two-Stage Action Alignment Network for Few-shot Action Recognition
Shuyuan Li
Huabin Liu
Rui Qian
Yuxi Li
John See
Mengjuan Fei
Xiaoyuan Yu
W. Lin
112
79
0
10 Jul 2021