ResearchTrend.AI

© 2025 ResearchTrend.AI, All rights reserved.

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017
João Carreira
Andrew Zisserman

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 3,647 papers shown
Feature-Supervised Action Modality Transfer
Fida Mohammad Thoker
Cees G. M. Snoek
40
2
0
06 Aug 2021
Interpretable Visual Understanding with Cognitive Attention Network
Xuejiao Tang
Wenbin Zhang
Yi Yu
Kea Turner
Hanyu Wang
Mengyu Wang
Eirini Ntoutsi
136
12
0
06 Aug 2021
Elaborative Rehearsal for Zero-shot Action Recognition
Shizhe Chen
Dong Huang
VLM
97
96
0
05 Aug 2021
Token Shift Transformer for Video Classification
Hao Zhang
Y. Hao
Chong-Wah Ngo
ViT
87
119
0
05 Aug 2021
Hybrid Reasoning Network for Video-based Commonsense Captioning
Weijiang Yu
Jian Liang
Lei Ji
Lu Li
Yuejian Fang
Nong Xiao
Nan Duan
69
10
0
05 Aug 2021
Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization
Rui Qian
Yuxi Li
Huabin Liu
John See
Shuangrui Ding
Xian Liu
Dian Li
Weiyao Lin
84
42
0
04 Aug 2021
Optimizing Latency for Online Video Captioning Using Audio-Visual Transformers
Chiori Hori
Takaaki Hori
Jonathan Le Roux
56
4
0
04 Aug 2021
Skeleton Cloud Colorization for Unsupervised 3D Action Representation Learning
Siyuan Yang
Jun Liu
Shijian Lu
Meng Hwa Er
Alex C. Kot
3DH
3DPC
117
95
0
04 Aug 2021
OncoNet: Weakly Supervised Siamese Network to automate cancer treatment response assessment between longitudinal FDG PET/CT examinations
Anirudh Joshi
Sabri Eyuboglu
Shih-Cheng Huang
Jared A. Dunnmon
Arjun Soin
G. Davidzon
Akshay S. Chaudhari
M. Lungren
23
3
0
03 Aug 2021
Domain Adaptor Networks for Hyperspectral Image Recognition
Gustavo Pérez
Subhransu Maji
31
0
0
03 Aug 2021
MTVR: Multilingual Moment Retrieval in Videos
Jie Lei
Tamara L. Berg
Joey Tianyi Zhou
77
11
0
30 Jul 2021
Recognizing Emotions evoked by Movies using Multitask Learning
Hassan Hayat
Carles Ventura
Àgata Lapedriza
25
4
0
30 Jul 2021
The interpretation of endobronchial ultrasound image using 3D convolutional neural network for differentiating malignant and benign mediastinal lesions
Ching-Kai Lin
Shaojie Wu
Jerry S Chang
Yun-Chien Cheng
14
3
0
29 Jul 2021
Video Generation from Text Employing Latent Path Construction for Temporal Modeling
Amir Mazaheri
M. Shah
75
8
0
29 Jul 2021
Spot What Matters: Learning Context Using Graph Convolutional Networks for Weakly-Supervised Action Detection
Michail Tsiaousis
Gertjan J. Burghouts
Fieke Hillerstrom
P. V. D. Putten
68
0
0
28 Jul 2021
Insights from Generative Modeling for Neural Video Compression
Ruihan Yang
Yibo Yang
Joseph Marino
Stephan Mandt
VGen
113
16
0
28 Jul 2021
A New Split for Evaluating True Zero-Shot Action Recognition
Shreyank N. Gowda
Laura Sevilla-Lara
Kiyoon Kim
Frank Keller
Marcus Rohrbach
VLM
77
25
0
27 Jul 2021
Enriching Local and Global Contexts for Temporal Action Localization
Zixin Zhu
Wei Tang
Le Wang
N. Zheng
G. Hua
99
112
0
27 Jul 2021
Vision-Guided Forecasting -- Visual Context for Multi-Horizon Time Series Forecasting
Eitan Kosman
Dotan Di Castro
AI4TS
52
1
0
27 Jul 2021
PiSLTRc: Position-informed Sign Language Transformer with Content-aware Convolution
Pan Xie
Mengyi Zhao
Xiaohui Hu
ViT
SLR
99
35
0
27 Jul 2021
Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization
Fa-Ting Hong
Jialuo Feng
Dan Xu
Ying Shan
Weishi Zheng
117
89
0
27 Jul 2021
Towards Efficient Tensor Decomposition-Based DNN Model Compression with Optimization Framework
Miao Yin
Yang Sui
Siyu Liao
Bo Yuan
60
81
0
26 Jul 2021
Spatial-Temporal Transformer for Dynamic Scene Graph Generation
Yuren Cong
Wentong Liao
H. Ackermann
Bodo Rosenhahn
M. Yang
ViT
72
129
0
26 Jul 2021
HANet: Hierarchical Alignment Networks for Video-Text Retrieval
Peng Wu
Xiangteng He
Mingqian Tang
Yiliang Lv
Jing Liu
103
56
0
26 Jul 2021
Temporal Alignment Prediction for Few-Shot Video Classification
Fei Pan
Chunlei Xu
Jie Guo
Yanwen Guo
AI4TS
58
1
0
26 Jul 2021
Spatio-Temporal Representation Factorization for Video-based Person Re-Identification
Abhishek Aich
Meng Zheng
Srikrishna Karanam
Terrence Chen
Amit K. Roy-Chowdhury
Ziyan Wu
128
72
0
25 Jul 2021
Transcript to Video: Efficient Clip Sequencing from Texts
Yu Xiong
Fabian Caba Heilbron
Dahua Lin
CLIP
62
10
0
25 Jul 2021
Adaptive Recursive Circle Framework for Fine-grained Action Recognition
Hanxi Lin
Xinxiao Wu
Jiebo Luo
65
2
0
25 Jul 2021
Self-Conditioned Probabilistic Learning of Video Rescaling
Yuan Tian
Guo Lu
Xiongkuo Min
Zhaohui Che
Guangtao Zhai
G. Guo
Zhiyong Gao
41
26
0
24 Jul 2021
TinyAction Challenge: Recognizing Real-world Low-resolution Activities in Videos
Praveen Tirupattur
A. J. Rana
Tushar Sangam
Shruti Vyas
Yogesh S Rawat
M. Shah
42
6
0
24 Jul 2021
EAN: Event Adaptive Network for Enhanced Action Recognition
Yuan Tian
Yichao Yan
Guangtao Zhai
G. Guo
Zhiyong Gao
84
42
0
22 Jul 2021
Evidential Deep Learning for Open Set Action Recognition
Wentao Bao
Qi Yu
Yu Kong
CML
EDL
116
141
0
21 Jul 2021
Multi-modal Residual Perceptron Network for Audio-Video Emotion Recognition
Xin Chang
W. Skarbek
67
20
0
21 Jul 2021
Looking for the Signs: Identifying Isolated Sign Instances in Continuous Video Footage
Tao Jiang
Necati Cihan Camgöz
Richard Bowden
52
13
0
21 Jul 2021
UNIK: A Unified Framework for Real-world Skeleton-based Action Recognition
Di Yang
Yaohui Wang
A. Dantcheva
Lorenzo Garattoni
Gianpiero Francesca
Francois Bremond
86
49
0
19 Jul 2021
Action Forecasting with Feature-wise Self-Attention
Yan Bin Ng
Basura Fernando
EgoV
28
0
0
19 Jul 2021
Federated Action Recognition on Heterogeneous Embedded Devices
Pranjali Jain
Shreyas Goenka
S. Bagchi
Biplab Banerjee
Somali Chaterji
FedML
81
8
0
18 Jul 2021
CCVS: Context-aware Controllable Video Synthesis
G. L. Moing
Jean Ponce
Cordelia Schmid
105
81
0
16 Jul 2021
Is attention to bounding boxes all you need for pedestrian action prediction?
Lina Achaji
Julien Moreau
Thibault Fouqueray
François Aioun
François Charpillet
82
34
0
16 Jul 2021
Training for temporal sparsity in deep neural networks, application in video processing
Amirreza Yousefzadeh
Manolis Sifalakis
73
3
0
15 Jul 2021
What and When to Look?: Temporal Span Proposal Network for Video Relation Detection
Sangmin Woo
Junhyug Noh
Kangil Kim
54
2
0
15 Jul 2021
Developmental Stage Classification of Embryos Using Two-Stream Neural Network with Linear-Chain Conditional Random Field
Stanislav Lukyanenko
Won-Dong Jang
D. Wei
R. Struyven
Yoon Kim
...
Helen Y Yang
Alexander M. Rush
D. Ben-Yosef
D. Needleman
Hanspeter Pfister
53
9
0
13 Jul 2021
End-to-end Multi-modal Video Temporal Grounding
Yi-Wen Chen
Yi-Hsuan Tsai
Ming-Hsuan Yang
78
51
0
12 Jul 2021
Let's Play for Action: Recognizing Activities of Daily Living by Learning from Life Simulation Video Games
Alina Roitberg
David Schneider
Aulia Djamal
C. Seibold
Simon Reiß
Rainer Stiefelhagen
91
31
0
12 Jul 2021
Delta Sampling R-BERT for limited data and low-light action recognition
Sanchit Hira
Ritwik Das
Abhinav Modi
D. Pakhomov
111
17
0
12 Jul 2021
Review of Video Predictive Understanding: Early Action Recognition and Future Action Prediction
He Zhao
Richard P. Wildes
77
10
0
11 Jul 2021
Interpretable Deep Feature Propagation for Early Action Recognition
He Zhao
Richard P. Wildes
FAtt
63
8
0
11 Jul 2021
Aligning Correlation Information for Domain Adaptation in Action Recognition
Yuecong Xu
Jianfei Yang
Haozhi Cao
K. Mao
Jianxiong Yin
Simon See
89
39
0
11 Jul 2021
COVID Detection in Chest CTs: Improving the Baseline on COV19-CT-DB
R. Miron
Cosmin Moisii
Sergiu-Andrei Dinu
Mihaela Breaban
38
6
0
10 Jul 2021
TA2N: Two-Stage Action Alignment Network for Few-shot Action Recognition
Shuyuan Li
Huabin Liu
Rui Qian
Yuxi Li
John See
Mengjuan Fei
Xiaoyuan Yu
W. Lin
112
79
0
10 Jul 2021