ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.10864
  4. Cited By
A Short Note on the Kinetics-700-2020 Human Action Dataset

A Short Note on the Kinetics-700-2020 Human Action Dataset

21 October 2020
Lucas Smaira
João Carreira
Eric Noland
Ellen Clancy
Amy Wu
Andrew Zisserman
ArXivPDFHTML

Papers citing "A Short Note on the Kinetics-700-2020 Human Action Dataset"

21 / 71 papers shown
Title
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Andy Zeng
Maria Attarian
Brian Ichter
K. Choromanski
Adrian S. Wong
...
Michael S. Ryoo
Vikas Sindhwani
Johnny Lee
Vincent Vanhoucke
Peter R. Florence
ReLM
LRM
47
574
0
01 Apr 2022
Surgical Workflow Recognition: from Analysis of Challenges to
  Architectural Study
Surgical Workflow Recognition: from Analysis of Challenges to Architectural Study
Tobias Czempiel
Aidean Sharghi
Magdalini Paschali
Nassir Navab
Omid Mohareri
19
8
0
17 Mar 2022
Gate-Shift-Fuse for Video Action Recognition
Gate-Shift-Fuse for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
25
22
0
16 Mar 2022
Diffusion Probabilistic Modeling for Video Generation
Diffusion Probabilistic Modeling for Video Generation
Ruihan Yang
Prakhar Srivastava
Stephan Mandt
DiffM
VGen
59
257
0
16 Mar 2022
End-to-End Semantic Video Transformer for Zero-Shot Action Recognition
End-to-End Semantic Video Transformer for Zero-Shot Action Recognition
Keval Doshi
Yasin Yılmaz
ViT
35
2
0
10 Mar 2022
PAMI-AD: An Activity Detector Exploiting Part-attention and Motion
  Information in Surveillance Videos
PAMI-AD: An Activity Detector Exploiting Part-attention and Motion Information in Surveillance Videos
Yunhao Du
Zhihang Tong
Jun-Jun Wan
Binyu Zhang
Yanyun Zhao
24
3
0
08 Mar 2022
HAA4D: Few-Shot Human Atomic Action Recognition via 3D Spatio-Temporal
  Skeletal Alignment
HAA4D: Few-Shot Human Atomic Action Recognition via 3D Spatio-Temporal Skeletal Alignment
Mu-Ruei Tseng
Abhishek Gupta
Chi-Keung Tang
Yu-Wing Tai
3DH
30
7
0
15 Feb 2022
Video Violence Recognition and Localization Using a Semi-Supervised Hard
  Attention Model
Video Violence Recognition and Localization Using a Semi-Supervised Hard Attention Model
Hamid Reza Mohammadi
Ehsan Nazerfard
27
24
0
04 Feb 2022
Sports Video: Fine-Grained Action Detection and Classification of Table
  Tennis Strokes from Videos for MediaEval 2021
Sports Video: Fine-Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2021
Pierre-Etienne Martin
J. Calandre
Boris Mansencal
J. Benois-Pineau
Renaud Péteri
L. Mascarilla
J. Morlier
AI4TS
27
7
0
16 Dec 2021
Uni-Perceiver: Pre-training Unified Architecture for Generic Perception
  for Zero-shot and Few-shot Tasks
Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks
Xizhou Zhu
Jinguo Zhu
Hao Li
Xiaoshi Wu
Xiaogang Wang
Hongsheng Li
Xiaohua Wang
Jifeng Dai
56
129
0
02 Dec 2021
Zero-Shot Action Recognition from Diverse Object-Scene Compositions
Zero-Shot Action Recognition from Diverse Object-Scene Compositions
Carlo Bretti
Pascal Mettes
OCL
11
9
0
26 Oct 2021
Three-Stream 3D/1D CNN for Fine-Grained Action Classification and
  Segmentation in Table Tennis
Three-Stream 3D/1D CNN for Fine-Grained Action Classification and Segmentation in Table Tennis
Pierre-Etienne Martin
J. Benois-Pineau
Renaud Péteri
J. Morlier
MedIm
32
14
0
29 Sep 2021
How much human-like visual experience do current self-supervised
  learning algorithms need in order to achieve human-level object recognition?
How much human-like visual experience do current self-supervised learning algorithms need in order to achieve human-level object recognition?
Emin Orhan
OOD
54
4
0
23 Sep 2021
Perceiver IO: A General Architecture for Structured Inputs & Outputs
Perceiver IO: A General Architecture for Structured Inputs & Outputs
Andrew Jaegle
Sebastian Borgeaud
Jean-Baptiste Alayrac
Carl Doersch
Catalin Ionescu
...
Olivier J. Hénaff
M. Botvinick
Andrew Zisserman
Oriol Vinyals
João Carreira
MLLM
VLM
GNN
20
567
0
30 Jul 2021
TinyAction Challenge: Recognizing Real-world Low-resolution Activities
  in Videos
TinyAction Challenge: Recognizing Real-world Low-resolution Activities in Videos
Praveen Tirupattur
A. J. Rana
Tushar Sangam
Shruti Vyas
Yogesh S Rawat
M. Shah
14
6
0
24 Jul 2021
Self-supervised Representation Learning Framework for Remote
  Physiological Measurement Using Spatiotemporal Augmentation Loss
Self-supervised Representation Learning Framework for Remote Physiological Measurement Using Spatiotemporal Augmentation Loss
Hao Wang
Euijoon Ahn
Jinman Kim
29
46
0
16 Jul 2021
Multiview Pseudo-Labeling for Semi-supervised Learning from Video
Multiview Pseudo-Labeling for Semi-supervised Learning from Video
Bo Xiong
Haoqi Fan
Kristen Grauman
Christoph Feichtenhofer
SSL
22
49
0
01 Apr 2021
MDMMT: Multidomain Multimodal Transformer for Video Retrieval
MDMMT: Multidomain Multimodal Transformer for Video Retrieval
Maksim Dzabraev
M. Kalashnikov
Stepan Alekseevich Komkov
Aleksandr Petiushko
24
128
0
19 Mar 2021
Less is More: ClipBERT for Video-and-Language Learning via Sparse
  Sampling
Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling
Jie Lei
Linjie Li
Luowei Zhou
Zhe Gan
Tamara L. Berg
Joey Tianyi Zhou
Jingjing Liu
CLIP
46
647
0
11 Feb 2021
Video Action Understanding
Video Action Understanding
Matthew Hutchinson
V. Gadepally
40
20
0
13 Oct 2020
AViD Dataset: Anonymized Videos from Diverse Countries
AViD Dataset: Anonymized Videos from Diverse Countries
A. Piergiovanni
Michael S. Ryoo
33
35
0
10 Jul 2020
Previous
12