Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.06647
Cited By
Video Action Understanding
13 October 2020
Matthew Hutchinson
V. Gadepally
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Video Action Understanding"
14 / 14 papers shown
Title
S3Aug: Segmentation, Sampling, and Shift for Action Recognition
Taiki Sugiura
Toru Tamaki
AI4TS
22
2
0
23 Oct 2023
VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
Kashu Yamazaki
Khoa T. Vo
Sang Truong
Bhiksha Raj
Ngan Le
26
35
0
28 Nov 2022
Object-ABN: Learning to Generate Sharp Attention Maps for Action Recognition
Tomoya Nitta
Tsubasa Hirakawa
H. Fujiyoshi
Toru Tamaki
58
0
0
27 Jul 2022
Model-agnostic Multi-Domain Learning with Domain-Specific Adapters for Action Recognition
Kazuki Omi
Jun Kimata
Toru Tamaki
21
7
0
15 Apr 2022
ObjectMix: Data Augmentation by Copy-Pasting Objects in Videos for Action Recognition
Jun Kimata
Tomoya Nitta
Toru Tamaki
29
10
0
01 Apr 2022
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
280
1,982
0
09 Feb 2021
Video Transformer Network
Daniel Neimark
Omri Bar
Maya Zohar
Dotan Asselmann
ViT
198
422
0
01 Feb 2021
Fixing the train-test resolution discrepancy: FixEfficientNet
Hugo Touvron
Andrea Vedaldi
Matthijs Douze
Hervé Jégou
AAML
196
110
0
18 Mar 2020
Grouped Spatial-Temporal Aggregation for Efficient Action Recognition
Chenxu Luo
Alan Yuille
127
150
0
28 Sep 2019
Gaussian Temporal Awareness Networks for Action Localization
Fuchen Long
Ting Yao
Zhaofan Qiu
Xinmei Tian
Jiebo Luo
Tao Mei
143
319
0
09 Sep 2019
BSN: Boundary Sensitive Network for Temporal Action Proposal Generation
Tianwei Lin
Xu Zhao
Haisheng Su
Chongjing Wang
Ming Yang
139
700
0
08 Jun 2018
ECO: Efficient Convolutional Network for Online Video Understanding
Mohammadreza Zolfaghari
Kamaljeet Singh
Thomas Brox
130
496
0
24 Apr 2018
Hierarchical Deep Temporal Models for Group Activity Recognition
Mostafa S. Ibrahim
S. Muralidharan
Zhiwei Deng
Arash Vahdat
Greg Mori
77
445
0
09 Jul 2016
Learning Human Activities and Object Affordances from RGB-D Videos
H. Koppula
Rudhir Gupta
Ashutosh Saxena
96
727
0
04 Oct 2012
1