2405.01337
Cited By
Multi-view Action Recognition via Directed Gromov-Wasserstein Discrepancy
2 May 2024
Hoang-Quan Nguyen
Thanh-Dat Truong
Khoa Luu
Papers citing
"Multi-view Action Recognition via Directed Gromov-Wasserstein Discrepancy"
33 / 33 papers shown
Title
FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding
Thanh-Dat Truong
Utsav Prabhu
Bhiksha Raj
Jackson Cothren
Khoa Luu
CLL
100
3
0
27 Nov 2023
SoGAR: Self-supervised Spatiotemporal Attention-based Social Group Activity Recognition
N. V. R. Chappa
Pha Nguyen
Alec Nelson
Han-Seok Seo
Xin Li
P. Dobbs
Khoa Luu
ViT
44
8
0
27 Apr 2023
Adapting Self-Supervised Vision Transformers by Probing Attention-Conditioned Masking Consistency
Viraj Prabhu
Sriram Yenamandra
Aaditya K. Singh
Judy Hoffman
57
14
0
16 Jun 2022
DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition
Thanh-Dat Truong
Quoc-Huy Bui
C. Duong
Han-Seok Seo
Son Lam Phung
Xin Li
Khoa Luu
ViT
72
49
0
19 Mar 2022
ActionFormer: Localizing Moments of Actions with Transformers
Chen-Lin Zhang
Jianxin Wu
Yin Li
ViT
59
336
0
16 Feb 2022
Neural Fields in Visual Computing and Beyond
Yiheng Xie
Towaki Takikawa
Shunsuke Saito
Or Litany
Shiqin Yan
Numair Khan
Federico Tombari
James Tompkin
Vincent Sitzmann
Srinath Sridhar
3DH
144
623
0
22 Nov 2021
Video Swin Transformer
Ze Liu
Jia Ning
Yue Cao
Yixuan Wei
Zheng Zhang
Stephen Lin
Han Hu
ViT
84
1,458
0
24 Jun 2021
DyGLIP: A Dynamic Graph Model with Link Prediction for Accurate Multi-Camera Multiple Object Tracking
Kha Gia Quach
Pha Nguyen
Huu Le
Thanh-Dat Truong
C. Duong
M. Tran
Khoa Luu
44
53
0
12 Jun 2021
Multiscale Vision Transformers
Haoqi Fan
Bo Xiong
K. Mangalam
Yanghao Li
Zhicheng Yan
Jitendra Malik
Christoph Feichtenhofer
ViT
125
1,248
0
22 Apr 2021
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
166
2,119
0
29 Mar 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
345
2,016
0
09 Feb 2021
Human Action Recognition from Various Data Modalities: A Review
Zehua Sun
Qiuhong Ke
Hossein Rahmani
Mohammed Bennamoun
Gang Wang
Jun Liu
MU
80
514
0
22 Dec 2020
iNeRF: Inverting Neural Radiance Fields for Pose Estimation
Yen-Chen Lin
Peter R. Florence
Jonathan T. Barron
Alberto Rodriguez
Phillip Isola
Tsung-Yi Lin
105
451
0
10 Dec 2020
Nerfies: Deformable Neural Radiance Fields
Keunhong Park
U. Sinha
Jonathan T. Barron
Sofien Bouaziz
Dan B. Goldman
S. M. Seitz
Ricardo Martín Brualla
3DH
122
1,123
0
25 Nov 2020
NeRF++: Analyzing and Improving Neural Radiance Fields
Kai Zhang
Gernot Riegler
Noah Snavely
V. Koltun
73
1,035
0
15 Oct 2020
GRF: Learning a General Radiance Field for 3D Representation and Rendering
Alex Trevithick
Bo Yang
3DV
3DH
60
230
0
09 Oct 2020
MotionSqueeze: Neural Motion Feature Learning for Video Understanding
Heeseung Kwon
Manjin Kim
Suha Kwak
Minsu Cho
FAtt
71
128
0
20 Jul 2020
More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation
Quanfu Fan
Chun-Fu Chen
Hilde Kuehne
Marco Pistoia
David D. Cox
68
126
0
02 Dec 2019
A Short Note on the Kinetics-700 Human Action Dataset
João Carreira
Eric Noland
Chloe Hillier
Andrew Zisserman
64
446
0
15 Jul 2019
NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding
Jun Liu
Amir Shahroudy
Mauricio Perez
G. Wang
Ling-yu Duan
Alex C. Kot
75
1,268
0
12 May 2019
BSN: Boundary Sensitive Network for Temporal Action Proposal Generation
Tianwei Lin
Xu Zhao
Haisheng Su
Chongjing Wang
Ming Yang
192
700
0
08 Jun 2018
Deep Co-Training for Semi-Supervised Image Recognition
Siyuan Qiao
Wei Shen
Zhishuai Zhang
Bo Wang
Alan Yuille
52
446
0
15 Mar 2018
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Chunhui Gu
Chen Sun
David A. Ross
Carl Vondrick
C. Pantofaru
...
G. Toderici
Susanna Ricco
Rahul Sukthankar
Cordelia Schmid
Jitendra Malik
VGen
94
1,021
0
23 May 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
João Carreira
Andrew Zisserman
212
7,961
0
22 May 2017
Temporal Segment Networks for Action Recognition in Videos
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
101
807
0
08 May 2017
Rotation equivariant vector field networks
Diego Marcos
Michele Volpi
N. Komodakis
D. Tuia
58
269
0
29 Dec 2016
Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer
Sergey Zagoruyko
N. Komodakis
108
2,561
0
12 Dec 2016
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Zhuowen Tu
Kaiming He
463
10,281
0
16 Nov 2016
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
312
10,412
0
21 Jul 2016
SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size
F. Iandola
Song Han
Matthew W. Moskewicz
Khalid Ashraf
W. Dally
Kurt Keutzer
132
7,448
0
24 Feb 2016
Learning Deep Features for Discriminative Localization
Bolei Zhou
A. Khosla
Àgata Lapedriza
A. Oliva
Antonio Torralba
SSL
SSeg
FAtt
210
9,280
0
14 Dec 2015
Two-Stream Convolutional Networks for Action Recognition in Videos
Karen Simonyan
Andrew Zisserman
231
7,518
0
09 Jun 2014
Sinkhorn Distances: Lightspeed Computation of Optimal Transportation Distances
Marco Cuturi
OT
166
4,210
0
04 Jun 2013