Multi-view Action Recognition via Directed Gromov-Wasserstein Discrepancy

2 May 2024

Papers citing "Multi-view Action Recognition via Directed Gromov-Wasserstein Discrepancy"

33 / 33 papers shown

Title
FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding Thanh-Dat Truong Utsav Prabhu Bhiksha Raj Jackson Cothren Khoa Luu CLL 100 3 0 27 Nov 2023
SoGAR: Self-supervised Spatiotemporal Attention-based Social Group Activity Recognition N. V. R. Chappa Pha Nguyen Alec Nelson Han-Seok Seo Xin Li P. Dobbs Khoa Luu ViT 44 8 0 27 Apr 2023
Adapting Self-Supervised Vision Transformers by Probing Attention-Conditioned Masking Consistency Viraj Prabhu Sriram Yenamandra Aaditya K. Singh Judy Hoffman 57 14 0 16 Jun 2022
DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition Thanh-Dat Truong Quoc-Huy Bui C. Duong Han-Seok Seo Son Lam Phung Xin Li Khoa Luu ViT 72 49 0 19 Mar 2022
ActionFormer: Localizing Moments of Actions with Transformers Chen-Da Liu-Zhang Jianxin Wu Yin Li ViT 59 336 0 16 Feb 2022
Neural Fields in Visual Computing and Beyond Yiheng Xie Towaki Takikawa Shunsuke Saito Or Litany Shiqin Yan Numair Khan Federico Tombari James Tompkin Vincent Sitzmann Srinath Sridhar 3DH 144 623 0 22 Nov 2021
Video Swin Transformer Ze Liu Jia Ning Yue Cao Yixuan Wei Zheng Zhang Stephen Lin Han Hu ViT 84 1,458 0 24 Jun 2021
DyGLIP: A Dynamic Graph Model with Link Prediction for Accurate Multi-Camera Multiple Object Tracking Kha Gia Quach Pha Nguyen Huu Le Thanh-Dat Truong C. Duong M. Tran Khoa Luu 44 53 0 12 Jun 2021
Multiscale Vision Transformers Haoqi Fan Bo Xiong K. Mangalam Yanghao Li Zhicheng Yan Jitendra Malik Christoph Feichtenhofer ViT 125 1,248 0 22 Apr 2021
ViViT: A Video Vision Transformer Anurag Arnab Mostafa Dehghani G. Heigold Chen Sun Mario Lucic Cordelia Schmid ViT 166 2,119 0 29 Mar 2021
Is Space-Time Attention All You Need for Video Understanding? Gedas Bertasius Heng Wang Lorenzo Torresani ViT 345 2,016 0 09 Feb 2021
Human Action Recognition from Various Data Modalities: A Review Zehua Sun Qiuhong Ke Hossein Rahmani Mohammed Bennamoun Gang Wang Jun Liu MU 80 514 0 22 Dec 2020
INeRF: Inverting Neural Radiance Fields for Pose Estimation Yen-Chen Lin Peter R. Florence Jonathan T. Barron Alberto Rodriguez Phillip Isola Nayeon Lee 105 451 0 10 Dec 2020
Nerfies: Deformable Neural Radiance Fields Keunhong Park U. Sinha Jonathan T. Barron Sofien Bouaziz Dan B. Goldman S. M. Seitz Ricardo Martín Brualla 3DH 122 1,123 0 25 Nov 2020
NeRF++: Analyzing and Improving Neural Radiance Fields Kai Zhang Gernot Riegler Noah Snavely V. Koltun 73 1,035 0 15 Oct 2020
GRF: Learning a General Radiance Field for 3D Representation and Rendering Alex Trevithick Bo Yang 3DV 3DH 60 230 0 09 Oct 2020
MotionSqueeze: Neural Motion Feature Learning for Video Understanding Heeseung Kwon Manjin Kim Suha Kwak Minsu Cho FAtt 71 128 0 20 Jul 2020
More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation Quanfu Fan Chun-Fu Chen Hilde Kuehne Marco Pistoia David D. Cox 68 126 0 02 Dec 2019
A Short Note on the Kinetics-700 Human Action Dataset João Carreira Eric Noland Chloe Hillier Andrew Zisserman 64 446 0 15 Jul 2019
NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding Jun Liu Amir Shahroudy Mauricio Perez G. Wang Ling-yu Duan Alex C. Kot 75 1,268 0 12 May 2019
BSN: Boundary Sensitive Network for Temporal Action Proposal Generation Tianwei Lin Xu Zhao Haisheng Su Chongjing Wang Ming Yang 192 700 0 08 Jun 2018
Deep Co-Training for Semi-Supervised Image Recognition Siyuan Qiao Wei Shen Zhishuai Zhang Bo Wang Alan Yuille 52 446 0 15 Mar 2018
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions Chunhui Gu Chen Sun David A. Ross Carl Vondrick C. Pantofaru ... G. Toderici Susanna Ricco Rahul Sukthankar Cordelia Schmid Jitendra Malik VGen 94 1,021 0 23 May 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset João Carreira Andrew Zisserman 212 7,961 0 22 May 2017
Temporal Segment Networks for Action Recognition in Videos Limin Wang Yuanjun Xiong Zhe Wang Yu Qiao Dahua Lin Xiaoou Tang Luc Van Gool ViT 101 807 0 08 May 2017
Rotation equivariant vector field networks Diego Marcos Michele Volpi N. Komodakis D. Tuia 58 269 0 29 Dec 2016
Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer Sergey Zagoruyko N. Komodakis 108 2,561 0 12 Dec 2016
Aggregated Residual Transformations for Deep Neural Networks Saining Xie Ross B. Girshick Piotr Dollár Zhuowen Tu Kaiming He 463 10,281 0 16 Nov 2016
Layer Normalization Jimmy Lei Ba J. Kiros Geoffrey E. Hinton 312 10,412 0 21 Jul 2016
SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size F. Iandola Song Han Matthew W. Moskewicz Khalid Ashraf W. Dally Kurt Keutzer 132 7,448 0 24 Feb 2016
Learning Deep Features for Discriminative Localization Bolei Zhou A. Khosla Àgata Lapedriza A. Oliva Antonio Torralba SSL SSeg FAtt 210 9,280 0 14 Dec 2015
Two-Stream Convolutional Networks for Action Recognition in Videos Karen Simonyan Andrew Zisserman 231 7,518 0 09 Jun 2014
Sinkhorn Distances: Lightspeed Computation of Optimal Transportation Distances Marco Cuturi OT 166 4,210 0 04 Jun 2013