ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.10305
  4. Cited By
Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks

Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks

28 November 2017
Zhaofan Qiu
Ting Yao
Tao Mei
ArXivPDFHTML

Papers citing "Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks"

50 / 263 papers shown
Title
DCAN: Improving Temporal Action Detection via Dual Context Aggregation
DCAN: Improving Temporal Action Detection via Dual Context Aggregation
Guo Chen
Yin-Dong Zheng
Limin Wang
Tong Lu
AI4TS
31
70
0
07 Dec 2021
MViTv2: Improved Multiscale Vision Transformers for Classification and
  Detection
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection
Yanghao Li
Chaoxia Wu
Haoqi Fan
K. Mangalam
Bo Xiong
Jitendra Malik
Christoph Feichtenhofer
ViT
75
679
0
02 Dec 2021
Modeling Temporal Concept Receptive Field Dynamically for Untrimmed
  Video Analysis
Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis
Zhaobo Qi
Shuhui Wang
Chi Su
Li Su
Weigang Zhang
Qingming Huang
27
10
0
23 Nov 2021
Will You Ever Become Popular? Learning to Predict Virality of Dance
  Clips
Will You Ever Become Popular? Learning to Predict Virality of Dance Clips
Jiahao Wang
Yunhong Wang
Nina Weng
Tianrui Chai
Annan Li
Faxi Zhang
Sansi Yu
27
13
0
06 Nov 2021
Multi-Task and Multi-Modal Learning for RGB Dynamic Gesture Recognition
Multi-Task and Multi-Modal Learning for RGB Dynamic Gesture Recognition
Dinghao Fan
Hengjie Lu
Shugong Xu
Shan Cao
32
15
0
29 Oct 2021
Temporal-attentive Covariance Pooling Networks for Video Recognition
Temporal-attentive Covariance Pooling Networks for Video Recognition
Zilin Gao
Qilong Wang
Bingbing Zhang
Q. Hu
P. Li
21
25
0
27 Oct 2021
Using Motion History Images with 3D Convolutional Networks in Isolated
  Sign Language Recognition
Using Motion History Images with 3D Convolutional Networks in Isolated Sign Language Recognition
Hamed Valizadegan
D. Caldwell
SLR
24
48
0
24 Oct 2021
TEAM-Net: Multi-modal Learning for Video Action Recognition with Partial
  Decoding
TEAM-Net: Multi-modal Learning for Video Action Recognition with Partial Decoding
Zhengwei Wang
Qi She
A. Smolic
21
9
0
17 Oct 2021
TAda! Temporally-Adaptive Convolutions for Video Understanding
TAda! Temporally-Adaptive Convolutions for Video Understanding
Ziyuan Huang
Shiwei Zhang
Liang Pan
Zhiwu Qing
Mingqian Tang
Ziwei Liu
M. Ang
53
49
0
12 Oct 2021
Video Is Graph: Structured Graph Module for Video Action Recognition
Video Is Graph: Structured Graph Module for Video Action Recognition
Rongjie Li
Xiaojun Wu
Tianyang Xu
46
12
0
12 Oct 2021
SignBERT: Pre-Training of Hand-Model-Aware Representation for Sign
  Language Recognition
SignBERT: Pre-Training of Hand-Model-Aware Representation for Sign Language Recognition
Hezhen Hu
Weichao Zhao
Wen-gang Zhou
Yuechen Wang
Houqiang Li
ViT
32
63
0
11 Oct 2021
Deep Learning-based Action Detection in Untrimmed Videos: A Survey
Deep Learning-based Action Detection in Untrimmed Videos: A Survey
Elahe Vahdani
Yingli Tian
52
60
0
30 Sep 2021
Comparative Validation of Machine Learning Algorithms for Surgical
  Workflow and Skill Analysis with the HeiChole Benchmark
Comparative Validation of Machine Learning Algorithms for Surgical Workflow and Skill Analysis with the HeiChole Benchmark
M. Wagner
Beat-Peter Müller-Stich
A. Kisilenko
Duc Tran
P. Heger
...
M. Frankenberg
F. Mathis-Ullrich
Lena Maier-Hein
Stefanie Speidel
S. Bodenstedt
30
67
0
30 Sep 2021
TSM: Temporal Shift Module for Efficient and Scalable Video
  Understanding on Edge Device
TSM: Temporal Shift Module for Efficient and Scalable Video Understanding on Edge Device
Ji Lin
Chuang Gan
Kuan-Chieh Jackson Wang
Song Han
40
64
0
27 Sep 2021
A survey on deep learning approaches for breast cancer diagnosis
A survey on deep learning approaches for breast cancer diagnosis
Timothy C. H. Kwong
S. Mazaheri
MedIm
30
4
0
18 Sep 2021
Asymmetric 3D Context Fusion for Universal Lesion Detection
Asymmetric 3D Context Fusion for Universal Lesion Detection
Jiancheng Yang
Yi He
Kaiming Kuang
Zudi Lin
Hanspeter Pfister
Bingbing Ni
3DPC
MedIm
35
22
0
17 Sep 2021
LIGAR: Lightweight General-purpose Action Recognition
LIGAR: Lightweight General-purpose Action Recognition
Evgeny Izutov
15
3
0
30 Aug 2021
Separable Convolutions for Optimizing 3D Stereo Networks
Separable Convolutions for Optimizing 3D Stereo Networks
Rafia Rahim
Faranak Shamsafar
A. Zell
3DPC
3DV
29
8
0
23 Aug 2021
MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching
MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching
Faranak Shamsafar
Samuel Woerz
Rafia Rahim
A. Zell
3DV
27
85
0
22 Aug 2021
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action
  Recognition
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition
Jiawei Chen
C. Ho
ViT
26
77
0
20 Aug 2021
Efficient Light Field Reconstruction via Spatio-Angular Dense Network
Efficient Light Field Reconstruction via Spatio-Angular Dense Network
Zexi Hu
H. W. F. Yeung
Xiaoming Chen
Yuk Ying Chung
Haisheng Li
3DV
26
16
0
08 Aug 2021
Token Shift Transformer for Video Classification
Token Shift Transformer for Video Classification
Hao Zhang
Y. Hao
Chong-Wah Ngo
ViT
32
116
0
05 Aug 2021
Spatio-Temporal Representation Factorization for Video-based Person
  Re-Identification
Spatio-Temporal Representation Factorization for Video-based Person Re-Identification
Abhishek Aich
Meng Zheng
Srikrishna Karanam
Terrence Chen
Amit K. Roy-Chowdhury
Ziyan Wu
37
70
0
25 Jul 2021
Adaptive Recursive Circle Framework for Fine-grained Action Recognition
Adaptive Recursive Circle Framework for Fine-grained Action Recognition
Hanxi Lin
Xinxiao Wu
Jiebo Luo
25
1
0
25 Jul 2021
EAN: Event Adaptive Network for Enhanced Action Recognition
EAN: Event Adaptive Network for Enhanced Action Recognition
Yuan Tian
Yichao Yan
Guangtao Zhai
G. Guo
Zhiyong Gao
35
41
0
22 Jul 2021
TA2N: Two-Stage Action Alignment Network for Few-shot Action Recognition
TA2N: Two-Stage Action Alignment Network for Few-shot Action Recognition
Shuyuan Li
Huabin Liu
Rui Qian
Yuxi Li
John See
Mengjuan Fei
Xiaoyuan Yu
W. Lin
23
75
0
10 Jul 2021
Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers
Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers
Mandela Patrick
Dylan Campbell
Yuki M. Asano
Ishan Misra
Ishan Misra Florian Metze
Christoph Feichtenhofer
Andrea Vedaldi
João F. Henriques
30
274
0
09 Jun 2021
How to Design a Three-Stage Architecture for Audio-Visual Active Speaker
  Detection in the Wild
How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild
Okan Kopuklu
Maja Taseska
Gerhard Rigoll
3DV
29
45
0
07 Jun 2021
CT-Net: Channel Tensorization Network for Video Classification
CT-Net: Channel Tensorization Network for Video Classification
Kunchang Li
Xianhang Li
Yali Wang
Jun Wang
Yu Qiao
ViT
30
55
0
03 Jun 2021
DSANet: Dynamic Segment Aggregation Network for Video-Level
  Representation Learning
DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning
Wenhao Wu
Yuxiang Zhao
Yanwu Xu
Xiao Tan
Dongliang He
...
Jinxing Ye
Yingying Li
Mingde Yao
Zichao Dong
Yifeng Shi
AI4TS
30
27
0
25 May 2021
Temporal Action Proposal Generation with Transformers
Temporal Action Proposal Generation with Transformers
Lining Wang
Haosen Yang
Wenhao Wu
H. Yao
Hujie Huang
ViT
38
27
0
25 May 2021
VPN++: Rethinking Video-Pose embeddings for understanding Activities of
  Daily Living
VPN++: Rethinking Video-Pose embeddings for understanding Activities of Daily Living
Srijan Das
Rui Dai
Di Yang
F. Brémond
ViT
43
67
0
17 May 2021
MutualNet: Adaptive ConvNet via Mutual Learning from Different Model
  Configurations
MutualNet: Adaptive ConvNet via Mutual Learning from Different Model Configurations
Taojiannan Yang
Sijie Zhu
Matías Mendieta
Pu Wang
Ravikumar Balakrishnan
Minwoo Lee
T. Han
M. Shah
Chong Chen
3DH
OOD
28
23
0
14 May 2021
REGINA - Reasoning Graph Convolutional Networks in Human Action
  Recognition
REGINA - Reasoning Graph Convolutional Networks in Human Action Recognition
Bruno Degardin
Vasco Lopes
Hugo Proencca
3DH
GNN
38
10
0
14 May 2021
BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video
  Person Re-Identification
BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video Person Re-Identification
Rui Hou
Hong Chang
Bingpeng Ma
Rui Huang
Shiguang Shan
27
85
0
30 Apr 2021
Multiscale Vision Transformers
Multiscale Vision Transformers
Haoqi Fan
Bo Xiong
K. Mangalam
Yanghao Li
Zhicheng Yan
Jitendra Malik
Christoph Feichtenhofer
ViT
63
1,224
0
22 Apr 2021
MGSampler: An Explainable Sampling Strategy for Video Action Recognition
MGSampler: An Explainable Sampling Strategy for Video Action Recognition
Yuan Zhi
Zhan Tong
Limin Wang
Gangshan Wu
TTA
19
72
0
20 Apr 2021
Writing in The Air: Unconstrained Text Recognition from Finger Movement
  Using Spatio-Temporal Convolution
Writing in The Air: Unconstrained Text Recognition from Finger Movement Using Spatio-Temporal Convolution
Ue-Hwan Kim
Yewon Hwang
Sun-Kyung Lee
Jong-Hwan Kim
33
19
0
19 Apr 2021
Object-Centric Representation Learning for Video Question Answering
Object-Centric Representation Learning for Video Question Answering
Long Hoang Dang
T. Le
Vuong Le
T. Tran
27
7
0
12 Apr 2021
Adaptive Configuration of In Situ Lossy Compression for Cosmology
  Simulations via Fine-Grained Rate-Quality Modeling
Adaptive Configuration of In Situ Lossy Compression for Cosmology Simulations via Fine-Grained Rate-Quality Modeling
Sian Jin
Jesus Pulido
Pascal Grosset
Jiannan Tian
Dingwen Tao
J. Ahrens
33
22
0
01 Apr 2021
Graph-based Facial Affect Analysis: A Review
Graph-based Facial Affect Analysis: A Review
Yang Liu
Xingming Zhang
Yante Li
Jinzhao Zhou
Xin-hui Li
Guoying Zhao
CVBM
46
24
0
29 Mar 2021
Busy-Quiet Video Disentangling for Video Classification
Busy-Quiet Video Disentangling for Video Classification
Guoxi Huang
A. Bors
28
6
0
29 Mar 2021
Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action
  Localization
Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization
Mengmeng Xu
Juan-Manuel Perez-Rua
Xiatian Zhu
Guohao Li
Brais Martinez
17
27
0
28 Mar 2021
Temporal Context Aggregation Network for Temporal Action Proposal
  Refinement
Temporal Context Aggregation Network for Temporal Action Proposal Refinement
Zhiwu Qing
Haisheng Su
Weihao Gan
Dongliang Wang
Wei Wu
Xiang Wang
Yu Qiao
Junjie Yan
Changxin Gao
Nong Sang
30
173
0
24 Mar 2021
Learning Salient Boundary Feature for Anchor-free Temporal Action
  Localization
Learning Salient Boundary Feature for Anchor-free Temporal Action Localization
Chuming Lin
C. Xu
Donghao Luo
Yabiao Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Yanwei Fu
35
250
0
24 Mar 2021
AdaSGN: Adapting Joint Number and Model Size for Efficient
  Skeleton-Based Action Recognition
AdaSGN: Adapting Joint Number and Model Size for Efficient Skeleton-Based Action Recognition
Lei Shi
Yifan Zhang
Jian Cheng
Hanqing Lu
30
46
0
22 Mar 2021
ACTION-Net: Multipath Excitation for Action Recognition
ACTION-Net: Multipath Excitation for Action Recognition
Zhengwei Wang
Qi She
A. Smolic
3DPC
39
165
0
11 Mar 2021
TransMed: Transformers Advance Multi-modal Medical Image Classification
TransMed: Transformers Advance Multi-modal Medical Image Classification
Yin Dai
Yifan Gao
ViT
MedIm
38
281
0
10 Mar 2021
Time and Frequency Network for Human Action Detection in Videos
Time and Frequency Network for Human Action Detection in Videos
Changhai Li
Huawei Chen
Jingqing Lu
Yang Huang
Yingying Liu
3DH
AI4TS
13
2
0
08 Mar 2021
Less is More: ClipBERT for Video-and-Language Learning via Sparse
  Sampling
Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling
Jie Lei
Linjie Li
Luowei Zhou
Zhe Gan
Tamara L. Berg
Joey Tianyi Zhou
Jingjing Liu
CLIP
46
647
0
11 Feb 2021
Previous
123456
Next