ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2012.10071
  4. Cited By
TDN: Temporal Difference Networks for Efficient Action Recognition

TDN: Temporal Difference Networks for Efficient Action Recognition

18 December 2020
Limin Wang
Zhan Tong
Bin Ji
Gangshan Wu
ArXivPDFHTML

Papers citing "TDN: Temporal Difference Networks for Efficient Action Recognition"

50 / 161 papers shown
Title
Temporally-Adaptive Models for Efficient Video Understanding
Temporally-Adaptive Models for Efficient Video Understanding
Ziyuan Huang
Shiwei Zhang
Liang Pan
Zhiwu Qing
Yingya Zhang
Ziwei Liu
Marcelo H. Ang
38
9
0
10 Aug 2023
View while Moving: Efficient Video Recognition in Long-untrimmed Videos
View while Moving: Efficient Video Recognition in Long-untrimmed Videos
Ye Tian
Meng Yang
Lanshan Zhang
Zhizhen Zhang
Yang Liu
Xiao-Zhu Xie
Xirong Que
Wendong Wang
24
7
0
09 Aug 2023
Sample Less, Learn More: Efficient Action Recognition via Frame Feature
  Restoration
Sample Less, Learn More: Efficient Action Recognition via Frame Feature Restoration
Harry Cheng
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Mohan S. Kankanhalli
37
7
0
27 Jul 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
What Can Simple Arithmetic Operations Do for Temporal Modeling?
Wenhao Wu
Yuxin Song
Zhun Sun
Jingdong Wang
Chang Xu
Wanli Ouyang
40
8
0
18 Jul 2023
Free-Form Composition Networks for Egocentric Action Recognition
Free-Form Composition Networks for Egocentric Action Recognition
Haoran Wang
Qinghua Cheng
Baosheng Yu
Yibing Zhan
Dapeng Tao
Liang Ding
Haibin Ling
EgoV
55
0
0
13 Jul 2023
MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic
  Facial Expression Recognition
MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition
Guoying Zhao
Zheng Lian
B. Liu
Jianhua Tao
37
17
0
05 Jul 2023
Make A Long Image Short: Adaptive Token Length for Vision Transformers
Make A Long Image Short: Adaptive Token Length for Vision Transformers
Yuqin Zhu
Yichen Zhu
ViT
72
17
0
05 Jul 2023
VideoComposer: Compositional Video Synthesis with Motion Controllability
VideoComposer: Compositional Video Synthesis with Motion Controllability
Xiang Wang
Hangjie Yuan
Shiwei Zhang
Dayou Chen
Jiuniu Wang
Yingya Zhang
Yujun Shen
Deli Zhao
Jingren Zhou
VGen
DiffM
33
316
0
03 Jun 2023
Teacher Agent: A Knowledge Distillation-Free Framework for
  Rehearsal-based Video Incremental Learning
Teacher Agent: A Knowledge Distillation-Free Framework for Rehearsal-based Video Incremental Learning
Shengqin Jiang
Yao-Huei Fang
Haokui Zhang
Qingshan Liu
Yuankai Qi
Yang Yang
Peifeng Wang
CLL
28
0
0
01 Jun 2023
VideoLLM: Modeling Video Sequence with Large Language Models
VideoLLM: Modeling Video Sequence with Large Language Models
Guo Chen
Yin-Dong Zheng
Jiahao Wang
Jilan Xu
Yifei Huang
...
Yi Wang
Yali Wang
Yu Qiao
Tong Lu
Limin Wang
MLLM
103
77
0
22 May 2023
MRSN: Multi-Relation Support Network for Video Action Detection
MRSN: Multi-Relation Support Network for Video Action Detection
Yin-Dong Zheng
Guo Chen
Minglei Yuan
Tong Lu
33
8
0
24 Apr 2023
Local-Global Temporal Difference Learning for Satellite Video Super-Resolution
Local-Global Temporal Difference Learning for Satellite Video Super-Resolution
Yi Xiao
Qiangqiang Yuan
Kui Jiang
Xianyu Jin
Jiang He
Lefei Zhang
Chia-Wen Lin
SupR
61
80
0
10 Apr 2023
MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot
  Action Recognition
MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot Action Recognition
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Changxin Gao
Yingya Zhang
Deli Zhao
Nong Sang
24
40
0
03 Apr 2023
Streaming Video Model
Streaming Video Model
Yucheng Zhao
Chong Luo
Chuanxin Tang
Dongdong Chen
Noel Codella
Zhengjun Zha
36
12
0
30 Mar 2023
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Limin Wang
Bingkun Huang
Zhiyu Zhao
Zhan Tong
Yinan He
Yi Wang
Yali Wang
Yu Qiao
VGen
71
326
0
29 Mar 2023
CycleACR: Cycle Modeling of Actor-Context Relations for Video Action
  Detection
CycleACR: Cycle Modeling of Actor-Context Relations for Video Action Detection
Lei Chen
Zhan Tong
Yibing Song
Gangshan Wu
Limin Wang
25
3
0
28 Mar 2023
Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Kunchang Li
Yali Wang
Yizhuo Li
Yi Wang
Yinan He
Limin Wang
Yu Qiao
VGen
57
155
0
28 Mar 2023
Frame Flexible Network
Frame Flexible Network
Yitian Zhang
Yue Bai
Chang Liu
Huan Wang
Sheng Li
Yun Fu
13
4
0
26 Mar 2023
Multi-view knowledge distillation transformer for human action
  recognition
Multi-view knowledge distillation transformer for human action recognition
Yi Lin
Vincent S. Tseng
ViT
26
1
0
25 Mar 2023
Enlarging Instance-specific and Class-specific Information for Open-set
  Action Recognition
Enlarging Instance-specific and Class-specific Information for Open-set Action Recognition
Jun Cen
Shiwei Zhang
Xiang Wang
Yixuan Pei
Zhiwu Qing
Yingya Zhang
Qifeng Chen
34
3
0
25 Mar 2023
Mutual Information-Based Temporal Difference Learning for Human Pose
  Estimation in Video
Mutual Information-Based Temporal Difference Learning for Human Pose Estimation in Video
Runyang Feng
Yixing Gao
Xueqi Ma
Tze Ho Elden Tse
H. Chang
3DH
44
21
0
15 Mar 2023
Maximizing Spatio-Temporal Entropy of Deep 3D CNNs for Efficient Video
  Recognition
Maximizing Spatio-Temporal Entropy of Deep 3D CNNs for Efficient Video Recognition
Junyan Wang
Zhenhong Sun
Yichen Qian
Dong Gong
Xiuyu Sun
Ming Lin
M. Pagnucco
Yang Song
3DPC
20
11
0
05 Mar 2023
Temporal Coherent Test-Time Optimization for Robust Video Classification
Temporal Coherent Test-Time Optimization for Robust Video Classification
Chenyu Yi
Siyuan Yang
Yufei Wang
Haoliang Li
Yap-Peng Tan
Alex C. Kot
TTA
27
12
0
28 Feb 2023
Video Action Recognition Collaborative Learning with Dynamics via
  PSO-ConvNet Transformer
Video Action Recognition Collaborative Learning with Dynamics via PSO-ConvNet Transformer
N. H. Phong
B. Ribeiro
29
15
0
17 Feb 2023
CMAE-V: Contrastive Masked Autoencoders for Video Action Recognition
CMAE-V: Contrastive Masked Autoencoders for Video Action Recognition
Cheng Lu
Xiaojie Jin
Zhicheng Huang
Qibin Hou
Mingg-Ming Cheng
Jiashi Feng
37
8
0
15 Jan 2023
Look, Listen, and Attack: Backdoor Attacks Against Video Action
  Recognition
Look, Listen, and Attack: Backdoor Attacks Against Video Action Recognition
Hasan Hammoud
Shuming Liu
Mohammad Alkhrashi
Fahad Albalawi
Guohao Li
AAML
32
8
0
03 Jan 2023
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition
  with Pre-trained Vision-Language Models
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
Wenhao Wu
Xiaohan Wang
Haipeng Luo
Jingdong Wang
Yi Yang
Wanli Ouyang
98
48
0
31 Dec 2022
An end-to-end multi-scale network for action prediction in videos
An end-to-end multi-scale network for action prediction in videos
Xiaofan Liu
Jianqin Yin
Yuanxi Sun
Zhicheng Zhang
Jin Tang
21
0
0
31 Dec 2022
Contextual Explainable Video Representation: Human Perception-based
  Understanding
Contextual Explainable Video Representation: Human Perception-based Understanding
Khoa T. Vo
Kashu Yamazaki
Phong H. Nguyen
Pha Nguyen
Khoa Luu
Ngan Le
21
9
0
12 Dec 2022
Masked Video Distillation: Rethinking Masked Feature Modeling for
  Self-supervised Video Representation Learning
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning
Rui Wang
Dongdong Chen
Zuxuan Wu
Yinpeng Chen
Xiyang Dai
Mengchen Liu
Lu Yuan
Yu-Gang Jiang
VGen
32
87
0
08 Dec 2022
VLG: General Video Recognition with Web Textual Knowledge
VLG: General Video Recognition with Web Textual Knowledge
Jintao Lin
Zhaoyang Liu
Wenhai Wang
Wayne Wu
Limin Wang
39
0
0
03 Dec 2022
Video Test-Time Adaptation for Action Recognition
Video Test-Time Adaptation for Action Recognition
Wei Lin
M. Jehanzeb Mirza
Mateusz Koziñski
Horst Possegger
Hilde Kuehne
Horst Bischof
TTA
47
31
0
24 Nov 2022
Mitigating and Evaluating Static Bias of Action Representations in the
  Background and the Foreground
Mitigating and Evaluating Static Bias of Action Representations in the Background and the Foreground
Haoxin Li
Yuan Liu
Hanwang Zhang
Boyang Li
30
15
0
23 Nov 2022
Dynamic Appearance: A Video Representation for Action Recognition with
  Joint Training
Dynamic Appearance: A Video Representation for Action Recognition with Joint Training
Guoxi Huang
A. Bors
27
1
0
23 Nov 2022
Look More but Care Less in Video Recognition
Look More but Care Less in Video Recognition
Yitian Zhang
Yue Bai
Haiquan Wang
Yi Xu
Yun Fu
27
9
0
18 Nov 2022
UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video
  UniFormer
UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer
Kunchang Li
Yali Wang
Yinan He
Yizhuo Li
Yi Wang
Limin Wang
Yu Qiao
ViT
30
107
0
17 Nov 2022
AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with
  Masked Autoencoders
AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders
W. G. C. Bandara
Naman Patel
A. Gholami
Mehdi Nikkhah
M. Agrawal
Vishal M. Patel
25
39
0
16 Nov 2022
Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands
  and Objects Challenge 2022
Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 2022
Yin-Dong Zheng
Guo Chen
Jiahao Wang
Tong Lu
Liming Wang
37
0
0
16 Nov 2022
Dynamic Temporal Filtering in Video Models
Dynamic Temporal Filtering in Video Models
Fuchen Long
Zhaofan Qiu
Yingwei Pan
Ting Yao
Chong-Wah Ngo
Tao Mei
AI4TS
24
17
0
15 Nov 2022
PointTAD: Multi-Label Temporal Action Detection with Learnable Query
  Points
PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points
Jing Tan
Xiaotong Zhao
Xintian Shi
Bingyi Kang
Limin Wang
17
26
0
20 Oct 2022
It Takes Two: Masked Appearance-Motion Modeling for Self-supervised
  Video Transformer Pre-training
It Takes Two: Masked Appearance-Motion Modeling for Self-supervised Video Transformer Pre-training
Yuxin Song
Min Yang
Wenhao Wu
Dongliang He
Fu Li
Jingdong Wang
ViT
97
8
0
11 Oct 2022
Self-supervised Video Representation Learning with Motion-Aware Masked
  Autoencoders
Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders
Haosen Yang
Deng Huang
Bin Wen
Jiannan Wu
H. Yao
Yi-Xin Jiang
Xiatian Zhu
Zehuan Yuan
37
19
0
09 Oct 2022
Heterogeneous Recurrent Spiking Neural Network for Spatio-Temporal
  Classification
Heterogeneous Recurrent Spiking Neural Network for Spatio-Temporal Classification
Biswadeep Chakraborty
Saibal Mukhopadhyay
31
20
0
22 Sep 2022
ViA: View-invariant Skeleton Action Representation Learning via Motion
  Retargeting
ViA: View-invariant Skeleton Action Representation Learning via Motion Retargeting
Di Yang
Yaohui Wang
A. Dantcheva
Lorenzo Garattoni
Gianpiero Francesca
F. Brémond
27
8
0
31 Aug 2022
Motion Sensitive Contrastive Learning for Self-supervised Video
  Representation
Motion Sensitive Contrastive Learning for Self-supervised Video Representation
Jingcheng Ni
Nana Zhou
Jie Qin
Qianrun Wu
Junqi Liu
Boxun Li
Di Huang
SSL
42
16
0
12 Aug 2022
Privacy-Preserving Action Recognition via Motion Difference Quantization
Privacy-Preserving Action Recognition via Motion Difference Quantization
Sudhakar Kumawat
Hajime Nagahara
MQ
16
20
0
04 Aug 2022
GROWN+UP: A Graph Representation Of a Webpage Network Utilizing
  Pre-training
GROWN+UP: A Graph Representation Of a Webpage Network Utilizing Pre-training
Benedict Yeoh
Huijuan Wang
GNN
31
1
0
03 Aug 2022
Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training
  Framework for Temporal Grounding
Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding
Jiachang Hao
Haifeng Sun
Pengfei Ren
Jingyu Wang
Q. Qi
J. Liao
31
26
0
29 Jul 2022
Spatiotemporal Self-attention Modeling with Temporal Patch Shift for
  Action Recognition
Spatiotemporal Self-attention Modeling with Temporal Patch Shift for Action Recognition
Wangmeng Xiang
Chong Li
Biao Wang
Xihan Wei
Xiangpei Hua
Lei Zhang
ViT
30
27
0
27 Jul 2022
MAR: Masked Autoencoders for Efficient Action Recognition
MAR: Masked Autoencoders for Efficient Action Recognition
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Xiang Wang
Yuehuang Wang
Yiliang Lv
Changxin Gao
Nong Sang
32
42
0
24 Jul 2022
Previous
1234
Next