Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.08680
Cited By
v1
v2
v3 (latest)
Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer
19 April 2022
Wang Zeng
Sheng Jin
Wentao Liu
Chao Qian
Ping Luo
Ouyang Wanli
Xiaogang Wang
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Github (240★)
Papers citing
"Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer"
50 / 64 papers shown
Title
Image Recognition with Online Lightweight Vision Transformer: A Survey
Zherui Zhang
Rongtao Xu
Jie Zhou
Changwei Wang
Xingtian Pei
...
Jiguang Zhang
Li Guo
Longxiang Gao
Wenyuan Xu
Shibiao Xu
ViT
475
0
0
06 May 2025
HGFormer: Topology-Aware Vision Transformer with HyperGraph Learning
Hao Wang
Shuo Zhang
Biao Leng
ViT
257
1
0
03 Apr 2025
Rethinking Early-Fusion Strategies for Improved Multimodal Image Segmentation
Zhengwen Shen
Yulian Li
Han Zhang
Yuchen Weng
Jun Wang
81
0
0
19 Jan 2025
Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network
Xinyi Zhang
Qiqi Bao
Qinpeng Cui
Wenming Yang
Qingmin Liao
3DH
Mamba
82
2
0
06 Aug 2024
PAFUSE: Part-based Diffusion for 3D Whole-Body Pose Estimation
Nermin Samet
Cédric Rommel
David Picard
Eduardo Valle
DiffM
83
0
0
14 Jul 2024
HRFormer: High-Resolution Transformer for Dense Prediction
Yuhui Yuan
Rao Fu
Lang Huang
Weihong Lin
Chao Zhang
Xilin Chen
Jingdong Wang
ViT
87
233
0
18 Oct 2021
Learning to Regress Bodies from Images using Differentiable Semantic Rendering
Sai Kumar Dwivedi
Nikos Athanasiou
Muhammed Kocabas
Michael J. Black
3DH
71
55
0
07 Oct 2021
PnP-DETR: Towards Efficient Visual Analysis with Transformers
Tao Wang
Li Yuan
Yunpeng Chen
Jiashi Feng
Shuicheng Yan
ViT
68
87
0
15 Sep 2021
Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images
Size Wu
Sheng Jin
Wentao Liu
Lei Bai
Chao Qian
Dong Liu
Wanli Ouyang
3DH
69
47
0
13 Sep 2021
Vision Transformer with Progressive Sampling
Xiaoyu Yue
Shuyang Sun
Zhanghui Kuang
Meng Wei
Philip Torr
Wayne Zhang
Dahua Lin
ViT
78
85
0
03 Aug 2021
Human Pose Regression with Residual Log-likelihood Estimation
Jiefeng Li
Siyuan Bian
Ailing Zeng
Can Wang
Bo Pang
Wentao Liu
Cewu Lu
54
197
0
23 Jul 2021
DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
Yongming Rao
Wenliang Zhao
Benlin Liu
Jiwen Lu
Jie Zhou
Cho-Jui Hsieh
ViT
90
699
0
03 Jun 2021
ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search
Lumin Xu
Yingda Guan
Sheng Jin
Wentao Liu
Chao Qian
Ping Luo
Wanli Ouyang
Xiaogang Wang
74
53
0
21 May 2021
When Human Pose Estimation Meets Robustness: Adversarial Algorithms and Benchmarks
Jiahang Wang
Sheng Jin
Wentao Liu
Weizhong Liu
Chao Qian
Ping Luo
AAML
58
58
0
13 May 2021
Evaluating Neural Word Embeddings for Sanskrit
Kevin Qinghong Lin
Om Adideva
Digumarthi Komal
Laxmidhar Behera
Pawan Goyal
84
12
0
01 Apr 2021
CvT: Introducing Convolutions to Vision Transformers
Haiping Wu
Bin Xiao
Noel Codella
Mengchen Liu
Xiyang Dai
Lu Yuan
Lei Zhang
ViT
152
1,910
0
29 Mar 2021
TFPose: Direct Human Pose Estimation with Transformers
Wei Mao
Yongtao Ge
Chunhua Shen
Zhi Tian
Xinlong Wang
Zhibin Wang
ViT
84
88
0
29 Mar 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
453
21,439
0
25 Mar 2021
3D Human Pose Estimation with Spatial and Temporal Transformers
Ce Zheng
Sijie Zhu
Matías Mendieta
Taojiannan Yang
Chong Chen
Zhengming Ding
ViT
123
452
0
18 Mar 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
530
3,724
0
24 Feb 2021
Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Li-xin Yuan
Yunpeng Chen
Tao Wang
Weihao Yu
Yujun Shi
Zihang Jiang
Francis E. H. Tay
Jiashi Feng
Shuicheng Yan
ViT
133
1,939
0
28 Jan 2021
Training data-efficient image transformers & distillation through attention
Hugo Touvron
Matthieu Cord
Matthijs Douze
Francisco Massa
Alexandre Sablayrolles
Hervé Jégou
ViT
387
6,768
0
23 Dec 2020
End-to-End Human Pose and Mesh Reconstruction with Transformers
Kevin Qinghong Lin
Lijuan Wang
Zicheng Liu
ViT
74
626
0
17 Dec 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
657
41,103
0
22 Oct 2020
Pose2Mesh: Graph Convolutional Network for 3D Human Pose and Mesh Recovery from a 2D Human Pose
Hongsuk Choi
Gyeongsik Moon
Kyoung Mu Lee
3DH
61
383
0
20 Aug 2020
I2L-MeshNet: Image-to-Lixel Prediction Network for Accurate 3D Human Pose and Mesh Estimation from a Single RGB Image
Gyeongsik Moon
Kyoung Mu Lee
3DH
90
427
0
09 Aug 2020
Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation
Sheng Jin
Wentao Liu
Enze Xie
Wenhai Wang
Chao Qian
Wanli Ouyang
Ping Luo
3DH
86
126
0
23 Jul 2020
Whole-Body Human Pose Estimation in the Wild
Sheng Jin
Lumin Xu
Jin Xu
Can Wang
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
3DH
190
247
0
23 Jul 2020
3D Human Mesh Regression with Dense Correspondence
Wang Zeng
Wanli Ouyang
Ping Luo
Wentao Liu
Xiaogang Wang
3DH
36
95
0
10 Jun 2020
Exemplar Fine-Tuning for 3D Human Model Fitting Towards In-the-Wild 3D Human Pose Estimation
Hanbyul Joo
Natalia Neverova
Andrea Vedaldi
3DH
50
155
0
07 Apr 2020
Weakly Supervised 3D Human Pose and Shape Reconstruction with Normalizing Flows
Andrei Zanfir
Eduard Gabriel Bazavan
Hongyi Xu
Bill Freeman
Rahul Sukthankar
C. Sminchisescu
3DH
67
135
0
23 Mar 2020
TRB: A Novel Triplet Representation for Understanding 2D Human Body
Haodong Duan
Kwan-Yee Lin
Sheng Jin
Wentao Liu
Chao Qian
Wanli Ouyang
3DH
46
17
0
25 Oct 2019
Single-Network Whole-Body Pose Estimation
Gines Hidalgo
Yaadhav Raaj
Haroon Idrees
Donglai Xiang
Hanbyul Joo
Tomas Simon
Yaser Sheikh
3DH
164
101
0
30 Sep 2019
Learning to Reconstruct 3D Human Pose and Shape via Model-fitting in the Loop
Nikos Kolotouros
Georgios Pavlakos
Michael J. Black
Kostas Daniilidis
3DH
101
987
0
27 Sep 2019
HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation
Bowen Cheng
Bin Xiao
Jingdong Wang
Humphrey Shi
Thomas S. Huang
Lei Zhang
3DH
67
676
0
27 Aug 2019
Deep High-Resolution Representation Learning for Visual Recognition
Jingdong Wang
Ke Sun
Tianheng Cheng
Borui Jiang
Chaorui Deng
...
Yadong Mu
Mingkui Tan
Xinggang Wang
Wenyu Liu
Bin Xiao
393
3,614
0
20 Aug 2019
CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features
Sangdoo Yun
Dongyoon Han
Seong Joon Oh
Sanghyuk Chun
Junsuk Choe
Y. Yoo
OOD
619
4,780
0
13 May 2019
Convolutional Mesh Regression for Single-Image Human Shape Reconstruction
Nikos Kolotouros
Georgios Pavlakos
Kostas Daniilidis
3DH
89
527
0
08 May 2019
Multi-person Articulated Tracking with Spatial and Temporal Embeddings
Sheng Jin
Wentao Liu
Wanli Ouyang
Chao Qian
113
75
0
21 Mar 2019
Deep High-Resolution Representation Learning for Human Pose Estimation
Ke Sun
Bin Xiao
Dong Liu
Jingdong Wang
3DV
128
4,056
0
25 Feb 2019
OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
Zhe Cao
Gines Hidalgo
Tomas Simon
S. Wei
Yaser Sheikh
3DH
CVBM
124
4,592
0
18 Dec 2018
Hierarchical Graph Representation Learning with Differentiable Pooling
Rex Ying
Jiaxuan You
Christopher Morris
Xiang Ren
William L. Hamilton
J. Leskovec
GNN
297
2,148
0
22 Jun 2018
Look at Boundary: A Boundary-Aware Face Alignment Algorithm
Wayne Wu
Chao Qian
Shuo Yang
Quan Wang
Yici Cai
Qiang-feng Zhou
CVBM
3DV
75
438
0
26 May 2018
Simple Baselines for Human Pose Estimation and Tracking
Bin Xiao
Haiping Wu
Yichen Wei
3DH
VOT
121
1,792
0
17 Apr 2018
BodyNet: Volumetric Inference of 3D Human Body Shapes
Gül Varol
Duygu Ceylan
Bryan C. Russell
Jimei Yang
Ersin Yumer
Ivan Laptev
Cordelia Schmid
3DH
43
430
0
13 Apr 2018
End-to-end Recovery of Human Shape and Pose
Angjoo Kanazawa
Michael J. Black
David Jacobs
Jitendra Malik
3DH
210
1,796
0
18 Dec 2017
Wing Loss for Robust Facial Landmark Localisation with Convolutional Neural Networks
Zhenhua Feng
J. Kittler
Muhammad Awais
P. Huber
Xiaojun Wu
CVBM
56
399
0
17 Nov 2017
mixup: Beyond Empirical Risk Minimization
Hongyi Zhang
Moustapha Cissé
Yann N. Dauphin
David Lopez-Paz
NoLa
280
9,764
0
25 Oct 2017
Random Erasing Data Augmentation
Zhun Zhong
Liang Zheng
Guoliang Kang
Shaozi Li
Yi Yang
90
3,637
0
16 Aug 2017
Learning Feature Pyramids for Human Pose Estimation
Wei Yang
Shuang Li
Wanli Ouyang
Hongsheng Li
Xiaogang Wang
3DH
75
491
0
03 Aug 2017
1
2
Next