Adept: Annotation-Denoising Auxiliary Tasks with Discrete Cosine Transform Map and Keypoint for Human-Centric Pretraining

29 April 2025
Weizhen He
Yunfeng Yan
Shixiang Tang
Yiheng Deng
Yangyang Zhong
Pengxin Luo
Donglian Qi
    VLM

Papers citing "Adept: Annotation-Denoising Auxiliary Tasks with Discrete Cosine Transform Map and Keypoint for Human-Centric Pretraining"

45 / 45 papers shown
Instruct-ReID++: Towards Universal Purpose Instruction-Guided Person Re-identification
Weizhen He
Yiheng Deng
Yunfeng Yan
Feng Zhu
Yizhou Wang
Lei Bai
Qingsong Xie
Donglian Qi
Wanli Ouyang
Shixiang Tang
141
3
0
28 May 2024
Enhancing Long-Term Person Re-Identification Using Global, Local Body Part, and Head Streams
Duy Tran Thanh
Yeejin Lee
Byeongkeun Kang
96
2
0
05 Mar 2024
Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?
Cheng Han
Qifan Wang
Yiming Cui
Wenguan Wang
Lifu Huang
Siyuan Qi
Dongfang Liu
VLM
137
22
0
23 Jan 2024
Instruct-ReID: A Multi-purpose Person Re-identification Task with Instructions
Weizhen He
Yiheng Deng
Shixiang Tang
Qihao Chen
Qingsong Xie
...
Feng Zhu
Rui Zhao
Wanli Ouyang
Donglian Qi
Yunfeng Yan
113
24
0
13 Jun 2023
TransFlow: Transformer as Flow Learner
Yawen Lu
Qifan Wang
Siqi Ma
Tong Geng
Victor Y. Chen
Huaijin Chen
Dongfang Liu
ViT
80
50
0
23 Apr 2023
HumanBench: Towards General Human-centric Perception with Projector Assisted Pretraining
Shixiang Tang
Cheng Chen
Qingsong Xie
Meilin Chen
Yizhou Wang
...
Feng Zhu
Haiyang Yang
Li Yi
Rui Zhao
Wanli Ouyang
VLM
95
36
0
10 Mar 2023
UniHCP: A Unified Model for Human-Centric Perceptions
Yuanzheng Ci
Yizhou Wang
Meilin Chen
Shixiang Tang
Lei Bai
Feng Zhu
Rui Zhao
F. Yu
Donglian Qi
Wanli Ouyang
130
52
0
06 Mar 2023
Structure-Encoding Auxiliary Tasks for Improved Visual Representation in Vision-and-Language Navigation
Chia-Wen Kuo
Chih-Yao Ma
Judy Hoffman
Z. Kira
70
10
0
20 Nov 2022
VICRegL: Self-Supervised Learning of Local Visual Features
Adrien Bardes
Jean Ponce
Yann LeCun
SSL
93
126
0
04 Oct 2022
Towards Unbiased Label Distribution Learning for Facial Pose Estimation Using Anisotropic Spherical Gaussian
Zhiwen Cao
Dongfang Liu
Qifan Wang
Victor Y. Chen
CVBM
33
17
0
19 Aug 2022
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
Yufei Xu
Jing Zhang
Qiming Zhang
Dacheng Tao
ViT
83
537
0
26 Apr 2022
MultiMAE: Multi-modal Multi-task Masked Autoencoders
Roman Bachmann
David Mizrahi
Andrei Atanov
Amir Zamir
131
277
0
04 Apr 2022
Versatile Multi-Modal Pre-Training for Human-Centric Perception
Fangzhou Hong
Liang Pan
Zhongang Cai
Ziwei Liu
VLM
69
24
0
25 Mar 2022
An End-to-End Transformer Model for Crowd Localization
Dingkang Liang
Wei Xu
Xiang Bai
50
123
0
26 Feb 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT, TPM
477
7,819
0
11 Nov 2021
Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans
Ainaz Eftekhar
Alexander Sax
Roman Bachmann
Jitendra Malik
Amir Zamir
MedIm
97
300
0
11 Oct 2021
Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework
Qingyu Song
Changan Wang
Zhengkai Jiang
Yabiao Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Yang Wu
74
266
0
27 Jul 2021
SimCC: a Simple Coordinate Classification Perspective for Human Pose Estimation
Yanjie Li
Sen Yang
Peidong Liu
Shoukui Zhang
Yunxiao Wang
Zhicheng Wang
Wankou Yang
Shutao Xia
3DH
66
130
0
07 Jul 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
711
6,127
0
29 Apr 2021
Auxiliary Tasks and Exploration Enable ObjectNav
Joel Ye
Dhruv Batra
Abhishek Das
Erik Wijmans
87
100
0
08 Apr 2021
Vision Transformers for Dense Prediction
René Ranftl
Alexey Bochkovskiy
V. Koltun
ViT, MDE
138
1,746
0
24 Mar 2021
Cluster Contrast for Unsupervised Person Re-Identification
Zuozhuo Dai
Guangyuan Wang
Weihao Yuan
Xiaoli Liu
Siyu Zhu
P. Tan
63
215
0
22 Mar 2021
Generating Images with Sparse Representations
C. Nash
Jacob Menick
Sander Dieleman
Peter W. Battaglia
83
211
0
05 Mar 2021
Few-shot Conformal Prediction with Auxiliary Tasks
Adam Fisch
Tal Schuster
Tommi Jaakkola
Regina Barzilay
367
56
0
17 Feb 2021
TransReID: Transformer-based Object Re-Identification
Shuting He
Hao Luo
Pichao Wang
F. Wang
Hao Li
Wei Jiang
ViT
273
819
0
08 Feb 2021
DenserNet: Weakly Supervised Visual Localization Using Multi-scale Feature Aggregation
Dongfang Liu
Yiming Cui
Liqi Yan
Christos Mousas
B. Yang
Yingjie Chen
147
126
0
04 Dec 2020
Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning
Zhenda Xie
Yutong Lin
Zheng Zhang
Yue Cao
Stephen Lin
Han Hu
SSL
114
415
0
19 Nov 2020
DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation
Xing Shen
Jirui Yang
Chunbo Wei
Bing Deng
Jianqiang Huang
Xiansheng Hua
Xiaoliang Cheng
K. Liang
ISeg
47
64
0
19 Nov 2020
Auxiliary Task Reweighting for Minimum-data Learning
Baifeng Shi
Judy Hoffman
Kate Saenko
Trevor Darrell
Huijuan Xu
MoMe
65
33
0
16 Oct 2020
Identity-Guided Human Semantic Parsing for Person Re-Identification
Kuan Zhu
Haiyun Guo
Zhiwei Liu
Ming Tang
Jinqiao Wang
258
289
0
27 Jul 2020
Self-Supervised MultiModal Versatile Networks
Jean-Baptiste Alayrac
Adrià Recasens
R. Schneider
Relja Arandjelović
Jason Ramapuram
J. Fauw
Lucas Smaira
Sander Dieleman
Andrew Zisserman
SSL
144
375
0
29 Jun 2020
Bootstrap your own latent: A new approach to self-supervised Learning
Jean-Bastien Grill
Florian Strub
Florent Altché
Corentin Tallec
Pierre Harvey Richemond
...
M. G. Azar
Bilal Piot
Koray Kavukcuoglu
Rémi Munos
Michal Valko
SSL
398
6,837
0
13 Jun 2020
FDA: Fourier Domain Adaptation for Semantic Segmentation
Yanchao Yang
Stefano Soatto
OOD
89
898
0
11 Apr 2020
Adapting Object Detectors with Conditional Domain Normalization
Peng Su
Kun Wang
Xingyu Zeng
Shixiang Tang
Dapeng Chen
Di Qiu
Xiaogang Wang
169
86
0
16 Mar 2020
Improved Baselines with Momentum Contrastive Learning
Xinlei Chen
Haoqi Fan
Ross B. Girshick
Kaiming He
SSL
495
3,443
0
09 Mar 2020
Learning in the Frequency Domain
Kai Xu
Minghai Qin
Fei Sun
Yuhao Wang
Yen-kuang Chen
Fengbo Ren
95
407
0
27 Feb 2020
A Simple Framework for Contrastive Learning of Visual Representations
Ting Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
SSL
378
18,866
0
13 Feb 2020
Deep High-Resolution Representation Learning for Visual Recognition
Jingdong Wang
Ke Sun
Tianheng Cheng
Borui Jiang
Chaorui Deng
...
Yadong Mu
Mingkui Tan
Xinggang Wang
Wenyu Liu
Bin Xiao
393
3,627
0
20 Aug 2019
Contrastive Multiview Coding
Yonglong Tian
Dilip Krishnan
Phillip Isola
SSL
174
2,409
0
13 Jun 2019
Instance-level Human Parsing via Part Grouping Network
Ke Gong
Xiaodan Liang
Yicheng Li
Yimin Chen
Ming-Hsuan Yang
Liang Lin
3DH
88
334
0
01 Aug 2018
Auxiliary Tasks in Multi-task Learning
Lukas Liebel
Marco Körner
SSL
55
253
0
16 May 2018
DensePose: Dense Human Pose Estimation In The Wild
R. Güler
Natalia Neverova
Iasonas Kokkinos
3DH
288
1,407
0
01 Feb 2018
Person Transfer GAN to Bridge Domain Gap for Person Re-Identification
Longhui Wei
Shiliang Zhang
Wen Gao
Q. Tian
GAN
104
1,670
0
23 Nov 2017
AI Challenger : A Large-scale Dataset for Going Deeper in Image Understanding
Jiahong Wu
He Zheng
Bo Zhao
Yixin Li
Baoming Yan
...
Shipei Zhou
G. Lin
Yanwei Fu
Yizhou Wang
Yonggang Wang
VLM
86
152
0
17 Nov 2017
Look into Person: Self-supervised Structure-sensitive Learning and A New Benchmark for Human Parsing
Ke Gong
Xiaodan Liang
Dongyu Zhang
Xiaohui Shen
Liang Lin
SSL
53
477
0
16 Mar 2017