ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.01254
  4. Cited By
BatchFormerV2: Exploring Sample Relationships for Dense Representation
  Learning

BatchFormerV2: Exploring Sample Relationships for Dense Representation Learning

4 April 2022
Zhi Hou
Baosheng Yu
Chaoyue Wang
Yibing Zhan
Dacheng Tao
    ViT
ArXivPDFHTML

Papers citing "BatchFormerV2: Exploring Sample Relationships for Dense Representation Learning"

9 / 9 papers shown
Title
Dual Relation Mining Network for Zero-Shot Learning
Dual Relation Mining Network for Zero-Shot Learning
Jinwei Han
Yingguo Gao
Zhiwen Lin
Ke Yan
Shouhong Ding
Yuan Gao
Gui-Song Xia
34
0
0
06 May 2024
LatentDR: Improving Model Generalization Through Sample-Aware Latent
  Degradation and Restoration
LatentDR: Improving Model Generalization Through Sample-Aware Latent Degradation and Restoration
Ran Liu
Sahil Khose
Jingyun Xiao
Lakshmi Sathidevi
Keerthan Ramnath
Z. Kira
Eva L. Dyer
34
3
0
28 Aug 2023
Rethinking Batch Sample Relationships for Data Representation: A
  Batch-Graph Transformer based Approach
Rethinking Batch Sample Relationships for Data Representation: A Batch-Graph Transformer based Approach
Xixi Wang
Bowei Jiang
Tianlin Li
Bin Luo
ViT
28
5
0
19 Nov 2022
Masked Autoencoders Are Scalable Vision Learners
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
305
7,443
0
11 Nov 2021
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
317
5,785
0
29 Apr 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction
  without Convolutions
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
277
3,623
0
24 Feb 2021
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,781
0
24 Feb 2021
Is Space-Time Attention All You Need for Video Understanding?
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
280
1,982
0
09 Feb 2021
FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape from
  Single RGB Images
FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape from Single RGB Images
Christiane Zimmermann
Duygu Ceylan
Jimei Yang
Bryan C. Russell
Max Argus
Thomas Brox
3DH
189
398
0
10 Sep 2019
1