ResearchTrend.AI
SimMIM: A Simple Framework for Masked Image Modeling


18 November 2021
Zhenda Xie
Zheng-Wei Zhang
Yue Cao
Yutong Lin
Jianmin Bao
Zhuliang Yao
Qi Dai
Han Hu

Papers citing "SimMIM: A Simple Framework for Masked Image Modeling"

50 / 849 papers shown
Title
Multi-Level Decoupled Relational Distillation for Heterogeneous Architectures
Yaoxin Yang
Peng Ye
Weihao Lin
Kangcong Li
Yan Wen
Jia Hao
Tao Chen
38
0
0
10 Feb 2025
A Self-Supervised Framework for Improved Generalisability in Ultrasound B-mode Image Segmentation
Edward Ellis
A. Bulpitt
Nasim Parsa
Michael F Byrne
Sharib Ali
93
0
0
04 Feb 2025
Particle Trajectory Representation Learning with Masked Point Modeling
Sam Young
Yeon-jae Jwa
Kazuhiro Terao
3DPC
69
1
0
04 Feb 2025
Unified 3D MRI Representations via Sequence-Invariant Contrastive Learning
Liam Chalcroft
Jenny Crinion
Cathy J. Price
John Ashburner
146
0
0
21 Jan 2025
Code and Pixels: Multi-Modal Contrastive Pre-training for Enhanced Tabular Data Analysis
Kankana Roy
Lars Krämer
Sebastian Domaschke
Malik Haris
Roland Aydin
Fabian Isensee
Martin Held
48
0
0
13 Jan 2025
PiLaMIM: Toward Richer Visual Representations by Integrating Pixel and Latent Masked Image Modeling
Junmyeong Lee
Eui Jun Hwang
Sukmin Cho
Jong C. Park
37
0
0
06 Jan 2025
Keypoint Aware Masked Image Modelling
Madhava Krishna
Convin.AI
73
0
0
03 Jan 2025
Enhancing Visual Representation for Text-based Person Searching
Wei Shen
Ming Fang
Yuxia Wang
Jiafeng Xiao
Diping Li
H. Chen
Ling Xu
Wenbo Zhang
35
1
0
31 Dec 2024
The Dynamic Duo of Collaborative Masking and Target for Advanced Masked Autoencoder Learning
Shentong Mo
39
0
0
23 Dec 2024
Read Like a Radiologist: Efficient Vision-Language Model for 3D Medical Imaging Interpretation
Changsun Lee
Sangjoon Park
Cheong-Il Shin
Woo Hee Choi
Hyun Jeong Park
J. Lee
Jong Chul Ye
69
0
0
18 Dec 2024
Measurement of Medial Elbow Joint Space using Landmark Detection
Shizuka Akahori
Shotaro Teruya
Pragyan Shrestha
Yuichi Yoshii
Ryuhei Michinobu
S. Iizuka
I. Kitahara
78
0
0
17 Dec 2024
USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature Decorrelation
Wanjiang Weng
Hongsong Wang
Junbo He
Lei He
Guosen Xie
91
2
0
12 Dec 2024
Beyond [cls]: Exploring the true potential of Masked Image Modeling representations
Marcin Przewięźlikowski
Randall Balestriero
Wojciech Jasiński
Marek Śmieja
Bartosz Zieliński
69
0
0
04 Dec 2024
Medical Multimodal Foundation Models in Clinical Diagnosis and Treatment: Applications, Challenges, and Future Directions
Kai Sun
Siyan Xue
F. Sun
Haoran Sun
Yu-Juan Luo
...
Xinzhou Wang
Lei Yang
Shuo Jin
Jun Yan
Jiahong Dong
AI4CE
76
2
0
03 Dec 2024
Rethinking Generalizability and Discriminability of Self-Supervised Learning from Evolutionary Game Theory Perspective
Jiangmeng Li
Zehua Zang
Qirui Ji
Chuxiong Sun
Jingyao Wang
Junge Zhang
Changwen Zheng
Gang Hua
Hui Xiong
SSL
69
0
0
30 Nov 2024
Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
Yueru Jia
Jiaming Liu
Sixiang Chen
Chenyang Gu
Z. Wang
...
Lily Lee
Pengwei Wang
Zhongyuan Wang
Renrui Zhang
Shanghang Zhang
89
11
0
27 Nov 2024
RS-vHeat: Heat Conduction Guided Efficient Remote Sensing Foundation Model
Huiyang Hu
Peijin Wang
Hanbo Bi
Boyuan Tong
Zehua Wang
...
Ziqi Zhang
QiXiang Ye
Kun Fu
Xian Sun
100
0
0
27 Nov 2024
MRIFE: A Mask-Recovering and Interactive-Feature-Enhancing Semantic Segmentation Network For Relic Landslide Detection
Juefei He
Yuexing Peng
Wei Li
Junchuan Yu
Daqing Ge
Wei Xiang
63
0
0
26 Nov 2024
GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration
Sudarshan Rajagopalan
Nithin Gopalakrishnan Nair
Jay N. Paranjape
Vishal M. Patel
DiffM
90
0
0
26 Nov 2024
Multi-Token Enhancing for Vision Representation Learning
Zhong-Yu Li
Yu-Song Hu
Bo Yin
Ming-Ming Cheng
66
1
0
24 Nov 2024
PR-MIM: Delving Deeper into Partial Reconstruction in Masked Image Modeling
Zhong-Yu Li
Yunheng Li
Deng-Ping Fan
Ming-Ming Cheng
73
0
0
24 Nov 2024
Improving Factuality of 3D Brain MRI Report Generation with Paired Image-domain Retrieval and Text-domain Augmentation
J. Lee
Y. Oh
Dahyoun Lee
Hyon Keun Joh
Chul-Ho Sohn
...
Cheol Kyu Jung
Jung Hyun Park
Kyu Sung Choi
Byung-Hoon Kim
Jong Chul Ye
DiffM
MedIm
75
0
0
23 Nov 2024
Relational Contrastive Learning and Masked Image Modeling for Scene Text Recognition
T. Lin
Jinglei Zhang
Yi Xu
Kai Chen
Rui Zhang
Cheng Chen
38
0
0
18 Nov 2024
From Prototypes to General Distributions: An Efficient Curriculum for Masked Image Modeling
Jinhong Lin
Cheng-En Wu
Huanran Li
Jifan Zhang
Yu Hen Hu
Pedro Morgado
41
0
0
16 Nov 2024
Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing
Kaixuan Lu
Ruiqian Zhang
Xiao Huang
Yuxing Xie
Xiaogang Ning
Hanchao Zhang
Mengke Yuan
Pan Zhang
Tao Wang
Tongkui Liao
37
0
0
09 Nov 2024
Classification Done Right for Vision-Language Pre-Training
Zilong Huang
Qinghao Ye
Bingyi Kang
Jiashi Feng
Haoqi Fan
CLIP
VLM
50
2
0
05 Nov 2024
IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision
Maxwell Meyer
Jack Spruyt
ViT
23
0
0
31 Oct 2024
Sparsh: Self-supervised touch representations for vision-based tactile sensing
Carolina Higuera
Akash Sharma
Chaithanya Krishna Bodduluri
Taosha Fan
Patrick E. Lancaster
...
Michael Kaess
Byron Boots
Mike Lambeta
Tingfan Wu
Mustafa Mukadam
42
12
0
31 Oct 2024
Connecting Joint-Embedding Predictive Architecture with Contrastive Self-supervised Learning
Shentong Mo
Shengbang Tong
40
1
0
25 Oct 2024
Learning Versatile Skills with Curriculum Masking
Yao Tang
Zhihui Xie
Zichuan Lin
Deheng Ye
Shuai Li
OffRL
33
0
0
23 Oct 2024
A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends
Junjun Jiang
Zengyuan Zuo
Gang Wu
Kui Jiang
Xianming Liu
48
10
0
19 Oct 2024
Adaptive Diffusion Terrain Generator for Autonomous Uneven Terrain Navigation
Youwei Yu
Junhong Xu
Lantao Liu
34
3
0
14 Oct 2024
Multi-modal Vision Pre-training for Medical Image Analysis
Shaohao Rui
Lingzhi Chen
Zhenyu Tang
Lilong Wang
M. Liu
S. Zhang
Xiaosong Wang
32
0
0
14 Oct 2024
Block-to-Scene Pre-training for Point Cloud Hybrid-Domain Masked Autoencoders
Yaohua Zha
Tao Dai
Yanzi Wang
Hang Guo
Taolin Zhang
Zhihao Ouyang
Chunlin Fan
Bin Chen
Ke Chen
Shu-Tao Xia
3DPC
30
1
0
13 Oct 2024
Calibrated Cache Model for Few-Shot Vision-Language Model Adaptation
Kun Ding
Qiang Yu
Haojian Zhang
Gaofeng Meng
Shiming Xiang
VLM
30
0
0
11 Oct 2024
On a Hidden Property in Computational Imaging
Yinan Feng
Yinpeng Chen
Yueh Lee
Youzuo Lin
28
0
0
11 Oct 2024
C^2DA: Contrastive and Context-aware Domain Adaptive Semantic Segmentation
Md. Al-Masrur Khan
Zheng Chen
Lantao Liu
25
0
0
10 Oct 2024
Progressive Multi-Modal Fusion for Robust 3D Object Detection
Rohit Mohan
Daniele Cattaneo
Florian Drews
Abhinav Valada
3DPC
43
3
0
09 Oct 2024
Self-Supervised Learning for Real-World Object Detection: a Survey
Alina Ciocarlan
Sidonie Lefebvre
S. L. Hégarat-Mascle
Arnaud Woiselle
ObjD
36
0
0
09 Oct 2024
Denoising with a Joint-Embedding Predictive Architecture
Dengsheng Chen
Jie Hu
Xiaoming Wei
Enhua Wu
DiffM
52
2
0
02 Oct 2024
Domain Aware Multi-Task Pretraining of 3D Swin Transformer for T1-weighted Brain MRI
Jonghun Kim
Mansu Kim
Hyunjin Park
MedIm
ViT
23
0
0
01 Oct 2024
Self-supervised Auxiliary Learning for Texture and Model-based Hybrid Robust and Fair Featuring in Face Analysis
Shukesh Reddy
Nishit Poddar
Srijan Das
Abhijit Das
CVBM
30
0
0
29 Sep 2024
Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration
Chu-Jie Qin
Rui-Qi Wu
Zikun Liu
Xin Lin
Chun-Le Guo
Hyun Hee Park
Chongyi Li
23
6
0
28 Sep 2024
UniEmoX: Cross-modal Semantic-Guided Large-Scale Pretraining for Universal Scene Emotion Perception
Chuang Chen
Xingchen Sun
Zhi Liu
33
0
0
27 Sep 2024
Face Forgery Detection with Elaborate Backbone
Zonghui Guo
Y. Liu
Jie Zhang
Haiyong Zheng
Shiguang Shan
AAML
CVBM
28
1
0
25 Sep 2024
Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling
Zixiao Wang
Hongtao Xie
Yuxin Wang
Yadong Qu
Fengjun Guo
Pengwei Liu
DiffM
33
0
0
20 Sep 2024
RingMo-Aerial: An Aerial Remote Sensing Foundation Model With A Affine Transformation Contrastive Learning
Wenhui Diao
Haichen Yu
Kaiyue Kang
Tong Ling
Di Liu
...
Hanbo Bi
Libo Ren
Xuexue Li
Yongqiang Mao
Xian Sun
34
1
0
20 Sep 2024
Is Tokenization Needed for Masked Particle Modelling?
Matthew Leigh
Samuel Klein
François Charton
Tobias Golling
Lukas Heinrich
Michael Kagan
Ines Ochoa
Margarita Osadchy
37
7
0
19 Sep 2024
Multi-OCT-SelfNet: Integrating Self-Supervised Learning with Multi-Source Data Fusion for Enhanced Multi-Class Retinal Disease Classification
Fatema Jannat
Sina Gholami
Jennifer I. Lim
Theodore Leng
Minhaj Nur Alam
Hamed Tabkhi
30
0
0
17 Sep 2024
Sparks of Artificial General Intelligence(AGI) in Semiconductor Material Science: Early Explorations into the Next Frontier of Generative AI-Assisted Electron Micrograph Analysis
Sakhinana Sagar Srinivas
Geethan Sannidhi
Sreeja Gangasani
Chidaksh Ravuru
Venkataramana Runkana
33
0
0
17 Sep 2024