ResearchTrend.AI

A ConvNet for the 2020s

10 January 2022
Zhuang Liu
Hanzi Mao
Chaozheng Wu
Christoph Feichtenhofer
Trevor Darrell
Saining Xie
    ViT
arXiv: 2201.03545

Papers citing "A ConvNet for the 2020s"

50 / 188 papers shown
Kernel Space Diffusion Model for Efficient Remote Sensing Pansharpening
Hancong Jin
Zihan Cao
Liangjian Deng
DiffM
160
0
0
25 May 2025
EVM-Fusion: An Explainable Vision Mamba Architecture with Neural Algorithmic Fusion
Zichuan Yang
208
0
0
23 May 2025
Locality-Sensitive Hashing for Efficient Hard Negative Sampling in Contrastive Learning
Fabian Deuser
Philipp Hausenblas
Hannah Schieber
Daniel Roth
Martin Werner
Norbert Oswald
157
0
0
23 May 2025
Rethinking Features-Fused-Pyramid-Neck for Object Detection
Hulin Li
176
0
0
19 May 2025
Deep Learning-Based Robust Optical Guidance for Hypersonic Platforms
Adrien Chan-Hon-Tong
A. Plyer
Baptiste Cadalen
Laurent Serre
55
0
0
09 May 2025
Lightweight RGB-D Salient Object Detection from a Speed-Accuracy Tradeoff Perspective
Songsong Duan
Xi Yang
Nannan Wang
Xinbo Gao
104
0
0
07 May 2025
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
Young-Hu Park
R.-H. Park
Hyung-Min Park
93
0
0
07 May 2025
Breaking Annotation Barriers: Generalized Video Quality Assessment via Ranking-based Self-Supervision
Linhan Cao
Wei Sun
Kaiwei Zhang
Yicong Peng
Guangtao Zhai
Xiongkuo Min
98
0
0
06 May 2025
Task-Oriented Semantic Communication in Large Multimodal Models-based Vehicle Networks
Baoxia Du
H. Du
Dusit Niyato
Ruidong Li
102
0
0
05 May 2025
Handling Label Noise via Instance-Level Difficulty Modeling and Dynamic Optimization
Kuan Zhang
Chengliang Chai
Jingzhe Xu
Chi Zhang
Ye Yuan
Guoren Wang
Lei Cao
NoLa
123
1
0
01 May 2025
Gradient Attention Map Based Verification of Deep Convolutional Neural Networks with Application to X-ray Image Datasets
Omid Halimi Milani
Amanda Nikho
Lauren Mills
Marouane Tliba
Ahmet Enis Cetin
Mohammed H. Elnagar
MedIm
97
0
0
29 Apr 2025
HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding?
Yusen Zhang
Wenliang Zheng
Aashrith Madasu
Peng Shi
Ryo Kamoi
...
Ranran Haoran Zhang
Avitej Iyer
Renze Lou
Wenpeng Yin
Rui Zhang
233
0
0
25 Apr 2025
Perception Encoder: The best visual embeddings are not at the output of the network
Daniel Bolya
Po-Yao (Bernie) Huang
Peize Sun
Jang Hyun Cho
Andrea Madotto
...
Shiyu Dong
Nikhila Ravi
Daniel Li
Piotr Dollár
Christoph Feichtenhofer
ObjD
VOS
223
6
0
17 Apr 2025
Deep Generative Model-Based Generation of Synthetic Individual-Specific Brain MRI Segmentations
Ruijie Wang
Luca Rossetto
Susan Mérillat
Christina Röcke
Mike Martin
Abraham Bernstein
DiffM
MedIm
127
0
0
15 Apr 2025
D-Feat Occlusions: Diffusion Features for Robustness to Partial Visual Occlusions in Object Recognition
Rupayan Mallick
Sibo Dong
Nataniel Ruiz
Sarah Adel Bargal
DiffM
172
0
0
08 Apr 2025
HGFormer: Topology-Aware Vision Transformer with HyperGraph Learning
Hao Wang
Shuo Zhang
Biao Leng
ViT
217
1
0
03 Apr 2025
Spingarn's Method and Progressive Decoupling Beyond Elicitable Monotonicity
B. Evens
P. Latafat
Panagiotis Patrinos
174
1
0
01 Apr 2025
SupertonicTTS: Towards Highly Scalable and Efficient Text-to-Speech System
Hyeongju Kim
Jinhyeok Yang
Yechan Yu
Seunghun Ji
Jacob Morton
Frederik Bous
Joon Byun
Juheon Lee
113
0
0
29 Mar 2025
vGamba: Attentive State Space Bottleneck for efficient Long-range Dependencies in Visual Recognition
Yunusa Haruna
A. Lawan
Mamba
103
0
0
27 Mar 2025
Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
Size Wu
Wentao Zhang
Lumin Xu
Sheng Jin
Zhonghua Wu
Qingyi Tao
Wentao Liu
Wei Li
Chen Change Loy
VGen
369
5
0
27 Mar 2025
Poly-MgNet: Polynomial Building Blocks in Multigrid-Inspired ResNets
Antonia van Betteray
Matthias Rottmann
Karsten Kahl
142
1
0
13 Mar 2025
Data-driven tool wear prediction in milling, based on a process-integrated single-sensor approach
Eric Hirsch
Christian Friedrich
105
0
0
13 Mar 2025
Referring to Any Person
Qing Jiang
Lin Wu
Zhaoyang Zeng
Tianhe Ren
Yuda Xiong
Yihao Chen
Qin Liu
Lei Zhang
390
0
0
11 Mar 2025
Are We Truly Forgetting? A Critical Re-examination of Machine Unlearning Evaluation Protocols
Yongwoo Kim
Sungmin Cha
Donghyun Kim
MU
ELM
60
2
0
10 Mar 2025
CLIMB: Data Foundations for Large Scale Multimodal Clinical Foundation Models
Wei Dai
Peilin Chen
Malinda Lu
Daniel Li
Haowen Wei
Hejie Cui
Paul Pu Liang
LM&MA
118
3
0
09 Mar 2025
USP: Unified Self-Supervised Pretraining for Image Generation and Understanding
Xiangxiang Chu
Renda Li
Yong Wang
176
0
0
08 Mar 2025
ScaleFusionNet: Transformer-Guided Multi-Scale Feature Fusion for Skin Lesion Segmentation
Saqib Qamar
Syed Furqan Qadri
Roobaea Alroobaea
Majed Alsafyani
Abdullah M. Baqasah
ViT
MedIm
135
0
0
05 Mar 2025
Is Pre-training Applicable to the Decoder for Dense Prediction?
Chao Ning
Wanshui Gan
Weihao Xuan
Naoto Yokoya
199
0
0
05 Mar 2025
CADRef: Robust Out-of-Distribution Detection via Class-Aware Decoupled Relative Feature Leveraging
Zhiwei Ling
Yachen Chang
Hailiang Zhao
Xinkui Zhao
Kingsum Chow
Shuiguang Deng
OODD
117
0
0
01 Mar 2025
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
Meng Lou
Yizhou Yu
239
1
0
27 Feb 2025
Vision-LSTM: xLSTM as Generic Vision Backbone
Benedikt Alkin
M. Beck
Korbinian Poppel
Sepp Hochreiter
Johannes Brandstetter
VLM
158
46
0
24 Feb 2025
YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection
Yuming Chen
Xinbin Yuan
Ruiqi Wu
Jiabao Wang
Qibin Hou
Ming-Ming Cheng
ObjD
243
51
0
21 Feb 2025
Simpler Fast Vision Transformers with a Jumbo CLS Token
A. Fuller
Yousef Yassin
Daniel G. Kyrollos
Evan Shelhamer
James R. Green
141
0
0
20 Feb 2025
CoRPA: Adversarial Image Generation for Chest X-rays Using Concept Vector Perturbations and Generative Models
Amy Rafferty
Rishi Ramaesh
Ajitha Rajan
MedIm
AAML
99
0
0
04 Feb 2025
iFormer: Integrating ConvNet and Transformer for Mobile Application
Chuanyang Zheng
ViT
118
0
0
26 Jan 2025
Elucidating the Design Space of Dataset Condensation
Shitong Shao
Zikai Zhou
Huanran Chen
Zhiqiang Shen
DD
106
9
0
20 Jan 2025
AgRegNet: A Deep Regression Network for Flower and Fruit Density Estimation, Localization, and Counting in Orchards
Uddhav Bhattarai
Santosh Bhusal
Qin Zhang
Manoj Karkee
135
3
0
20 Jan 2025
MedFILIP: Medical Fine-grained Language-Image Pre-training
Xinjie Liang
Xiangyu Li
Fanding Li
Jie Jiang
Qing Dong
Wei Wang
Kaidi Wang
Suyu Dong
Gongning Luo
Shuo Li
LM&MA
VLM
MedIm
122
4
0
18 Jan 2025
EmoNeXt: an Adapted ConvNeXt for Facial Emotion Recognition
Yassine El Boudouri
Amine Bohi
126
17
0
14 Jan 2025
Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation
Yunzhi Zhuge
Hongyu Gu
Lu Zhang
Jinqing Qi
Huchuan Lu
VOS
129
3
0
14 Jan 2025
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training
Tianjin Huang
Ziquan Zhu
Gaojie Jin
Lu Liu
Zhangyang Wang
Shiwei Liu
84
4
0
12 Jan 2025
Discovering an Image-Adaptive Coordinate System for Photography Processing
Ziteng Cui
Lin Gu
Tatsuya Harada
90
2
0
11 Jan 2025
A Separable Self-attention Inspired by the State Space Model for Computer Vision
Juntao Zhang
Shaogeng Liu
Kun Bian
You Zhou
Pei Zhang
Jianning Liu
Jun Zhou
Bingyan Liu
Mamba
96
0
0
03 Jan 2025
Multi-Head Explainer: A General Framework to Improve Explainability in CNNs and Transformers
Bohang Sun
Pietro Liò
ViT
AAML
114
1
0
02 Jan 2025
VMamba: Visual State Space Model
Yue Liu
Yunjie Tian
Yuzhong Zhao
Hongtian Yu
Lingxi Xie
Yaowei Wang
Qixiang Ye
Jianbin Jiao
Yunfan Liu
Mamba
239
675
0
31 Dec 2024
SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection
Yuchen Li
Xianrui Li
Yunheng Li
Yanzhe Zhang
Yimian Dai
Qibin Hou
Ming-Ming Cheng
Jian Yang
164
7
0
31 Dec 2024
RobustBlack: Challenging Black-Box Adversarial Attacks on State-of-the-Art Defenses
Mohamed Djilani
Salah Ghamizi
Maxime Cordy
103
0
0
31 Dec 2024
GG-SSMs: Graph-Generating State Space Models
Nikola Zubić
Davide Scaramuzza
Mamba
154
1
0
17 Dec 2024
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
Yunxiang Fu
Meng Lou
Yizhou Yu
258
1
0
16 Dec 2024
CLIP-PING: Boosting Lightweight Vision-Language Models with Proximus Intrinsic Neighbors Guidance
Chu Myaet Thwal
Ye Lin Tun
Minh N. H. Nguyen
Eui-nam Huh
Choong Seon Hong
VLM
109
0
0
05 Dec 2024