Communities
Connect sessions
AI calendar
Organizations
Contact Sales
Search
Open menu
Home
Papers
1611.05431
Cited By
v1
v2 (latest)
Aggregated Residual Transformations for Deep Neural Networks
16 November 2016
Saining Xie
Ross B. Girshick
Piotr Dollár
Zhuowen Tu
Kaiming He
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Aggregated Residual Transformations for Deep Neural Networks"
50 / 3,792 papers shown
Title
Purifying, Labeling, and Utilizing: A High-Quality Pipeline for Small Object Detection
Siwei Wang
Zhiwei Chen
Liujuan Cao
Rongrong Ji
ObjD
157
0
0
29 Apr 2025
Dual Attention Driven Lumbar Magnetic Resonance Image Feature Enhancement and Automatic Diagnosis of Herniation
Lingrui Zhang
Liang Guo
Xiao An
Feng Lin
Binlong Zheng
Jun Wang
Zhiyu Li
94
0
0
28 Apr 2025
Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models
Patrick Müller
Alexander Braun
Margret Keuper
148
0
0
25 Apr 2025
DMS-Net:Dual-Modal Multi-Scale Siamese Network for Binocular Fundus Image Classification
Guohao Huo
Zibo Lin
Zitong Wang
Ruiting Dai
Hao Tang
96
0
0
25 Apr 2025
DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks
Yinqi Li
Hong Chang
Ruibing Hou
Shiguang Shan
Xilin Chen
DiffM
173
0
0
24 Apr 2025
A multi-scale vision transformer-based multimodal GeoAI model for mapping Arctic permafrost thaw
Wenwen Li
Chia-Yu Hsu
Sizhe Wang
Zhining Gu
Yili Yang
Brendan M. Rogers
A. Liljedahl
150
3
0
23 Apr 2025
Seeking Flat Minima over Diverse Surrogates for Improved Adversarial Transferability: A Theoretical Framework and Algorithmic Instantiation
Meixi Zheng
Kehan Wu
Yanbo Fan
Rui Huang
Baoyuan Wu
AAML
115
0
0
23 Apr 2025
An Automated Pipeline for Few-Shot Bird Call Classification: A Case Study with the Tooth-Billed Pigeon
Abhishek Jana
Moeumu Uili
James Atherton
Mark O'Brien
Joe Wood
Leandra Brickson
323
0
0
22 Apr 2025
Integrating Non-Linear Radon Transformation for Diabetic Retinopathy Grading
Farida Mohsen
S. Belhaouari
Zubair Shah
MedIm
81
0
0
22 Apr 2025
An XAI-based Analysis of Shortcut Learning in Neural Networks
Phuong Quynh Le
Jorg Schlotterer
Christin Seifert
AAML
169
0
0
22 Apr 2025
ECViT: Efficient Convolutional Vision Transformer with Local-Attention and Multi-scale Stages
Zhoujie Qian
ViT
127
1
0
21 Apr 2025
Learning Compatible Multi-Prize Subnetworks for Asymmetric Retrieval
Yushuai Sun
Zikun Zhou
Shihong Deng
Yaowei Wang
Jun Yu
Guangming Lu
Wenjie Pei
101
0
0
16 Apr 2025
Fine-Grained Rib Fracture Diagnosis with Hyperbolic Embeddings: A Detailed Annotation Framework and Multi-Label Classification Model
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Shripad Pate
Aiman Farooq
Suvrankar Datta
Musadiq Aadil Sheikh
Atin Kumar
Deepak Mishra
96
0
0
15 Apr 2025
COUNTS: Benchmarking Object Detectors and Multimodal Large Language Models under Distribution Shifts
Jiansheng Li
Xingxuan Zhang
Hao Zou
Yige Guo
Renzhe Xu
Yilong Liu
Chuzhao Zhu
Yue He
Peng Cui
VLM
141
0
0
14 Apr 2025
GFT: Gradient Focal Transformer
Boris Kriuk
Simranjit Kaur Gill
Shoaib Aslam
Amir Fakhrutdinov
124
0
0
14 Apr 2025
BabyVLM: Data-Efficient Pretraining of VLMs Inspired by Infant Learning
Shengao Wang
Arjun Chandra
Aoming Liu
Venkatesh Saligrama
Boqing Gong
MLLM
VLM
128
0
0
13 Apr 2025
Explorer: Robust Collection of Interactable GUI Elements
Iason Chaimalas
Arnas Vyšniauskas
Gabriel Brostow
111
0
0
12 Apr 2025
Exploring Synergistic Ensemble Learning: Uniting CNNs, MLP-Mixers, and Vision Transformers to Enhance Image Classification
Mk Bashar
Ocean Monjur
Samia Islam
Mohammad Galib Shams
Niamul Quader
UQCV
133
0
0
12 Apr 2025
S-EO: A Large-Scale Dataset for Geometry-Aware Shadow Detection in Remote Sensing Applications
Masquil Elías
Marí Roger
Ehret Thibaud
Meinhardt-Llopis Enric
Musé Pablo
Facciolo Gabriele
MDE
182
1
0
09 Apr 2025
Don't Lag, RAG: Training-Free Adversarial Detection Using RAG
Roie Kazoom
Raz Lapid
Moshe Sipper
Ofer Hadar
VLM
ObjD
AAML
237
4
0
07 Apr 2025
Edge Approximation Text Detector
Chuang Yang
Xu Han
T. Han
Han Han
Bingxuan Zhao
Qi Wang
186
4
0
05 Apr 2025
Mathematical Modeling of Option Pricing with an Extended Black-Scholes Framework
Nikhil Shivakumar Nayak
253
0
0
04 Apr 2025
ZFusion: An Effective Fuser of Camera and 4D Radar for 3D Object Perception in Autonomous Driving
Sheng Yang
Tong Zhan
Shichen Qiao
Jicheng Gong
Qing Yang
Jian Wang
Yanfeng Lu
3DPC
175
0
0
04 Apr 2025
Image Coding for Machines via Feature-Preserving Rate-Distortion Optimization
Samuel Fernández-Menduiña
Eduardo Pavez
Antonio Ortega
143
0
0
03 Apr 2025
Multivariate Temporal Regression at Scale: A Three-Pillar Framework Combining ML, XAI, and NLP
Jiztom Kavalakkatt Francis
Matthew J. Darr
131
1
0
02 Apr 2025
Scaling Language-Free Visual Representation Learning
David Fan
Shengbang Tong
Jiachen Zhu
Koustuv Sinha
Zhuang Liu
...
Michael G. Rabbat
Nicolas Ballas
Yann LeCun
Amir Bar
Saining Xie
CLIP
VLM
243
16
0
01 Apr 2025
Artificial Intelligence-Assisted Prostate Cancer Diagnosis for Reduced Use of Immunohistochemistry
A. Blilie
N. Mulliqi
X. Ji
Kelvin Szolnoky
Sol Erika Boman
...
S. R. Kjosavik
L. Egevad
E. Janssen
M. Eklund
K. Kartasalo
182
0
0
31 Mar 2025
HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment
Zhichao Liao
Xiaokun Liu
Wenyu Qin
Qingyu Li
Qiulin Wang
Pengfei Wan
Di Zhang
Long Zeng
Pingfa Feng
293
6
0
31 Mar 2025
FIESTA: Fisher Information-based Efficient Selective Test-time Adaptation
Mohammadmahdi Honarmand
O. Mutlu
Parnian Azizian
Saimourya Surabhi
Dennis Paul Wall
TTA
140
0
0
29 Mar 2025
SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity
Ke Ma
Jiaqi Tang
B. Guo
Fan Dang
Sicong Liu
...
Lei Wu
Cheng Fang
Ying-Cong Chen
Zhiwen Yu
Yunhao Liu
TTA
171
1
0
26 Mar 2025
TraNCE: Transformative Non-linear Concept Explainer for CNNs
Ugochukwu Ejike Akpudo
Yongsheng Gao
J. Zhou
Andrew Lewis
164
2
0
26 Mar 2025
Wavelet-based Global-Local Interaction Network with Cross-Attention for Multi-View Diabetic Retinopathy Detection
Yihan Hu
Yuxin Lin
Chengliang Liu
Xiaoling Luo
Xiaoyan Dou
Qihao Xu
Yong-mei Xu
96
0
0
25 Mar 2025
k-NN as a Simple and Effective Estimator of Transferability
Moein Sorkhei
Christos Matsoukas
Johan Fredin Haslum
Emir Konuk
Kevin Smith
174
0
0
24 Mar 2025
GOAL: Global-local Object Alignment Learning
Hyungyu Choi
Young Kyun Jang
Chanho Eom
VLM
613
2
0
22 Mar 2025
CoRLD: Contrastive Representation Learning Of Deformable Shapes In Images
Tonmoy Hossain
Miaomiao Zhang
189
0
0
21 Mar 2025
NdLinear: Preserving Multi-Dimensional Structure for Parameter-Efficient Neural Networks
Alex Reneau
Jerry Yao-Chieh Hu
Zhongfang Zhuang
Ting-Chun Liu
Xiang He
Judah Goldfeder
Nadav Timor
Allen Roush
Ravid Shwartz-Ziv
HAI
189
0
0
21 Mar 2025
Beyond Accuracy: What Matters in Designing Well-Behaved Models?
Robin Hesse
Doğukan Bağcı
Bernt Schiele
Simone Schaub-Meyer
Stefan Roth
VLM
173
0
0
21 Mar 2025
DermDiff: Generative Diffusion Model for Mitigating Racial Biases in Dermatology Diagnosis
Nusrat Munia
Abdullah-Al-Zubaer Imran
MedIm
151
2
0
21 Mar 2025
LIFT: Latent Implicit Functions for Task- and Data-Agnostic Encoding
Amirhossein Kazerouni
Soroush Mehraban
Michael Brudno
Babak Taati
126
3
0
19 Mar 2025
Global Renewables Watch: A Temporal Dataset of Solar and Wind Energy Derived from Satellite Imagery
Caleb Robinson
Anthony Ortiz
Allen Kim
Rahul Dodhia
Andrew Zolli
Shivaprakash K. Nagaraju
J. O
J. Kiesecker
J. L. Ferres
142
2
0
19 Mar 2025
Improving Adversarial Transferability on Vision Transformers via Forward Propagation Refinement
Yuchen Ren
Subrat Kishore Dutta
Chenhao Lin
Bo Yang
Zhe Liu
Jiafei Wu
Chao Shen
ViT
142
3
0
19 Mar 2025
DPImageBench: A Unified Benchmark for Differentially Private Image Synthesis
Chen Gong
Kecen Li
Zinan Lin
Tianhao Wang
360
6
0
18 Mar 2025
Shape Bias and Robustness Evaluation via Cue Decomposition for Image Classification and Segmentation
Edgar Heinert
Thomas Gottwald
Annika Mütze
Matthias Rottmann
176
1
0
16 Mar 2025
dFLMoE: Decentralized Federated Learning via Mixture of Experts for Medical Data Analysis
Luyuan Xie
Tianyu Luan
Wenyuan Cai
Guochen Yan
Zhaoyu Chen
Nan Xi
Yuejian Fang
Qingni Shen
Zhonghai Wu
Junsong Yuan
FedML
484
0
0
13 Mar 2025
Transformers without Normalization
Jiachen Zhu
Xinlei Chen
Kaiming He
Yann LeCun
Zhuang Liu
OffRL
ViT
217
48
0
13 Mar 2025
Poly-MgNet: Polynomial Building Blocks in Multigrid-Inspired ResNets
Antonia van Betteray
Matthias Rottmann
Karsten Kahl
241
1
0
13 Mar 2025
Structural and Statistical Texture Knowledge Distillation and Learning for Segmentation
Deyi Ji
Feng Zhao
Hongtao Lu
Feng Wu
Jieping Ye
195
5
0
11 Mar 2025
Towards All-in-One Medical Image Re-Identification
Yuan Tian
Kaiyuan Ji
Rongzhao Zhang
Yankai Jiang
Chunyi Li
Xiaosong Wang
Guoquan Zheng
VLM
98
2
0
11 Mar 2025
Elderly Activity Recognition in the Wild: Results from the EAR Challenge
Anh-Kiet Duong
99
0
0
10 Mar 2025
Impoola: The Power of Average Pooling for Image-Based Deep Reinforcement Learning
Raphael Trumpp
Ansgar Schäfftlein
Mirco Theile
Marco Caccamo
152
1
0
07 Mar 2025
Previous
1
2
3
4
5
6
...
74
75
76
Next