Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.03167
Cited By
v1
v2
v3 (latest)
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
11 February 2015
Sergey Ioffe
Christian Szegedy
OOD
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift"
50 / 11,080 papers shown
Title
How far away are truly hyperparameter-free learning algorithms?
Priya Kasimbeg
Vincent Roulet
Naman Agarwal
Sourabh Medapati
Fabian Pedregosa
Atish Agarwala
George E. Dahl
22
0
0
29 May 2025
RC-AutoCalib: An End-to-End Radar-Camera Automatic Calibration Network
Van-Tin Luu
Yon-Lin Cai
Vu-Hoang Tran
Wei-Chen Chiu
Yi-Ting Chen
Ching-Chun Huang
12
0
0
28 May 2025
Hybrid Batch Normalisation: Resolving the Dilemma of Batch Normalisation in Federated Learning
Hongyao Chen
Tianyang Xu
Xiaojun Wu
Josef Kittler
FedML
21
0
0
28 May 2025
Learning in Compact Spaces with Approximately Normalized Transformers
Jörg Franke
Urs Spiegelhalter
Marianna Nezhurina
J. Jitsev
Frank Hutter
Michael Hefenbrock
52
0
0
28 May 2025
Relevance-driven Input Dropout: an Explanation-guided Regularization Technique
Shreyas Gururaj
Lars Grüne
Wojciech Samek
Sebastian Lapuschkin
Leander Weber
142
0
0
27 May 2025
Identifying Super Spreaders in Multilayer Networks
Michał Czuba
Mateusz Stolarski
Adam Piróg
Piotr Bielak
Piotr Bródka
31
0
0
27 May 2025
Moment kernels: a simple and scalable approach for equivariance to rotations and reflections in deep convolutional networks
Zachary Schlamowitz
Andrew Bennecke
Daniel J. Tward
34
0
0
27 May 2025
Detecting Informative Channels: ActionFormer
Kunpeng Zhao
Asahi Miyazaki
Tsuyoshi Okita
9
0
0
27 May 2025
CA3D: Convolutional-Attentional 3D Nets for Efficient Video Activity Recognition on the Edge
Gabriele Lagani
Fabrizio Falchi
Claudio Gennaro
Giuseppe Amato
17
0
0
26 May 2025
OCN: Effectively Utilizing Higher-Order Common Neighbors for Better Link Prediction
Juntong Wang
Xiyuan Wang
Muhan Zhang
107
0
0
26 May 2025
Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments
Junming Liu
Yanting Gao
Siyuan Meng
Yifei Sun
Aoqi Wu
Yufei Jin
Yirong Chen
Ding Wang
Guosun Zeng
64
1
0
26 May 2025
Translation-Equivariance of Normalization Layers and Aliasing in Convolutional Neural Networks
Jérémy Scanvic
Quentin Barthélemy
Julián Tachella
12
0
0
26 May 2025
Meta Pruning via Graph Metanetworks : A Meta Learning Framework for Network Pruning
Yewei Liu
Xiyuan Wang
Muhan Zhang
DD
GNN
40
0
0
24 May 2025
Joint-stochastic-approximation Random Fields with Application to Semi-supervised Learning
Yunfu Song
Zhijian Ou
15
0
0
24 May 2025
HiLAB: A Hybrid Inverse-Design Framework
Reza Marzban
Hamed Abiri
Raphael Pestourie
Ali Adibi
86
0
0
23 May 2025
Leveraging KANs for Expedient Training of Multichannel MLPs via Preconditioning and Geometric Refinement
Jonas A. Actor
Graham Harper
Ben Southworth
E. Cyr
50
0
0
23 May 2025
Bayesian Deep Learning for Discrete Choice
Daniel F. Villarraga
Ricardo A. Daziano
BDL
AI4CE
160
0
0
23 May 2025
Auto-nnU-Net: Towards Automated Medical Image Segmentation
Jannis Becktepe
Leona Hennig
Steffen Oeltze-Jafra
Marius Lindauer
238
0
0
22 May 2025
Higher-Order Asymptotics of Test-Time Adaptation for Batch Normalization Statistics
Masanari Kimura
21
0
0
22 May 2025
TULiP: Test-time Uncertainty Estimation via Linearization and Weight Perturbation
Yuhui Zhang
Dongshen Wu
Yuichiro Wada
Takafumi Kanamori
OODD
241
1
0
22 May 2025
MHANet: Multi-scale Hybrid Attention Network for Auditory Attention Detection
Lu Li
Cunhang Fan
Hongyu Zhang
Jingjing Zhang
Xiaoke Yang
Jian Zhou
Zhao Lv
21
0
0
21 May 2025
Integration of TinyML and LargeML: A Survey of 6G and Beyond
Thai-Hoc Vu
Ngo Hoang Tu
Thien Huynh-The
Kyungchun Lee
Sunghwan Kim
Miroslav Voznak
Quoc-Viet Pham
69
0
0
20 May 2025
Explaining Neural Networks with Reasons
Levin Hornischer
Hannes Leitgeb
FAtt
AAML
MILM
95
0
0
20 May 2025
Selective Structured State Space for Multispectral-fused Small Target Detection
Qianqian Zhang
WeiJun Wang
Yunxing Liu
Li Zhou
Hao Zhao
Junshe An
Zihan Wang
Mamba
248
0
0
20 May 2025
ThinkSwitcher: When to Think Hard, When to Think Fast
Guosheng Liang
Longguang Zhong
Ziyi Yang
Xiaojun Quan
LRM
65
1
0
20 May 2025
Self Distillation via Iterative Constructive Perturbations
Maheak Dave
Aniket K. Singh
Aryan Pareek
Harshita Jha
Debasis Chaudhuri
Manish P. Singh
ODL
47
0
0
20 May 2025
Learning to Insert for Constructive Neural Vehicle Routing Solver
Fu Luo
Xi Lin
Mengyuan Zhong
Fei Liu
Zhenkun Wang
Jianyong Sun
Qingfu Zhang
80
0
0
20 May 2025
An Overview of Arithmetic Adaptations for Inference of Convolutional Neural Networks on Re-configurable Hardware
Ilkay Wunderlich
Benjamin Koch
Sven Schönfeld
230
2
0
19 May 2025
Parallel Layer Normalization for Universal Approximation
Yunhao Ni
Yuhe Liu
Wenxin Sun
Yitong Tang
Yuxin Guo
Peilin Feng
Wenjun Wu
Lei Huang
100
0
0
19 May 2025
Advancing Generalization Across a Variety of Abstract Visual Reasoning Tasks
Mikołaj Małkiński
Jacek Mańdziuk
48
0
0
19 May 2025
Machine learning the first stage in 2SLS: Practical guidance from bias decomposition and simulation
Connor Lennon
Edward Rubin
Glen Waddell
43
0
0
19 May 2025
A Training Framework for Optimal and Stable Training of Polynomial Neural Networks
Forsad Al Hossain
Tauhidur Rahman
21
0
0
16 May 2025
From Fibers to Cells: Fourier-Based Registration Enables Virtual Cresyl Violet Staining From 3D Polarized Light Imaging
Alexander Oberstrass
Esteban Vaca
Eric Upschulte
Meiqi Niu
N. Palomero-Gallagher
David Graessel
Christian Schiffer
M. Axer
Katrin Amunts
Timo Dickscheid
71
0
0
16 May 2025
A Multi-modal Fusion Network for Terrain Perception Based on Illumination Aware
Rui Wang
Shichun Yang
Yuyi Chen
Z. Li
Zexiang Tong
Jinfeng Xu
Jiayi Lu
Xinjie Feng
Yaoguang Cao
61
0
0
16 May 2025
CSPENet: Contour-Aware and Saliency Priors Embedding Network for Infrared Small Target Detection
Jiakun Deng
Kexuan Li
Xingye Cui
Jiaxuan Li
Chang Long
Tian Pu
Zhenming Peng
99
0
0
15 May 2025
Uniform Loss vs. Specialized Optimization: A Comparative Analysis in Multi-Task Learning
Gabriel S. Gama
Valdir Grassi Jr
MoMe
103
0
0
15 May 2025
Robust Federated Learning with Confidence-Weighted Filtering and GAN-Based Completion under Noisy and Incomplete Data
Alpaslan Gokcen
Ali Boyaci
FedML
79
0
0
14 May 2025
Variational Prefix Tuning for Diverse and Accurate Code Summarization Using Pre-trained Language Models
Junda Zhao
Yuliang Song
Eldan Cohen
102
0
0
14 May 2025
Virtual Dosimetrists: A Radiotherapy Training "Flight Simulator"
S. Gay
Tucker Netherton
Barbara Marquez
Raymond P. Mumme
Mary P. Gronberg
Brent Parker
Chelsea Pinnix
Sanjay Shete
Carlos Cardenas
Laurence Court
61
0
0
14 May 2025
Neural Multivariate Regression: Qualitative Insights from the Unconstrained Feature Model
Shuyang Ling
Soyuj Jung Basnet
Juan Guevara
Li Guo
George Andriopoulos
76
0
0
14 May 2025
A Large-scale Benchmark on Geological Fault Delineation Models: Domain Shift, Training Dynamics, Generalizability, Evaluation and Inferential Behavior
Jorge Quesada
Chen Zhou
Prithwijit Chowdhury
Mohammad Alotaibi
Ahmad Mustafa
Yusufjon Kumamnov
Mohit Prabhushankar
Ghassan AlRegib
AI4CE
69
0
0
13 May 2025
Unsupervised Raindrop Removal from a Single Image using Conditional Diffusion Models
Lhuqita Fazry
Valentino Vito
DiffM
65
0
0
13 May 2025
Low-Complexity Inference in Continual Learning via Compressed Knowledge Transfer
Zhenrong Liu
Janne M. J. Huttunen
Mikko Honkala
CLL
70
0
0
13 May 2025
Tagging fully hadronic exotic decays of the vectorlike
B
\mathbf{B}
B
quark using a graph neural network
Jai Bardhan
Tanumoy Mandal
Subhadip Mitra
Cyrin Neeraj
Mihir Rawat
46
0
0
12 May 2025
Accurate and Efficient Multivariate Time Series Forecasting via Offline Clustering
Yiming Niu
Jinliang Deng
Lulu Zhang
Zimu Zhou
Yongxin Tong
AI4TS
165
0
0
09 May 2025
Document Image Rectification Bases on Self-Adaptive Multitask Fusion
Heng Li
Xiangping Wu
Qingcai Chen
121
0
0
09 May 2025
Mask-PINNs: Regulating Feature Distributions in Physics-Informed Neural Networks
Feilong Jiang
Xiaonan Hou
Jianqiao Ye
Min Xia
OOD
PINN
84
0
0
09 May 2025
Sparse Training from Random Initialization: Aligning Lottery Ticket Masks using Weight Symmetry
Mohammed Adnan
Rohan Jain
Ekansh Sharma
Rahul Krishnan
Yani Andrew Ioannou
93
0
0
08 May 2025
Threshold Modulation for Online Test-Time Adaptation of Spiking Neural Networks
Kejie Zhao
Wenjia Hua
Aiersi Tuerhong
Luziwei Leng
Yuxin Ma
Qinghua Guo
542
0
0
08 May 2025
Boosting Statistic Learning with Synthetic Data from Pretrained Large Models
Jialong Jiang
Wenkang Hu
Jian Huang
Yuling Jiao
Xu Liu
DiffM
76
0
0
08 May 2025
Previous
1
2
3
4
5
...
220
221
222
Next