Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1806.02375
Cited By
v1
v2
v3
v4 (latest)
Understanding Batch Normalization
1 June 2018
Johan Bjorck
Carla P. Gomes
B. Selman
Kilian Q. Weinberger
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Understanding Batch Normalization"
50 / 224 papers shown
Title
C
2
BNVAE
\text{C}^{2}\text{BNVAE}
C
2
BNVAE
: Dual-Conditional Deep Generation of Network Traffic Data for Network Intrusion Detection System Balancing
Yifan Zeng
60
0
0
06 Jun 2025
Hybrid Batch Normalisation: Resolving the Dilemma of Batch Normalisation in Federated Learning
Hongyao Chen
Tianyang Xu
Xiaojun Wu
Josef Kittler
FedML
25
0
0
28 May 2025
One Rank at a Time: Cascading Error Dynamics in Sequential Learning
Mahtab Alizadeh Vandchali
Fangshuo
Liao
Anastasios Kyrillidis
41
0
0
28 May 2025
Higher-Order Asymptotics of Test-Time Adaptation for Batch Normalization Statistics
Masanari Kimura
37
0
0
22 May 2025
Parallel Layer Normalization for Universal Approximation
Yunhao Ni
Yuhe Liu
Wenxin Sun
Yitong Tang
Yuxin Guo
Peilin Feng
Wenjun Wu
Lei Huang
103
0
0
19 May 2025
L-SWAG: Layer-Sample Wise Activation with Gradients information for Zero-Shot NAS on Vision Transformers
S. Casarin
Sergio Escalera
Oswald Lanz
113
0
0
12 May 2025
How to Train Your Metamorphic Deep Neural Network
Thomas Sommariva
Simone Calderara
Angelo Porrello
62
0
0
07 May 2025
Probabilistic Curriculum Learning for Goal-Based Reinforcement Learning
Llewyn Salt
Marcus Gallagher
64
1
0
02 Apr 2025
Transformers without Normalization
Jiachen Zhu
Xinlei Chen
Kaiming He
Yann LeCun
Zhuang Liu
OffRL
ViT
160
20
0
13 Mar 2025
Batch normalization does not improve initialization
Joris Dannemann
Gero Junike
ODL
95
0
0
25 Feb 2025
Self-Adjust Softmax
Chuanyang Zheng
Yihang Gao
Guoxuan Chen
Han Shi
Jing Xiong
Xiaozhe Ren
Chao Huang
Xin Jiang
Zhiyu Li
Yu Li
83
1
0
25 Feb 2025
GeoAggregator: An Efficient Transformer Model for Geo-Spatial Tabular Data
Rui Deng
Ziqi Li
Mingshu Wang
151
0
0
24 Feb 2025
Gradient Descent Converges Linearly to Flatter Minima than Gradient Flow in Shallow Linear Networks
Pierfrancesco Beneventano
Blake Woodworth
MLT
94
1
0
15 Jan 2025
A Generalized Unified Skew-Normal Process with Neural Bayes Inference
Kesen Wang
M. Genton
SyDa
462
0
0
26 Nov 2024
Semantic Knowledge Distillation for Onboard Satellite Earth Observation Image Classification
Thanh-Dung Le
Vu Nguyen Ha
T. Nguyen
G. Eappen
P. Thiruvasagam
...
Duc-Dung Tran
Luis Manuel Garcés Socarrás
J. L. González-Rios
JUAN CARLOS MERLANO DUNCAN
Symeon Chatzinotas
23
2
0
31 Oct 2024
Data Generation for Hardware-Friendly Post-Training Quantization
Lior Dikstein
Ariel Lapid
Arnon Netzer
H. Habi
MQ
484
0
0
29 Oct 2024
Data-Driven Gyroscope Calibration
Zeev Yampolsky
Itzik Klein
47
0
0
16 Oct 2024
The Impact of LoRA Adapters for LLMs on Clinical NLP Classification Under Data Limitations
Thanh-Dung Le
T. Nguyen
Vu Nguyen Ha
Symeon Chatzinotas
P. Jouvet
R. Noumeir
96
0
0
27 Jul 2024
Training-Free Model Merging for Multi-target Domain Adaptation
Wenyi Li
Huan-ang Gao
Mingju Gao
Beiwen Tian
Rong Zhi
Hao Zhao
MoMe
94
8
0
18 Jul 2024
SCSA: Exploring the Synergistic Effects Between Spatial and Channel Attention
Yunzhong Si
Huiying Xu
Xinzhong Zhu
Wenhao Zhang
Yao Dong
Yuxing Chen
Hongbo Li
114
36
0
06 Jul 2024
Foundations and Frontiers of Graph Learning Theory
Yu Huang
Min Zhou
Menglin Yang
Zhen Wang
Muhan Zhang
Jie Wang
Hong Xie
Hao Wang
Defu Lian
Enhong Chen
AI4CE
GNN
156
2
0
03 Jul 2024
fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions
Alireza Afzal Aghaei
79
52
0
11 Jun 2024
Soundscape Captioning using Sound Affective Quality Network and Large Language Model
Yuanbo Hou
Qiaoqiao Ren
A. Mitchell
Wenwu Wang
Jian Kang
Tony Belpaeme
Dick Botteldooren
110
3
0
09 Jun 2024
CCSI: Continual Class-Specific Impression for Data-free Class Incremental Learning
Sana Ayromlou
Teresa S. M. Tsang
Purang Abolmaesumi
Xiaoxiao Li
CLL
63
2
0
09 Jun 2024
On the Nonlinearity of Layer Normalization
Yunhao Ni
Yuxin Guo
Junlong Jia
Lei Huang
163
5
0
03 Jun 2024
No More Mumbles: Enhancing Robot Intelligibility through Speech Adaptation
Qiaoqiao Ren
Yuanbo Hou
Dick Botteldooren
Tony Belpaeme
44
4
0
15 May 2024
Hidden Synergy:
L
1
L_1
L
1
Weight Normalization and 1-Path-Norm Regularization
Aditya Biswas
81
1
0
29 Apr 2024
Align, Minimize and Diversify: A Source-Free Unsupervised Domain Adaptation Method for Handwritten Text Recognition
María Alfaro-Contreras
Jorge Calvo-Zaragoza
69
0
0
28 Apr 2024
GRANOLA: Adaptive Normalization for Graph Neural Networks
Moshe Eliasof
Beatrice Bevilacqua
Carola-Bibiane Schönlieb
Haggai Maron
112
5
0
20 Apr 2024
K-percent Evaluation for Lifelong RL
Golnaz Mesbahi
Parham Mohammad Panahi
Olya Mastikhina
Martha White
Adam White
CLL
OffRL
75
1
0
02 Apr 2024
CHAIN: Enhancing Generalization in Data-Efficient GANs via lipsCHitz continuity constrAIned Normalization
Yao Ni
Piotr Koniusz
AI4CE
GAN
105
2
0
31 Mar 2024
Towards Understanding Dual BN In Hybrid Adversarial Training
Chenshuang Zhang
Chaoning Zhang
Kang Zhang
Axi Niu
Junmo Kim
In So Kweon
AAML
83
1
0
28 Mar 2024
On permutation-invariant neural networks
Masanari Kimura
Ryotaro Shimizu
Yuki Hirakawa
Ryosuke Goto
Yuki Saito
OOD
AAML
94
12
0
26 Mar 2024
Refining Segmentation On-the-Fly: An Interactive Framework for Point Cloud Semantic Segmentation
Peng Zhang
Ting Wu
Jinsheng Sun
Weiqing Li
Zhiyong Su
75
1
0
11 Mar 2024
Improving Normalization with the James-Stein Estimator
Seyedalireza Khoshsirat
Chandra Kambhamettu
73
5
0
01 Dec 2023
ReMoS: 3D Motion-Conditioned Reaction Synthesis for Two-Person Interactions
Anindita Ghosh
Rishabh Dabral
Vladislav Golyanik
Christian Theobalt
Philipp Slusallek
99
23
0
28 Nov 2023
Unified Batch Normalization: Identifying and Alleviating the Feature Condensation in Batch Normalization and a Unified Framework
Shaobo Wang
Xiangdong Zhang
Dongrui Liu
Junchi Yan
109
0
0
27 Nov 2023
Coordinate-Aware Modulation for Neural Fields
J. Lee
Daniel Rho
Seungtae Nam
Jong Hwan Ko
Eunbyung Park
51
5
0
25 Nov 2023
Efficient Neural Networks for Tiny Machine Learning: A Comprehensive Review
M. Lê
Pierre Wolinski
Julyan Arbel
89
10
0
20 Nov 2023
Analytical Verification of Performance of Deep Neural Network Based Time-Synchronized Distribution System State Estimation
Behrouz Azimian
Shiva Moshtagh
Anamitra Pal
Shanshan Ma
64
4
0
12 Nov 2023
Improving Entropy-Based Test-Time Adaptation from a Clustering View
Guoliang Lin
Hanjiang Lai
Yan Pan
Jian Yin
OOD
TTA
28
2
0
31 Oct 2023
Overcoming Recency Bias of Normalization Statistics in Continual Learning: Balance and Adaptation
Yilin Lyu
Liyuan Wang
Xingxing Zhang
Zicheng Sun
Hang Su
Jun Zhu
Liping Jing
80
8
0
13 Oct 2023
Audio Event-Relational Graph Representation Learning for Acoustic Scene Classification
Yuanbo Hou
Siyang Song
Chuang Yu
Wenwu Wang
Dick Botteldooren
57
7
0
05 Oct 2023
Towards Training Without Depth Limits: Batch Normalization Without Gradient Explosion
Alexandru Meterez
Amir Joudaki
Francesco Orabona
Alexander Immer
Gunnar Rätsch
Hadi Daneshmand
71
8
0
03 Oct 2023
GHNet:Learning GNSS Heading from Velocity Measurements
Nitzan Dahan
Itzik Klein
32
0
0
18 Sep 2023
Deep Reinforcement Learning-based Scheduling for Optimizing System Load and Response Time in Edge and Fog Computing Environments
Zhiyu Wang
M. Goudarzi
Mingming Gong
Rajkumar Buyya
76
65
0
14 Sep 2023
Towards Understanding Neural Collapse: The Effects of Batch Normalization and Weight Decay
Leyan Pan
Xinyuan Cao
49
5
0
09 Sep 2023
Prediction of Diblock Copolymer Morphology via Machine Learning
Hyun Park
Boyuan Yu
Juhae Park
Ge Sun
E. Tajkhorshid
J. Pablo
Ludwig Schneider
AI4CE
62
2
0
31 Aug 2023
Joint Prediction of Audio Event and Annoyance Rating in an Urban Soundscape by Hierarchical Graph Representation Learning
Yuanbo Hou
Siyang Song
Cheng Luo
A. Mitchell
Qiaoqiao Ren
Weicheng Xie
Jian Kang
Wenwu Wang
Dick Botteldooren
71
6
0
23 Aug 2023
Uncertainty Quantification for Image-based Traffic Prediction across Cities
Alexander Timans
Nina Wiedemann
Nishant Kumar
Ye Hong
Martin Raubal
84
1
0
11 Aug 2023
1
2
3
4
5
Next