Understanding Batch Normalization

1 June 2018

Papers citing "Understanding Batch Normalization"

50 / 224 papers shown

Title
$\text{C}^{2}\text{BNVAE}$: Dual-Conditional Deep Generation of Network Traffic Data for Network Intrusion Detection System Balancing Yifan Zeng 60 0 0 06 Jun 2025
Hybrid Batch Normalisation: Resolving the Dilemma of Batch Normalisation in Federated Learning Hongyao Chen Tianyang Xu Xiaojun Wu Josef Kittler FedML 25 0 0 28 May 2025
One Rank at a Time: Cascading Error Dynamics in Sequential Learning Mahtab Alizadeh Vandchali Fangshuo Liao Anastasios Kyrillidis 41 0 0 28 May 2025
Higher-Order Asymptotics of Test-Time Adaptation for Batch Normalization Statistics Masanari Kimura 37 0 0 22 May 2025
Parallel Layer Normalization for Universal Approximation Yunhao Ni Yuhe Liu Wenxin Sun Yitong Tang Yuxin Guo Peilin Feng Wenjun Wu Lei Huang 103 0 0 19 May 2025
L-SWAG: Layer-Sample Wise Activation with Gradients information for Zero-Shot NAS on Vision Transformers S. Casarin Sergio Escalera Oswald Lanz 113 0 0 12 May 2025
How to Train Your Metamorphic Deep Neural Network Thomas Sommariva Simone Calderara Angelo Porrello 62 0 0 07 May 2025
Probabilistic Curriculum Learning for Goal-Based Reinforcement Learning Llewyn Salt Marcus Gallagher 64 1 0 02 Apr 2025
Transformers without Normalization Jiachen Zhu Xinlei Chen Kaiming He Yann LeCun Zhuang Liu OffRL ViT 160 20 0 13 Mar 2025
Batch normalization does not improve initialization Joris Dannemann Gero Junike ODL 95 0 0 25 Feb 2025
Self-Adjust Softmax Chuanyang Zheng Yihang Gao Guoxuan Chen Han Shi Jing Xiong Xiaozhe Ren Chao Huang Xin Jiang Zhiyu Li Yu Li 83 1 0 25 Feb 2025
GeoAggregator: An Efficient Transformer Model for Geo-Spatial Tabular Data Rui Deng Ziqi Li Mingshu Wang 151 0 0 24 Feb 2025
Gradient Descent Converges Linearly to Flatter Minima than Gradient Flow in Shallow Linear Networks Pierfrancesco Beneventano Blake Woodworth MLT 94 1 0 15 Jan 2025
A Generalized Unified Skew-Normal Process with Neural Bayes Inference Kesen Wang M. Genton SyDa 462 0 0 26 Nov 2024
Semantic Knowledge Distillation for Onboard Satellite Earth Observation Image Classification Thanh-Dung Le Vu Nguyen Ha T. Nguyen G. Eappen P. Thiruvasagam ... Duc-Dung Tran Luis Manuel Garcés Socarrás J. L. González-Rios Juan Carlos Merlano Duncan Symeon Chatzinotas 23 2 0 31 Oct 2024
Data Generation for Hardware-Friendly Post-Training Quantization Lior Dikstein Ariel Lapid Arnon Netzer H. Habi MQ 484 0 0 29 Oct 2024
Data-Driven Gyroscope Calibration Zeev Yampolsky Itzik Klein 47 0 0 16 Oct 2024
The Impact of LoRA Adapters for LLMs on Clinical NLP Classification Under Data Limitations Thanh-Dung Le T. Nguyen Vu Nguyen Ha Symeon Chatzinotas P. Jouvet R. Noumeir 96 0 0 27 Jul 2024
Training-Free Model Merging for Multi-target Domain Adaptation Wenyi Li Huan-ang Gao Mingju Gao Beiwen Tian Rong Zhi Hao Zhao MoMe 94 8 0 18 Jul 2024
SCSA: Exploring the Synergistic Effects Between Spatial and Channel Attention Yunzhong Si Huiying Xu Xinzhong Zhu Wenhao Zhang Yao Dong Yuxing Chen Hongbo Li 114 36 0 06 Jul 2024
Foundations and Frontiers of Graph Learning Theory Yu Huang Min Zhou Menglin Yang Zhen Wang Muhan Zhang Jie Wang Hong Xie Hao Wang Defu Lian Enhong Chen AI4CE GNN 156 2 0 03 Jul 2024
fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions Alireza Afzal Aghaei 79 52 0 11 Jun 2024
Soundscape Captioning using Sound Affective Quality Network and Large Language Model Yuanbo Hou Qiaoqiao Ren A. Mitchell Wenwu Wang Jian Kang Tony Belpaeme Dick Botteldooren 110 3 0 09 Jun 2024
CCSI: Continual Class-Specific Impression for Data-free Class Incremental Learning Sana Ayromlou Teresa S. M. Tsang Purang Abolmaesumi Xiaoxiao Li CLL 63 2 0 09 Jun 2024
On the Nonlinearity of Layer Normalization Yunhao Ni Yuxin Guo Junlong Jia Lei Huang 166 5 0 03 Jun 2024
No More Mumbles: Enhancing Robot Intelligibility through Speech Adaptation Qiaoqiao Ren Yuanbo Hou Dick Botteldooren Tony Belpaeme 44 4 0 15 May 2024
Hidden Synergy: $L_1$ Weight Normalization and 1-Path-Norm Regularization Aditya Biswas 81 1 0 29 Apr 2024
Align, Minimize and Diversify: A Source-Free Unsupervised Domain Adaptation Method for Handwritten Text Recognition María Alfaro-Contreras Jorge Calvo-Zaragoza 69 0 0 28 Apr 2024
GRANOLA: Adaptive Normalization for Graph Neural Networks Moshe Eliasof Beatrice Bevilacqua Carola-Bibiane Schönlieb Haggai Maron 112 5 0 20 Apr 2024
K-percent Evaluation for Lifelong RL Golnaz Mesbahi Parham Mohammad Panahi Olya Mastikhina Martha White Adam White CLL OffRL 75 1 0 02 Apr 2024
CHAIN: Enhancing Generalization in Data-Efficient GANs via lipsCHitz continuity constrAIned Normalization Yao Ni Piotr Koniusz AI4CE GAN 105 2 0 31 Mar 2024
Towards Understanding Dual BN In Hybrid Adversarial Training Chenshuang Zhang Chaoning Zhang Kang Zhang Axi Niu Junmo Kim In So Kweon AAML 83 1 0 28 Mar 2024
On permutation-invariant neural networks Masanari Kimura Ryotaro Shimizu Yuki Hirakawa Ryosuke Goto Yuki Saito OOD AAML 94 12 0 26 Mar 2024
Refining Segmentation On-the-Fly: An Interactive Framework for Point Cloud Semantic Segmentation Peng Zhang Ting Wu Jinsheng Sun Weiqing Li Zhiyong Su 75 1 0 11 Mar 2024
Improving Normalization with the James-Stein Estimator Seyedalireza Khoshsirat Chandra Kambhamettu 73 5 0 01 Dec 2023
ReMoS: 3D Motion-Conditioned Reaction Synthesis for Two-Person Interactions Anindita Ghosh Rishabh Dabral Vladislav Golyanik Christian Theobalt Philipp Slusallek 99 23 0 28 Nov 2023
Unified Batch Normalization: Identifying and Alleviating the Feature Condensation in Batch Normalization and a Unified Framework Shaobo Wang Xiangdong Zhang Dongrui Liu Junchi Yan 109 0 0 27 Nov 2023
Coordinate-Aware Modulation for Neural Fields J. Lee Daniel Rho Seungtae Nam Jong Hwan Ko Eunbyung Park 51 5 0 25 Nov 2023
Efficient Neural Networks for Tiny Machine Learning: A Comprehensive Review M. Lê Pierre Wolinski Julyan Arbel 89 10 0 20 Nov 2023
Analytical Verification of Performance of Deep Neural Network Based Time-Synchronized Distribution System State Estimation Behrouz Azimian Shiva Moshtagh Anamitra Pal Shanshan Ma 64 4 0 12 Nov 2023
Improving Entropy-Based Test-Time Adaptation from a Clustering View Guoliang Lin Hanjiang Lai Yan Pan Jian Yin OOD TTA 28 2 0 31 Oct 2023
Overcoming Recency Bias of Normalization Statistics in Continual Learning: Balance and Adaptation Yilin Lyu Liyuan Wang Xingxing Zhang Zicheng Sun Hang Su Jun Zhu Liping Jing 80 8 0 13 Oct 2023
Audio Event-Relational Graph Representation Learning for Acoustic Scene Classification Yuanbo Hou Siyang Song Chuang Yu Wenwu Wang Dick Botteldooren 57 7 0 05 Oct 2023
Towards Training Without Depth Limits: Batch Normalization Without Gradient Explosion Alexandru Meterez Amir Joudaki Francesco Orabona Alexander Immer Gunnar Rätsch Hadi Daneshmand 71 8 0 03 Oct 2023
GHNet: Learning GNSS Heading from Velocity Measurements Nitzan Dahan Itzik Klein 32 0 0 18 Sep 2023
Deep Reinforcement Learning-based Scheduling for Optimizing System Load and Response Time in Edge and Fog Computing Environments Zhiyu Wang M. Goudarzi Mingming Gong Rajkumar Buyya 76 65 0 14 Sep 2023
Towards Understanding Neural Collapse: The Effects of Batch Normalization and Weight Decay Leyan Pan Xinyuan Cao 49 5 0 09 Sep 2023
Prediction of Diblock Copolymer Morphology via Machine Learning Hyun Park Boyuan Yu Juhae Park Ge Sun E. Tajkhorshid J. Pablo Ludwig Schneider AI4CE 62 2 0 31 Aug 2023
Joint Prediction of Audio Event and Annoyance Rating in an Urban Soundscape by Hierarchical Graph Representation Learning Yuanbo Hou Siyang Song Cheng Luo A. Mitchell Qiaoqiao Ren Weicheng Xie Jian Kang Wenwu Wang Dick Botteldooren 71 6 0 23 Aug 2023
Uncertainty Quantification for Image-based Traffic Prediction across Cities Alexander Timans Nina Wiedemann Nishant Kumar Ye Hong Martin Raubal 84 1 0 11 Aug 2023