ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03167
  4. Cited By
Batch Normalization: Accelerating Deep Network Training by Reducing
  Internal Covariate Shift
v1v2v3 (latest)

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

11 February 2015
Sergey Ioffe
Christian Szegedy
    OOD
ArXiv (abs)PDFHTML

Papers citing "Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift"

50 / 11,080 papers shown
Title
How far away are truly hyperparameter-free learning algorithms?
How far away are truly hyperparameter-free learning algorithms?
Priya Kasimbeg
Vincent Roulet
Naman Agarwal
Sourabh Medapati
Fabian Pedregosa
Atish Agarwala
George E. Dahl
22
0
0
29 May 2025
RC-AutoCalib: An End-to-End Radar-Camera Automatic Calibration Network
RC-AutoCalib: An End-to-End Radar-Camera Automatic Calibration Network
Van-Tin Luu
Yon-Lin Cai
Vu-Hoang Tran
Wei-Chen Chiu
Yi-Ting Chen
Ching-Chun Huang
12
0
0
28 May 2025
Hybrid Batch Normalisation: Resolving the Dilemma of Batch Normalisation in Federated Learning
Hongyao Chen
Tianyang Xu
Xiaojun Wu
Josef Kittler
FedML
21
0
0
28 May 2025
Learning in Compact Spaces with Approximately Normalized Transformers
Learning in Compact Spaces with Approximately Normalized Transformers
Jörg Franke
Urs Spiegelhalter
Marianna Nezhurina
J. Jitsev
Frank Hutter
Michael Hefenbrock
52
0
0
28 May 2025
Relevance-driven Input Dropout: an Explanation-guided Regularization Technique
Relevance-driven Input Dropout: an Explanation-guided Regularization Technique
Shreyas Gururaj
Lars Grüne
Wojciech Samek
Sebastian Lapuschkin
Leander Weber
142
0
0
27 May 2025
Identifying Super Spreaders in Multilayer Networks
Identifying Super Spreaders in Multilayer Networks
Michał Czuba
Mateusz Stolarski
Adam Piróg
Piotr Bielak
Piotr Bródka
31
0
0
27 May 2025
Moment kernels: a simple and scalable approach for equivariance to rotations and reflections in deep convolutional networks
Moment kernels: a simple and scalable approach for equivariance to rotations and reflections in deep convolutional networks
Zachary Schlamowitz
Andrew Bennecke
Daniel J. Tward
34
0
0
27 May 2025
Detecting Informative Channels: ActionFormer
Detecting Informative Channels: ActionFormer
Kunpeng Zhao
Asahi Miyazaki
Tsuyoshi Okita
9
0
0
27 May 2025
CA3D: Convolutional-Attentional 3D Nets for Efficient Video Activity Recognition on the Edge
CA3D: Convolutional-Attentional 3D Nets for Efficient Video Activity Recognition on the Edge
Gabriele Lagani
Fabrizio Falchi
Claudio Gennaro
Giuseppe Amato
17
0
0
26 May 2025
OCN: Effectively Utilizing Higher-Order Common Neighbors for Better Link Prediction
OCN: Effectively Utilizing Higher-Order Common Neighbors for Better Link Prediction
Juntong Wang
Xiyuan Wang
Muhan Zhang
107
0
0
26 May 2025
Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments
Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments
Junming Liu
Yanting Gao
Siyuan Meng
Yifei Sun
Aoqi Wu
Yufei Jin
Yirong Chen
Ding Wang
Guosun Zeng
64
1
0
26 May 2025
Translation-Equivariance of Normalization Layers and Aliasing in Convolutional Neural Networks
Translation-Equivariance of Normalization Layers and Aliasing in Convolutional Neural Networks
Jérémy Scanvic
Quentin Barthélemy
Julián Tachella
12
0
0
26 May 2025
Meta Pruning via Graph Metanetworks : A Meta Learning Framework for Network Pruning
Meta Pruning via Graph Metanetworks : A Meta Learning Framework for Network Pruning
Yewei Liu
Xiyuan Wang
Muhan Zhang
DDGNN
40
0
0
24 May 2025
Joint-stochastic-approximation Random Fields with Application to Semi-supervised Learning
Joint-stochastic-approximation Random Fields with Application to Semi-supervised Learning
Yunfu Song
Zhijian Ou
15
0
0
24 May 2025
HiLAB: A Hybrid Inverse-Design Framework
HiLAB: A Hybrid Inverse-Design Framework
Reza Marzban
Hamed Abiri
Raphael Pestourie
Ali Adibi
86
0
0
23 May 2025
Leveraging KANs for Expedient Training of Multichannel MLPs via Preconditioning and Geometric Refinement
Leveraging KANs for Expedient Training of Multichannel MLPs via Preconditioning and Geometric Refinement
Jonas A. Actor
Graham Harper
Ben Southworth
E. Cyr
50
0
0
23 May 2025
Bayesian Deep Learning for Discrete Choice
Bayesian Deep Learning for Discrete Choice
Daniel F. Villarraga
Ricardo A. Daziano
BDLAI4CE
160
0
0
23 May 2025
Auto-nnU-Net: Towards Automated Medical Image Segmentation
Auto-nnU-Net: Towards Automated Medical Image Segmentation
Jannis Becktepe
Leona Hennig
Steffen Oeltze-Jafra
Marius Lindauer
238
0
0
22 May 2025
Higher-Order Asymptotics of Test-Time Adaptation for Batch Normalization Statistics
Higher-Order Asymptotics of Test-Time Adaptation for Batch Normalization Statistics
Masanari Kimura
21
0
0
22 May 2025
TULiP: Test-time Uncertainty Estimation via Linearization and Weight Perturbation
TULiP: Test-time Uncertainty Estimation via Linearization and Weight Perturbation
Yuhui Zhang
Dongshen Wu
Yuichiro Wada
Takafumi Kanamori
OODD
241
1
0
22 May 2025
MHANet: Multi-scale Hybrid Attention Network for Auditory Attention Detection
MHANet: Multi-scale Hybrid Attention Network for Auditory Attention Detection
Lu Li
Cunhang Fan
Hongyu Zhang
Jingjing Zhang
Xiaoke Yang
Jian Zhou
Zhao Lv
21
0
0
21 May 2025
Integration of TinyML and LargeML: A Survey of 6G and Beyond
Integration of TinyML and LargeML: A Survey of 6G and Beyond
Thai-Hoc Vu
Ngo Hoang Tu
Thien Huynh-The
Kyungchun Lee
Sunghwan Kim
Miroslav Voznak
Quoc-Viet Pham
69
0
0
20 May 2025
Explaining Neural Networks with Reasons
Explaining Neural Networks with Reasons
Levin Hornischer
Hannes Leitgeb
FAttAAMLMILM
95
0
0
20 May 2025
Selective Structured State Space for Multispectral-fused Small Target Detection
Selective Structured State Space for Multispectral-fused Small Target Detection
Qianqian Zhang
WeiJun Wang
Yunxing Liu
Li Zhou
Hao Zhao
Junshe An
Zihan Wang
Mamba
248
0
0
20 May 2025
ThinkSwitcher: When to Think Hard, When to Think Fast
ThinkSwitcher: When to Think Hard, When to Think Fast
Guosheng Liang
Longguang Zhong
Ziyi Yang
Xiaojun Quan
LRM
65
1
0
20 May 2025
Self Distillation via Iterative Constructive Perturbations
Self Distillation via Iterative Constructive Perturbations
Maheak Dave
Aniket K. Singh
Aryan Pareek
Harshita Jha
Debasis Chaudhuri
Manish P. Singh
ODL
47
0
0
20 May 2025
Learning to Insert for Constructive Neural Vehicle Routing Solver
Learning to Insert for Constructive Neural Vehicle Routing Solver
Fu Luo
Xi Lin
Mengyuan Zhong
Fei Liu
Zhenkun Wang
Jianyong Sun
Qingfu Zhang
80
0
0
20 May 2025
An Overview of Arithmetic Adaptations for Inference of Convolutional Neural Networks on Re-configurable Hardware
An Overview of Arithmetic Adaptations for Inference of Convolutional Neural Networks on Re-configurable Hardware
Ilkay Wunderlich
Benjamin Koch
Sven Schönfeld
230
2
0
19 May 2025
Parallel Layer Normalization for Universal Approximation
Parallel Layer Normalization for Universal Approximation
Yunhao Ni
Yuhe Liu
Wenxin Sun
Yitong Tang
Yuxin Guo
Peilin Feng
Wenjun Wu
Lei Huang
100
0
0
19 May 2025
Advancing Generalization Across a Variety of Abstract Visual Reasoning Tasks
Advancing Generalization Across a Variety of Abstract Visual Reasoning Tasks
Mikołaj Małkiński
Jacek Mańdziuk
48
0
0
19 May 2025
Machine learning the first stage in 2SLS: Practical guidance from bias decomposition and simulation
Machine learning the first stage in 2SLS: Practical guidance from bias decomposition and simulation
Connor Lennon
Edward Rubin
Glen Waddell
43
0
0
19 May 2025
A Training Framework for Optimal and Stable Training of Polynomial Neural Networks
A Training Framework for Optimal and Stable Training of Polynomial Neural Networks
Forsad Al Hossain
Tauhidur Rahman
21
0
0
16 May 2025
From Fibers to Cells: Fourier-Based Registration Enables Virtual Cresyl Violet Staining From 3D Polarized Light Imaging
From Fibers to Cells: Fourier-Based Registration Enables Virtual Cresyl Violet Staining From 3D Polarized Light Imaging
Alexander Oberstrass
Esteban Vaca
Eric Upschulte
Meiqi Niu
N. Palomero-Gallagher
David Graessel
Christian Schiffer
M. Axer
Katrin Amunts
Timo Dickscheid
71
0
0
16 May 2025
A Multi-modal Fusion Network for Terrain Perception Based on Illumination Aware
A Multi-modal Fusion Network for Terrain Perception Based on Illumination Aware
Rui Wang
Shichun Yang
Yuyi Chen
Z. Li
Zexiang Tong
Jinfeng Xu
Jiayi Lu
Xinjie Feng
Yaoguang Cao
61
0
0
16 May 2025
CSPENet: Contour-Aware and Saliency Priors Embedding Network for Infrared Small Target Detection
CSPENet: Contour-Aware and Saliency Priors Embedding Network for Infrared Small Target Detection
Jiakun Deng
Kexuan Li
Xingye Cui
Jiaxuan Li
Chang Long
Tian Pu
Zhenming Peng
99
0
0
15 May 2025
Uniform Loss vs. Specialized Optimization: A Comparative Analysis in Multi-Task Learning
Uniform Loss vs. Specialized Optimization: A Comparative Analysis in Multi-Task Learning
Gabriel S. Gama
Valdir Grassi Jr
MoMe
103
0
0
15 May 2025
Robust Federated Learning with Confidence-Weighted Filtering and GAN-Based Completion under Noisy and Incomplete Data
Robust Federated Learning with Confidence-Weighted Filtering and GAN-Based Completion under Noisy and Incomplete Data
Alpaslan Gokcen
Ali Boyaci
FedML
79
0
0
14 May 2025
Variational Prefix Tuning for Diverse and Accurate Code Summarization Using Pre-trained Language Models
Variational Prefix Tuning for Diverse and Accurate Code Summarization Using Pre-trained Language Models
Junda Zhao
Yuliang Song
Eldan Cohen
102
0
0
14 May 2025
Virtual Dosimetrists: A Radiotherapy Training "Flight Simulator"
Virtual Dosimetrists: A Radiotherapy Training "Flight Simulator"
S. Gay
Tucker Netherton
Barbara Marquez
Raymond P. Mumme
Mary P. Gronberg
Brent Parker
Chelsea Pinnix
Sanjay Shete
Carlos Cardenas
Laurence Court
61
0
0
14 May 2025
Neural Multivariate Regression: Qualitative Insights from the Unconstrained Feature Model
Neural Multivariate Regression: Qualitative Insights from the Unconstrained Feature Model
Shuyang Ling
Soyuj Jung Basnet
Juan Guevara
Li Guo
George Andriopoulos
76
0
0
14 May 2025
A Large-scale Benchmark on Geological Fault Delineation Models: Domain Shift, Training Dynamics, Generalizability, Evaluation and Inferential Behavior
A Large-scale Benchmark on Geological Fault Delineation Models: Domain Shift, Training Dynamics, Generalizability, Evaluation and Inferential Behavior
Jorge Quesada
Chen Zhou
Prithwijit Chowdhury
Mohammad Alotaibi
Ahmad Mustafa
Yusufjon Kumamnov
Mohit Prabhushankar
Ghassan AlRegib
AI4CE
69
0
0
13 May 2025
Unsupervised Raindrop Removal from a Single Image using Conditional Diffusion Models
Unsupervised Raindrop Removal from a Single Image using Conditional Diffusion Models
Lhuqita Fazry
Valentino Vito
DiffM
65
0
0
13 May 2025
Low-Complexity Inference in Continual Learning via Compressed Knowledge Transfer
Low-Complexity Inference in Continual Learning via Compressed Knowledge Transfer
Zhenrong Liu
Janne M. J. Huttunen
Mikko Honkala
CLL
70
0
0
13 May 2025
Tagging fully hadronic exotic decays of the vectorlike $\mathbf{B}$ quark using a graph neural network
Tagging fully hadronic exotic decays of the vectorlike B\mathbf{B}B quark using a graph neural network
Jai Bardhan
Tanumoy Mandal
Subhadip Mitra
Cyrin Neeraj
Mihir Rawat
46
0
0
12 May 2025
Accurate and Efficient Multivariate Time Series Forecasting via Offline Clustering
Accurate and Efficient Multivariate Time Series Forecasting via Offline Clustering
Yiming Niu
Jinliang Deng
Lulu Zhang
Zimu Zhou
Yongxin Tong
AI4TS
165
0
0
09 May 2025
Document Image Rectification Bases on Self-Adaptive Multitask Fusion
Document Image Rectification Bases on Self-Adaptive Multitask Fusion
Heng Li
Xiangping Wu
Qingcai Chen
121
0
0
09 May 2025
Mask-PINNs: Regulating Feature Distributions in Physics-Informed Neural Networks
Mask-PINNs: Regulating Feature Distributions in Physics-Informed Neural Networks
Feilong Jiang
Xiaonan Hou
Jianqiao Ye
Min Xia
OODPINN
84
0
0
09 May 2025
Sparse Training from Random Initialization: Aligning Lottery Ticket Masks using Weight Symmetry
Sparse Training from Random Initialization: Aligning Lottery Ticket Masks using Weight Symmetry
Mohammed Adnan
Rohan Jain
Ekansh Sharma
Rahul Krishnan
Yani Andrew Ioannou
93
0
0
08 May 2025
Threshold Modulation for Online Test-Time Adaptation of Spiking Neural Networks
Threshold Modulation for Online Test-Time Adaptation of Spiking Neural Networks
Kejie Zhao
Wenjia Hua
Aiersi Tuerhong
Luziwei Leng
Yuxin Ma
Qinghua Guo
542
0
0
08 May 2025
Boosting Statistic Learning with Synthetic Data from Pretrained Large Models
Boosting Statistic Learning with Synthetic Data from Pretrained Large Models
Jialong Jiang
Wenkang Hu
Jian Huang
Yuling Jiao
Xu Liu
DiffM
76
0
0
08 May 2025
Previous
12345...220221222
Next