ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03167
  4. Cited By
Batch Normalization: Accelerating Deep Network Training by Reducing
  Internal Covariate Shift
v1v2v3 (latest)

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

11 February 2015
Sergey Ioffe
Christian Szegedy
    OOD
ArXiv (abs)PDFHTML

Papers citing "Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift"

50 / 11,238 papers shown
Title
On permutation symmetries in Bayesian neural network posteriors: a
  variational perspective
On permutation symmetries in Bayesian neural network posteriors: a variational perspective
Simone Rossi
Ankit Singh
T. Hannagan
69
3
0
16 Oct 2023
SoTTA: Robust Test-Time Adaptation on Noisy Data Streams
SoTTA: Robust Test-Time Adaptation on Noisy Data Streams
Taesik Gong
Yewon Kim
Taeckyung Lee
Sorn Chottananurak
Sung-Ju Lee
TTA
72
33
0
16 Oct 2023
Data Augmentation for Time-Series Classification: An Extensive Empirical
  Study and Comprehensive Survey
Data Augmentation for Time-Series Classification: An Extensive Empirical Study and Comprehensive Survey
Zijun Gao
Lingbo Li
AI4TS
105
9
0
16 Oct 2023
Towards Unified and Effective Domain Generalization
Towards Unified and Effective Domain Generalization
Yiyuan Zhang
Kaixiong Gong
Xiaohan Ding
Kaipeng Zhang
Fangrui Lv
Kurt Keutzer
Xiangyu Yue
AI4CEOODFedML
106
4
0
16 Oct 2023
SeUNet-Trans: A Simple yet Effective UNet-Transformer Model for Medical
  Image Segmentation
SeUNet-Trans: A Simple yet Effective UNet-Transformer Model for Medical Image Segmentation
Tan-Hanh Pham
Xianqi Li
Kim-Doang Nguyen
MedImViT
71
14
0
16 Oct 2023
Efficient Model-Agnostic Multi-Group Equivariant Networks
Efficient Model-Agnostic Multi-Group Equivariant Networks
Razan Baltaji
Sourya Basu
Lav Varshney
58
1
0
14 Oct 2023
STORM: Efficient Stochastic Transformer based World Models for
  Reinforcement Learning
STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning
Weipu Zhang
Gang Wang
Jian Sun
Yetian Yuan
Gao Huang
104
45
0
14 Oct 2023
Learning Unified Representations for Multi-Resolution Face Recognition
Learning Unified Representations for Multi-Resolution Face Recognition
Hulingxiao He
Wu Yuan
Yidian Huang
Shilong Zhao
Wen Yuan
Hanqin Li
CVBM
35
0
0
14 Oct 2023
Pairwise Similarity Learning is SimPLE
Pairwise Similarity Learning is SimPLE
Yandong Wen
Weiyang Liu
Yao Feng
Bhiksha Raj
Rita Singh
Adrian Weller
Michael J. Black
Bernhard Schölkopf
128
6
0
13 Oct 2023
Transformer-based Multimodal Change Detection with Multitask Consistency
  Constraints
Transformer-based Multimodal Change Detection with Multitask Consistency Constraints
Biyuan Liu
Huaixin Chen
Kun Li
Michael Ying Yang
77
16
0
13 Oct 2023
Differential Evolution Algorithm based Hyper-Parameters Selection of
  Convolutional Neural Network for Speech Command Recognition
Differential Evolution Algorithm based Hyper-Parameters Selection of Convolutional Neural Network for Speech Command Recognition
Sandipan Dhar
Anuvab Sen
Aritra Bandyopadhyay
N. D. Jana
Arjun Ghosh
Zahra Sarayloo
48
0
0
13 Oct 2023
Overcoming Recency Bias of Normalization Statistics in Continual
  Learning: Balance and Adaptation
Overcoming Recency Bias of Normalization Statistics in Continual Learning: Balance and Adaptation
Yilin Lyu
Liyuan Wang
Xingxing Zhang
Zicheng Sun
Hang Su
Jun Zhu
Liping Jing
80
8
0
13 Oct 2023
Splicing Up Your Predictions with RNA Contrastive Learning
Splicing Up Your Predictions with RNA Contrastive Learning
Phil Fradkin
Ruian Shi
Bo Wang
Brendan Frey
Leo J. Lee
SSL
69
0
0
12 Oct 2023
A Symmetry-Aware Exploration of Bayesian Neural Network Posteriors
A Symmetry-Aware Exploration of Bayesian Neural Network Posteriors
Olivier Laurent
Emanuel Aldea
Gianni Franchi
BDLUQCV
79
8
0
12 Oct 2023
NuTime: Numerically Multi-Scaled Embedding for Large-Scale Time-Series
  Pretraining
NuTime: Numerically Multi-Scaled Embedding for Large-Scale Time-Series Pretraining
Chenguo Lin
Xumeng Wen
Wei Cao
Congrui Huang
Jiang Bian
Stephen Lin
Zhirong Wu
AI4TS
88
5
0
11 Oct 2023
Unsupervised Denoising for Signal-Dependent and Row-Correlated Imaging Noise
Unsupervised Denoising for Signal-Dependent and Row-Correlated Imaging Noise
Benjamin Salmon
Alexander Krull
148
1
0
11 Oct 2023
Neural Bounding
Neural Bounding
Wenxin Liu
Michael Fischer
Paul D. Yoo
Tobias Ritschel
156
0
0
10 Oct 2023
Interpretable Traffic Event Analysis with Bayesian Networks
Interpretable Traffic Event Analysis with Bayesian Networks
Tong Yuan
Jian Yang
Zeyi Wen
47
0
0
10 Oct 2023
Self-Supervised Dataset Distillation for Transfer Learning
Self-Supervised Dataset Distillation for Transfer Learning
Dong Bok Lee
Seanie Lee
Joonho Ko
Kenji Kawaguchi
Juho Lee
Sung Ju Hwang
DD
88
3
0
10 Oct 2023
Factorized Tensor Networks for Multi-Task and Multi-Domain Learning
Factorized Tensor Networks for Multi-Task and Multi-Domain Learning
Yash Garg
Nebiyou Yismaw
Rakib Hyder
Ashley Prater-Bennette
M. Salman Asif
58
2
0
09 Oct 2023
Generative ensemble deep learning severe weather prediction from a
  deterministic convection-allowing model
Generative ensemble deep learning severe weather prediction from a deterministic convection-allowing model
Yingkai Sha
Ryan Sobash
David John Gagne
51
0
0
09 Oct 2023
Based on What We Can Control Artificial Neural Networks
Based on What We Can Control Artificial Neural Networks
Cheng Kang
Xujing Yao
55
0
0
09 Oct 2023
Climate-sensitive Urban Planning through Optimization of Tree Placements
Climate-sensitive Urban Planning through Optimization of Tree Placements
Simon Schrodi
Ferdinand Briegel
Max Argus
Andreas Christen
Thomas Brox
AI4CE
116
0
0
09 Oct 2023
Multi-timestep models for Model-based Reinforcement Learning
Multi-timestep models for Model-based Reinforcement Learning
Abdelhakim Benechehab
Giuseppe Paolo
Albert Thomas
Maurizio Filippone
Balázs Kégl
OffRL
74
0
0
09 Oct 2023
Binary Classification with Confidence Difference
Binary Classification with Confidence Difference
Wei Wang
Lei Feng
Yuchen Jiang
Gang Niu
Min Zhang
Masashi Sugiyama
64
7
0
09 Oct 2023
A Simple and Robust Framework for Cross-Modality Medical Image
  Segmentation applied to Vision Transformers
A Simple and Robust Framework for Cross-Modality Medical Image Segmentation applied to Vision Transformers
Matteo Bastico
David Ryckelynck
Laurent Corté
Yannick Tillier
Etienne Decencière
MedImViT
66
2
0
09 Oct 2023
A Comprehensive Survey on Deep Neural Image Deblurring
A Comprehensive Survey on Deep Neural Image Deblurring
S. A. Biyouki
Hoon Hwangbo
72
2
0
07 Oct 2023
Generalized Robust Test-Time Adaptation in Continuous Dynamic Scenarios
Generalized Robust Test-Time Adaptation in Continuous Dynamic Scenarios
Shuangliang Li
Longhui Yuan
Binhui Xie
Tao Yang
TTA
77
2
0
07 Oct 2023
Understanding, Predicting and Better Resolving Q-Value Divergence in
  Offline-RL
Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL
Yang Yue
Rui Lu
Bingyi Kang
Shiji Song
Gao Huang
OffRL
122
17
0
06 Oct 2023
Introducing the Attribution Stability Indicator: a Measure for Time
  Series XAI Attributions
Introducing the Attribution Stability Indicator: a Measure for Time Series XAI Attributions
U. Schlegel
Daniel A. Keim
AI4TS
92
1
0
06 Oct 2023
Robust Multimodal Learning with Missing Modalities via
  Parameter-Efficient Adaptation
Robust Multimodal Learning with Missing Modalities via Parameter-Efficient Adaptation
Md Kaykobad Reza
Ashley Prater-Bennette
M. Salman Asif
74
8
0
06 Oct 2023
Accelerated Neural Network Training with Rooted Logistic Objectives
Accelerated Neural Network Training with Rooted Logistic Objectives
Zhu Wang
Praveen Raj Veluswami
Harshit Mishra
Sathya Ravi
68
0
0
05 Oct 2023
A Long Way to Go: Investigating Length Correlations in RLHF
A Long Way to Go: Investigating Length Correlations in RLHF
Prasann Singhal
Tanya Goyal
Jiacheng Xu
Greg Durrett
160
161
0
05 Oct 2023
Robustness-Guided Image Synthesis for Data-Free Quantization
Robustness-Guided Image Synthesis for Data-Free Quantization
Jianhong Bai
Yuchen Yang
Huanpeng Chu
Hualiang Wang
Zuo-Qiang Liu
Ruizhe Chen
Xiaoxuan He
Lianrui Mu
Chengfei Cai
Haoji Hu
DiffMMQ
151
5
0
05 Oct 2023
TRAM: Bridging Trust Regions and Sharpness Aware Minimization
TRAM: Bridging Trust Regions and Sharpness Aware Minimization
Tom Sherborne
Naomi Saphra
Pradeep Dasigi
Hao Peng
58
5
0
05 Oct 2023
Deep Geometric Learning with Monotonicity Constraints for Alzheimer's
  Disease Progression
Deep Geometric Learning with Monotonicity Constraints for Alzheimer's Disease Progression
Seungwoo Jeong
Wonsik Jung
Junghyo Sohn
Heung-Il Suk
89
3
0
05 Oct 2023
PDR-CapsNet: an Energy-Efficient Parallel Approach to Dynamic Routing in
  Capsule Networks
PDR-CapsNet: an Energy-Efficient Parallel Approach to Dynamic Routing in Capsule Networks
Samaneh Javadinia
A. Baniasadi
23
2
0
04 Oct 2023
ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language
  Models
ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models
Yi-Lin Sung
Jaehong Yoon
Mohit Bansal
VLM
92
14
0
04 Oct 2023
Delving into CLIP latent space for Video Anomaly Recognition
Delving into CLIP latent space for Video Anomaly Recognition
Luca Zanella
Benedetta Liberatori
Willi Menapace
Fabio Poiesi
Yiming Wang
Elisa Ricci
69
27
0
04 Oct 2023
Clustering-based Image-Text Graph Matching for Domain Generalization
Clustering-based Image-Text Graph Matching for Domain Generalization
Nokyung Park
Daewon Chae
Jeongyong Shim
Sangpil Kim
Eun-Sol Kim
Jinkyu Kim
OOD
61
1
0
04 Oct 2023
Dual-stage Flows-based Generative Modeling for Traceable Urban Planning
Dual-stage Flows-based Generative Modeling for Traceable Urban Planning
Xuanming Hu
Wei Fan
Dongjie Wang
Pengyang Wang
Yong Li
Yanjie Fu
AI4CE
74
2
0
03 Oct 2023
FedL2P: Federated Learning to Personalize
FedL2P: Federated Learning to Personalize
Royson Lee
Minyoung Kim
Da Li
Xinchi Qiu
Timothy M. Hospedales
Ferenc Huszár
Nicholas D. Lane
FedML
67
0
0
03 Oct 2023
Bag of Tricks for Fully Test-Time Adaptation
Bag of Tricks for Fully Test-Time Adaptation
Saypraseuth Mounsaveng
Florent Chiaroni
Malik Boudiaf
M. Pedersoli
Ismail Ben Ayed
TTA
70
7
0
03 Oct 2023
Chunking: Continual Learning is not just about Distribution Shift
Chunking: Continual Learning is not just about Distribution Shift
Thomas L. Lee
Amos Storkey
78
1
0
03 Oct 2023
Towards Training Without Depth Limits: Batch Normalization Without
  Gradient Explosion
Towards Training Without Depth Limits: Batch Normalization Without Gradient Explosion
Alexandru Meterez
Amir Joudaki
Francesco Orabona
Alexander Immer
Gunnar Rätsch
Hadi Daneshmand
73
8
0
03 Oct 2023
Generative Autoencoding of Dropout Patterns
Generative Autoencoding of Dropout Patterns
Shunta Maeda
SyDa
22
1
0
03 Oct 2023
Locality-Aware Graph-Rewiring in GNNs
Locality-Aware Graph-Rewiring in GNNs
Federico Barbero
A. Velingker
Amin Saberi
Michael M. Bronstein
Francesco Di Giovanni
110
33
0
02 Oct 2023
On Training Derivative-Constrained Neural Networks
On Training Derivative-Constrained Neural Networks
KaiChieh Lo
Daniel Huang
94
3
0
02 Oct 2023
PatchMixer: A Patch-Mixing Architecture for Long-Term Time Series
  Forecasting
PatchMixer: A Patch-Mixing Architecture for Long-Term Time Series Forecasting
Zeying Gong
Yujin Tang
Junwei Liang
KELMAI4TS
71
28
0
01 Oct 2023
RegBN: Batch Normalization of Multimodal Data with Regularization
RegBN: Batch Normalization of Multimodal Data with Regularization
Morteza Ghahremani
Christian Wachinger
99
7
0
01 Oct 2023
Previous
123...293031...223224225
Next