v1v2v3 (latest)

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

11 February 2015

Sergey Ioffe

Christian Szegedy

OOD

ArXiv (abs)PDF HTML

Papers citing "Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift"

50 / 11,238 papers shown

Title
On permutation symmetries in Bayesian neural network posteriors: a variational perspective Simone Rossi Ankit Singh T. Hannagan 69 3 0 16 Oct 2023
SoTTA: Robust Test-Time Adaptation on Noisy Data Streams Taesik Gong Yewon Kim Taeckyung Lee Sorn Chottananurak Sung-Ju Lee TTA 72 33 0 16 Oct 2023
Data Augmentation for Time-Series Classification: An Extensive Empirical Study and Comprehensive Survey Zijun Gao Lingbo Li AI4TS 105 9 0 16 Oct 2023
Towards Unified and Effective Domain Generalization Yiyuan Zhang Kaixiong Gong Xiaohan Ding Kaipeng Zhang Fangrui Lv Kurt Keutzer Xiangyu Yue AI4CE OOD FedML 106 4 0 16 Oct 2023
SeUNet-Trans: A Simple yet Effective UNet-Transformer Model for Medical Image Segmentation Tan-Hanh Pham Xianqi Li Kim-Doang Nguyen MedIm ViT 71 14 0 16 Oct 2023
Efficient Model-Agnostic Multi-Group Equivariant Networks Razan Baltaji Sourya Basu Lav Varshney 58 1 0 14 Oct 2023
STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning Weipu Zhang Gang Wang Jian Sun Yetian Yuan Gao Huang 104 45 0 14 Oct 2023
Learning Unified Representations for Multi-Resolution Face Recognition Hulingxiao He Wu Yuan Yidian Huang Shilong Zhao Wen Yuan Hanqin Li CVBM 35 0 0 14 Oct 2023
Pairwise Similarity Learning is SimPLE Yandong Wen Weiyang Liu Yao Feng Bhiksha Raj Rita Singh Adrian Weller Michael J. Black Bernhard Schölkopf 128 6 0 13 Oct 2023
Transformer-based Multimodal Change Detection with Multitask Consistency Constraints Biyuan Liu Huaixin Chen Kun Li Michael Ying Yang 77 16 0 13 Oct 2023
Differential Evolution Algorithm based Hyper-Parameters Selection of Convolutional Neural Network for Speech Command Recognition Sandipan Dhar Anuvab Sen Aritra Bandyopadhyay N. D. Jana Arjun Ghosh Zahra Sarayloo 48 0 0 13 Oct 2023
Overcoming Recency Bias of Normalization Statistics in Continual Learning: Balance and Adaptation Yilin Lyu Liyuan Wang Xingxing Zhang Zicheng Sun Hang Su Jun Zhu Liping Jing 80 8 0 13 Oct 2023
Splicing Up Your Predictions with RNA Contrastive Learning Phil Fradkin Ruian Shi Bo Wang Brendan Frey Leo J. Lee SSL 69 0 0 12 Oct 2023
A Symmetry-Aware Exploration of Bayesian Neural Network Posteriors Olivier Laurent Emanuel Aldea Gianni Franchi BDL UQCV 79 8 0 12 Oct 2023
NuTime: Numerically Multi-Scaled Embedding for Large-Scale Time-Series Pretraining Chenguo Lin Xumeng Wen Wei Cao Congrui Huang Jiang Bian Stephen Lin Zhirong Wu AI4TS 88 5 0 11 Oct 2023
Unsupervised Denoising for Signal-Dependent and Row-Correlated Imaging Noise Benjamin Salmon Alexander Krull 148 1 0 11 Oct 2023
Neural Bounding Wenxin Liu Michael Fischer Paul D. Yoo Tobias Ritschel 156 0 0 10 Oct 2023
Interpretable Traffic Event Analysis with Bayesian Networks Tong Yuan Jian Yang Zeyi Wen 47 0 0 10 Oct 2023
Self-Supervised Dataset Distillation for Transfer Learning Dong Bok Lee Seanie Lee Joonho Ko Kenji Kawaguchi Juho Lee Sung Ju Hwang DD 88 3 0 10 Oct 2023
Factorized Tensor Networks for Multi-Task and Multi-Domain Learning Yash Garg Nebiyou Yismaw Rakib Hyder Ashley Prater-Bennette M. Salman Asif 58 2 0 09 Oct 2023
Generative ensemble deep learning severe weather prediction from a deterministic convection-allowing model Yingkai Sha Ryan Sobash David John Gagne 51 0 0 09 Oct 2023
Based on What We Can Control Artificial Neural Networks Cheng Kang Xujing Yao 55 0 0 09 Oct 2023
Climate-sensitive Urban Planning through Optimization of Tree Placements Simon Schrodi Ferdinand Briegel Max Argus Andreas Christen Thomas Brox AI4CE 116 0 0 09 Oct 2023
Multi-timestep models for Model-based Reinforcement Learning Abdelhakim Benechehab Giuseppe Paolo Albert Thomas Maurizio Filippone Balázs Kégl OffRL 74 0 0 09 Oct 2023
Binary Classification with Confidence Difference Wei Wang Lei Feng Yuchen Jiang Gang Niu Min Zhang Masashi Sugiyama 64 7 0 09 Oct 2023
A Simple and Robust Framework for Cross-Modality Medical Image Segmentation applied to Vision Transformers Matteo Bastico David Ryckelynck Laurent Corté Yannick Tillier Etienne Decencière MedIm ViT 66 2 0 09 Oct 2023
A Comprehensive Survey on Deep Neural Image Deblurring S. A. Biyouki Hoon Hwangbo 72 2 0 07 Oct 2023
Generalized Robust Test-Time Adaptation in Continuous Dynamic Scenarios Shuangliang Li Longhui Yuan Binhui Xie Tao Yang TTA 77 2 0 07 Oct 2023
Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL Yang Yue Rui Lu Bingyi Kang Shiji Song Gao Huang OffRL 122 17 0 06 Oct 2023
Introducing the Attribution Stability Indicator: a Measure for Time Series XAI Attributions U. Schlegel Daniel A. Keim AI4TS 92 1 0 06 Oct 2023
Robust Multimodal Learning with Missing Modalities via Parameter-Efficient Adaptation Md Kaykobad Reza Ashley Prater-Bennette M. Salman Asif 74 8 0 06 Oct 2023
Accelerated Neural Network Training with Rooted Logistic Objectives Zhu Wang Praveen Raj Veluswami Harshit Mishra Sathya Ravi 68 0 0 05 Oct 2023
A Long Way to Go: Investigating Length Correlations in RLHF Prasann Singhal Tanya Goyal Jiacheng Xu Greg Durrett 160 161 0 05 Oct 2023
Robustness-Guided Image Synthesis for Data-Free Quantization Jianhong Bai Yuchen Yang Huanpeng Chu Hualiang Wang Zuo-Qiang Liu Ruizhe Chen Xiaoxuan He Lianrui Mu Chengfei Cai Haoji Hu DiffM MQ 151 5 0 05 Oct 2023
TRAM: Bridging Trust Regions and Sharpness Aware Minimization Tom Sherborne Naomi Saphra Pradeep Dasigi Hao Peng 58 5 0 05 Oct 2023
Deep Geometric Learning with Monotonicity Constraints for Alzheimer's Disease Progression Seungwoo Jeong Wonsik Jung Junghyo Sohn Heung-Il Suk 89 3 0 05 Oct 2023
PDR-CapsNet: an Energy-Efficient Parallel Approach to Dynamic Routing in Capsule Networks Samaneh Javadinia A. Baniasadi 23 2 0 04 Oct 2023
ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models Yi-Lin Sung Jaehong Yoon Mohit Bansal VLM 92 14 0 04 Oct 2023
Delving into CLIP latent space for Video Anomaly Recognition Luca Zanella Benedetta Liberatori Willi Menapace Fabio Poiesi Yiming Wang Elisa Ricci 69 27 0 04 Oct 2023
Clustering-based Image-Text Graph Matching for Domain Generalization Nokyung Park Daewon Chae Jeongyong Shim Sangpil Kim Eun-Sol Kim Jinkyu Kim OOD 61 1 0 04 Oct 2023
Dual-stage Flows-based Generative Modeling for Traceable Urban Planning Xuanming Hu Wei Fan Dongjie Wang Pengyang Wang Yong Li Yanjie Fu AI4CE 74 2 0 03 Oct 2023
FedL2P: Federated Learning to Personalize Royson Lee Minyoung Kim Da Li Xinchi Qiu Timothy M. Hospedales Ferenc Huszár Nicholas D. Lane FedML 67 0 0 03 Oct 2023
Bag of Tricks for Fully Test-Time Adaptation Saypraseuth Mounsaveng Florent Chiaroni Malik Boudiaf M. Pedersoli Ismail Ben Ayed TTA 70 7 0 03 Oct 2023
Chunking: Continual Learning is not just about Distribution Shift Thomas L. Lee Amos Storkey 78 1 0 03 Oct 2023
Towards Training Without Depth Limits: Batch Normalization Without Gradient Explosion Alexandru Meterez Amir Joudaki Francesco Orabona Alexander Immer Gunnar Rätsch Hadi Daneshmand 73 8 0 03 Oct 2023
Generative Autoencoding of Dropout Patterns Shunta Maeda SyDa 22 1 0 03 Oct 2023
Locality-Aware Graph-Rewiring in GNNs Federico Barbero A. Velingker Amin Saberi Michael M. Bronstein Francesco Di Giovanni 110 33 0 02 Oct 2023
On Training Derivative-Constrained Neural Networks KaiChieh Lo Daniel Huang 94 3 0 02 Oct 2023
PatchMixer: A Patch-Mixing Architecture for Long-Term Time Series Forecasting Zeying Gong Yujin Tang Junwei Liang KELM AI4TS 71 28 0 01 Oct 2023
RegBN: Batch Normalization of Multimodal Data with Regularization Morteza Ghahremani Christian Wachinger 99 7 0 01 Oct 2023