ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03167
  4. Cited By
Batch Normalization: Accelerating Deep Network Training by Reducing
  Internal Covariate Shift
v1v2v3 (latest)

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

11 February 2015
Sergey Ioffe
Christian Szegedy
    OOD
ArXiv (abs)PDFHTML

Papers citing "Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift"

50 / 11,269 papers shown
Title
LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization
LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization
Peng Lu
Ahmad Rashid
I. Kobyzev
Mehdi Rezagholizadeh
Philippe Langlais
61
0
0
08 May 2023
Dual Residual Attention Network for Image Denoising
Dual Residual Attention Network for Image Denoising
Wencong Wu
Shijie Liu
Yi Zhou
Yungang Zhang
Yu Xiang
67
75
0
07 May 2023
YOLOCS: Object Detection based on Dense Channel Compression for Feature
  Spatial Solidification
YOLOCS: Object Detection based on Dense Channel Compression for Feature Spatial Solidification
Lingyi Huang
Weisheng Li
Linlin Shen
Haojie Fu
Xue Xiao
Suihan Xiao
77
17
0
07 May 2023
Electromyography Signal Classification Using Deep Learning
Electromyography Signal Classification Using Deep Learning
Mekia Shigute Gaso
S. Cankurt
A. Subasi
20
4
0
06 May 2023
Physics-based network fine-tuning for robust quantitative susceptibility
  mapping from high-pass filtered phase
Physics-based network fine-tuning for robust quantitative susceptibility mapping from high-pass filtered phase
Jinwei Zhang
A. Dimov
Chao Li
Hang Zhang
Thanh D. Nguyen
P. Spincemaille
Yi Wang
52
0
0
05 May 2023
Evolution under Length Constraints for CNN Architecture design
Evolution under Length Constraints for CNN Architecture design
Ousmane Youme
J. Dembele
E. C. Ezin
C. Cambier
3DV
63
1
0
05 May 2023
CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device
  Learning
CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning
Sai Qian Zhang
Thierry Tambe
Nestor Cuevas
Gu-Yeon Wei
David Brooks
58
4
0
04 May 2023
OctFormer: Octree-based Transformers for 3D Point Clouds
OctFormer: Octree-based Transformers for 3D Point Clouds
Peng-Shuai Wang
ViT3DPC
83
88
0
04 May 2023
Input Layer Binarization with Bit-Plane Encoding
Input Layer Binarization with Bit-Plane Encoding
Lorenzo Vorabbi
Davide Maltoni
Stefano Santi
MQ
60
6
0
04 May 2023
On the Expressivity Role of LayerNorm in Transformers' Attention
On the Expressivity Role of LayerNorm in Transformers' Attention
Shaked Brody
Shiyu Jin
Xinghao Zhu
MoE
125
32
0
04 May 2023
Tensorizing flows: a tool for variational inference
Tensorizing flows: a tool for variational inference
Y. Khoo
M. Lindsey
Renana Keydar
DRL
70
4
0
03 May 2023
A Curriculum View of Robust Loss Functions
A Curriculum View of Robust Loss Functions
Zebin Ou
Yue Zhang
NoLa
77
0
0
03 May 2023
Automatic Parameterization for Aerodynamic Shape Optimization via Deep
  Geometric Learning
Automatic Parameterization for Aerodynamic Shape Optimization via Deep Geometric Learning
Zhen Wei
Pascal Fua
Michaël Bauerheim
AI4CE
21
2
0
03 May 2023
Attention Based Feature Fusion For Multi-Agent Collaborative Perception
Attention Based Feature Fusion For Multi-Agent Collaborative Perception
A. N. Ahmed
Siegfried Mercelis
Ali Anwar
54
1
0
03 May 2023
DiffFacto: Controllable Part-Based 3D Point Cloud Generation with Cross
  Diffusion
DiffFacto: Controllable Part-Based 3D Point Cloud Generation with Cross Diffusion
Kiyohiro Nakayama
Mikaela Angelina Uy
Jiahui Huang
Shihui Hu
Ke Li
Leonidas Guibas
DiffM
124
24
0
03 May 2023
Localization using Multi-Focal Spatial Attention for Masked Face
  Recognition
Localization using Multi-Focal Spatial Attention for Masked Face Recognition
Samrudhdhi B. Rangrej
Hanbyel Cho
H. Hong
James J. Clark
Dongmin Cho
JungWoo Chang
Junmo Kim
CVBM
84
1
0
03 May 2023
A Lightweight CNN-Transformer Model for Learning Traveling Salesman
  Problems
A Lightweight CNN-Transformer Model for Learning Traveling Salesman Problems
Minseop Jung
Jaeseung Lee
Jibum Kim
ViT
56
13
0
03 May 2023
MISNN: Multiple Imputation via Semi-parametric Neural Networks
MISNN: Multiple Imputation via Semi-parametric Neural Networks
Zhiqi Bu
Zongyu Dai
Yiliang Zhang
Q. Long
66
0
0
02 May 2023
Sequence Modeling with Multiresolution Convolutional Memory
Sequence Modeling with Multiresolution Convolutional Memory
Jiaxin Shi
Ke Alexander Wang
E. Fox
104
14
0
02 May 2023
The Training Process of Many Deep Networks Explores the Same
  Low-Dimensional Manifold
The Training Process of Many Deep Networks Explores the Same Low-Dimensional Manifold
Jialin Mao
Itay Griniasty
H. Teoh
Rahul Ramesh
Rubing Yang
Mark K. Transtrum
James P. Sethna
Pratik Chaudhari
3DPC
85
17
0
02 May 2023
Random Function Descent
Random Function Descent
Felix Benning
L. Döring
45
0
0
02 May 2023
MDENet: Multi-modal Dual-embedding Networks for Malware Open-set
  Recognition
MDENet: Multi-modal Dual-embedding Networks for Malware Open-set Recognition
Jingcai Guo
Yuanyuan Xu
Wenchao Xu
Yufeng Zhan
Yuxia Sun
Song Guo
100
12
0
02 May 2023
PRSeg: A Lightweight Patch Rotate MLP Decoder for Semantic Segmentation
PRSeg: A Lightweight Patch Rotate MLP Decoder for Semantic Segmentation
Yizhe Ma
Fangjian Lin
Sitong Wu
Sheng Tian
Long Yu
92
12
0
01 May 2023
File Fragment Classification using Light-Weight Convolutional Neural
  Networks
File Fragment Classification using Light-Weight Convolutional Neural Networks
Mustafa Ghaleb
K. Saaim
Muhamad Felemban
S. Al-Saleh
Ahmad S. Al-Mulhem
72
1
0
01 May 2023
Representations and Exploration for Deep Reinforcement Learning using
  Singular Value Decomposition
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
Yash Chandak
S. Thakoor
Z. Guo
Yunhao Tang
Rémi Munos
Will Dabney
Diana Borsa
98
4
0
01 May 2023
A Simplified Framework for Contrastive Learning for Node Representations
A Simplified Framework for Contrastive Learning for Node Representations
Ilgee Hong
Huy Tran
Claire Donnat
SSL
92
2
0
01 May 2023
A Direct Sampling-Based Deep Learning Approach for Inverse Medium
  Scattering Problems
A Direct Sampling-Based Deep Learning Approach for Inverse Medium Scattering Problems
Jianfeng Ning
Fuqun Han
Jun Zou
58
14
0
29 Apr 2023
When Deep Learning Meets Polyhedral Theory: A Survey
When Deep Learning Meets Polyhedral Theory: A Survey
Joey Huchette
Gonzalo Muñoz
Thiago Serra
Calvin Tsay
AI4CE
171
37
0
29 Apr 2023
A Stable and Scalable Method for Solving Initial Value PDEs with Neural
  Networks
A Stable and Scalable Method for Solving Initial Value PDEs with Neural Networks
Marc Finzi
Andres Potapczynski
M. Choptuik
A. Wilson
75
11
0
28 Apr 2023
Earning Extra Performance from Restrictive Feedbacks
Earning Extra Performance from Restrictive Feedbacks
Jing Li
Yuangang Pan
Yueming Lyu
Yinghua Yao
Yulei Sui
Ivor W. Tsang
52
3
0
28 Apr 2023
Semi-Supervised RF Fingerprinting with Consistency-Based Regularization
Semi-Supervised RF Fingerprinting with Consistency-Based Regularization
Weidong Wang
Cheng Luo
Jiancheng An
Lu Gan
H. Liao
Chau Yuen
51
13
0
28 Apr 2023
Hyperparameter Optimization through Neural Network Partitioning
Hyperparameter Optimization through Neural Network Partitioning
Bruno Mlodozeniec
M. Reisser
Christos Louizos
96
8
0
28 Apr 2023
Improve Video Representation with Temporal Adversarial Augmentation
Improve Video Representation with Temporal Adversarial Augmentation
Jinhao Duan
Quanfu Fan
Hao-Ran Cheng
Xiaoshuang Shi
Kaidi Xu
AAMLAI4TSViT
58
2
0
28 Apr 2023
Blind Signal Separation for Fast Ultrasound Computed Tomography
Blind Signal Separation for Fast Ultrasound Computed Tomography
Takumi Noda
Yuusuke Jinnai
Naoki Tomii
T. Azuma
15
2
0
27 Apr 2023
A Review of Panoptic Segmentation for Mobile Mapping Point Clouds
A Review of Panoptic Segmentation for Mobile Mapping Point Clouds
Binbin Xiang
Yuanwen Yue
T. Peters
Konrad Schindler
3DPC
83
8
0
27 Apr 2023
Noise Is Not the Main Factor Behind the Gap Between SGD and Adam on
  Transformers, but Sign Descent Might Be
Noise Is Not the Main Factor Behind the Gap Between SGD and Adam on Transformers, but Sign Descent Might Be
Frederik Kunstner
Jacques Chen
J. Lavington
Mark Schmidt
100
75
0
27 Apr 2023
Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in
  Self-supervised Learning
Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning
Casey Meehan
Florian Bordes
Pascal Vincent
Kamalika Chaudhuri
Chuan Guo
86
18
0
26 Apr 2023
Effect of latent space distribution on the segmentation of images with
  multiple annotations
Effect of latent space distribution on the segmentation of images with multiple annotations
Ishaan Bhat
J. Pluim
M. Viergever
Hugo J. Kuijf
44
6
0
26 Apr 2023
Uncovering the Representation of Spiking Neural Networks Trained with
  Surrogate Gradient
Uncovering the Representation of Spiking Neural Networks Trained with Surrogate Gradient
Yuhang Li
Youngeun Kim
Hyoungseob Park
Priyadarshini Panda
127
16
0
25 Apr 2023
Room dimensions and absorption inference from room transfer function via
  machine learning
Room dimensions and absorption inference from room transfer function via machine learning
Yuanxin Xia
C. Jeong
41
2
0
25 Apr 2023
Parallel Spiking Neurons with High Efficiency and Ability to Learn
  Long-term Dependencies
Parallel Spiking Neurons with High Efficiency and Ability to Learn Long-term Dependencies
Wei Fang
Zhaofei Yu
Zhaokun Zhou
Ding Chen
Yanqing Chen
Zhengyu Ma
T. Masquelier
Yonghong Tian
92
47
0
25 Apr 2023
Restoring Original Signal From Pile-up Signal using Deep Learning
Restoring Original Signal From Pile-up Signal using Deep Learning
C. H. Kim
S. Ahn
K. Y. Chae
J. Hooker
G. Rogachev
18
1
0
24 Apr 2023
Universal Domain Adaptation via Compressive Attention Matching
Universal Domain Adaptation via Compressive Attention Matching
Didi Zhu
Yincuan Li
Junkun Yuan
Zexi Li
Kun Kuang
Chao Wu
80
21
0
24 Apr 2023
Accurate and Efficient Event-based Semantic Segmentation Using Adaptive
  Spiking Encoder-Decoder Network
Accurate and Efficient Event-based Semantic Segmentation Using Adaptive Spiking Encoder-Decoder Network
Rui Zhang
Luziwei Leng
Kaiwei Che
Hu Zhang
Jieling Cheng
Qinghai Guo
Jiangxing Liao
Ran Cheng
97
12
0
24 Apr 2023
Function-Consistent Feature Distillation
Function-Consistent Feature Distillation
Dongyang Liu
Meina Kan
Shiguang Shan
Xilin Chen
112
19
0
24 Apr 2023
End-to-End Feasible Optimization Proxies for Large-Scale Economic
  Dispatch
End-to-End Feasible Optimization Proxies for Large-Scale Economic Dispatch
Wenbo Chen
Mathieu Tanneau
Pascal Van Hentenryck
105
36
0
23 Apr 2023
The Disharmony between BN and ReLU Causes Gradient Explosion, but is
  Offset by the Correlation between Activations
The Disharmony between BN and ReLU Causes Gradient Explosion, but is Offset by the Correlation between Activations
Inyoung Paik
Jaesik Choi
81
1
0
23 Apr 2023
StyLess: Boosting the Transferability of Adversarial Examples
StyLess: Boosting the Transferability of Adversarial Examples
Kaisheng Liang
Bin Xiao
AAML
71
18
0
23 Apr 2023
Effective Neural Network $L_0$ Regularization With BinMask
Effective Neural Network L0L_0L0​ Regularization With BinMask
Kai Jia
Martin Rinard
77
3
0
21 Apr 2023
Task-Adaptive Pseudo Labeling for Transductive Meta-Learning
Task-Adaptive Pseudo Labeling for Transductive Meta-Learning
Sang-Hoon Lee
Seunghyun Lee
B. Song
87
0
0
21 Apr 2023
Previous
123...414243...224225226
Next