ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03167
  4. Cited By
Batch Normalization: Accelerating Deep Network Training by Reducing
  Internal Covariate Shift
v1v2v3 (latest)

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

11 February 2015
Sergey Ioffe
Christian Szegedy
    OOD
ArXiv (abs)PDFHTML

Papers citing "Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift"

50 / 11,238 papers shown
Title
Using convolutional neural networks for stereological characterization
  of 3D hetero-aggregates based on synthetic STEM data
Using convolutional neural networks for stereological characterization of 3D hetero-aggregates based on synthetic STEM data
Lukas Fuchs
Tom Kirstein
Christoph Mahr
O. Furat
Valentin Baric
Andreas Rosenauer
Lutz Mädler
Volker Schmidt
3DV
66
3
0
27 Oct 2023
Semi-Supervised Panoptic Narrative Grounding
Semi-Supervised Panoptic Narrative Grounding
Danni Yang
Jiayi Ji
Xiaoshuai Sun
Haowei Wang
Yinan Li
Yiwei Ma
Rongrong Ji
84
5
0
27 Oct 2023
DP-SGD with weight clipping
DP-SGD with weight clipping
Antoine Barczewski
Jan Ramon
94
1
0
27 Oct 2023
A Spectral Condition for Feature Learning
A Spectral Condition for Feature Learning
Greg Yang
James B. Simon
Jeremy Bernstein
115
33
0
26 Oct 2023
Learning depth from monocular video sequences
Learning depth from monocular video sequences
Zhenwei Luo
VLMMDE
32
0
0
26 Oct 2023
Transformer-based Atmospheric Density Forecasting
Transformer-based Atmospheric Density Forecasting
Julia Briden
P. M. Siew
Victor Rodríguez-Fernández
Richard Linares
AI4CE
6
3
0
25 Oct 2023
LightSpeed: Light and Fast Neural Light Fields on Mobile Devices
LightSpeed: Light and Fast Neural Light Fields on Mobile Devices
Aarush Gupta
Junli Cao
Chaoyang Wang
Ju Hu
Sergey Tulyakov
Jian Ren
László A. Jeni
94
10
0
25 Oct 2023
MixerFlow: MLP-Mixer meets Normalising Flows
MixerFlow: MLP-Mixer meets Normalising Flows
Eshant English
Matthias Kirchler
Christoph Lippert
TPM
74
0
0
25 Oct 2023
Dynamic Processing Neural Network Architecture For Hearing Loss
  Compensation
Dynamic Processing Neural Network Architecture For Hearing Loss Compensation
S. Drgas
Lars Bramsløw
Archontis Politis
Gaurav Naithani
Tuomas Virtanen
22
2
0
25 Oct 2023
Gramian Attention Heads are Strong yet Efficient Vision Learners
Gramian Attention Heads are Strong yet Efficient Vision Learners
Jongbin Ryu
Dongyoon Han
J. Lim
102
2
0
25 Oct 2023
ClearMark: Intuitive and Robust Model Watermarking via Transposed Model
  Training
ClearMark: Intuitive and Robust Model Watermarking via Transposed Model Training
T. Krauß
Jasper Stang
Alexandra Dmitrienko
AAML
109
0
0
25 Oct 2023
Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked
  Auto-Encoder
Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder
Huiwon Jang
Jihoon Tack
Daewon Choi
Jongheon Jeong
Jinwoo Shin
76
3
0
25 Oct 2023
Instance-wise Linearization of Neural Network for Model Interpretation
Instance-wise Linearization of Neural Network for Model Interpretation
Zhimin Li
Shusen Liu
B. Kailkhura
Timo Bremer
Valerio Pascucci
MILMFAtt
62
0
0
25 Oct 2023
MotionAGFormer: Enhancing 3D Human Pose Estimation with a
  Transformer-GCNFormer Network
MotionAGFormer: Enhancing 3D Human Pose Estimation with a Transformer-GCNFormer Network
Soroush Mehraban
Vida Adeli
Babak Taati
ViT
130
46
0
25 Oct 2023
Multi-label Text Classification using GloVe and Neural Network Models
Multi-label Text Classification using GloVe and Neural Network Models
Hongren Wang
64
0
0
25 Oct 2023
Pixel-Level Clustering Network for Unsupervised Image Segmentation
Pixel-Level Clustering Network for Unsupervised Image Segmentation
Cuong Manh Hoang
Byeongkeun Kang
SSeg
101
22
0
24 Oct 2023
Image Segmentation using U-Net Architecture for Powder X-ray Diffraction
  Images
Image Segmentation using U-Net Architecture for Powder X-ray Diffraction Images
Howard Yanxon
Eric J. Roberts
Hannah Parraga
James Weng
Wenqian Xu
Uta Ruett
Alexander Hexemer
Petrus H. Zwart
Nickolas Schwarz
48
1
0
24 Oct 2023
Physically Explainable Deep Learning for Convective Initiation
  Nowcasting Using GOES-16 Satellite Observations
Physically Explainable Deep Learning for Convective Initiation Nowcasting Using GOES-16 Satellite Observations
Da Fan
S. Greybush
David John Gagne
E. Clothiaux
76
2
0
24 Oct 2023
Neural Collapse in Multi-label Learning with Pick-all-label Loss
Neural Collapse in Multi-label Learning with Pick-all-label Loss
Pengyu Li
Xiao Li
Yutong Wang
Qing Qu
67
9
0
24 Oct 2023
GNeSF: Generalizable Neural Semantic Fields
GNeSF: Generalizable Neural Semantic Fields
Hanlin Chen
Chen Li
Mengqi Guo
Zhiwen Yan
Gim Hee Lee
68
12
0
24 Oct 2023
How Much Context Does My Attention-Based ASR System Need?
How Much Context Does My Attention-Based ASR System Need?
Robert Flynn
Anton Ragni
65
2
0
24 Oct 2023
Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio
  Models
Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio Models
Florian Schmid
Khaled Koutini
Gerhard Widmer
47
11
0
24 Oct 2023
I$^2$MD: 3D Action Representation Learning with Inter- and Intra-modal
  Mutual Distillation
I2^22MD: 3D Action Representation Learning with Inter- and Intra-modal Mutual Distillation
Yunyao Mao
Jiajun Deng
Wen-gang Zhou
Zhenbo Lu
Wanli Ouyang
Houqiang Li
VLM
85
1
0
24 Oct 2023
G2-MonoDepth: A General Framework of Generalized Depth Inference from
  Monocular RGB+X Data
G2-MonoDepth: A General Framework of Generalized Depth Inference from Monocular RGB+X Data
Haotian Wang
Meng Yang
Nanning Zheng
VLMMDE
118
8
0
24 Oct 2023
Unlocking the Transferability of Tokens in Deep Models for Tabular Data
Unlocking the Transferability of Tokens in Deep Models for Tabular Data
Qi-Le Zhou
Han-Jia Ye
Le-Ye Wang
De-Chuan Zhan
132
10
0
23 Oct 2023
Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules
Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules
Zhiyuan Liu
Yaorui Shi
An Zhang
Enzhi Zhang
Kenji Kawaguchi
Xiang Wang
Tat-Seng Chua
AI4CE
93
40
0
23 Oct 2023
Extended Deep Adaptive Input Normalization for Preprocessing Time Series
  Data for Neural Networks
Extended Deep Adaptive Input Normalization for Preprocessing Time Series Data for Neural Networks
Marcus A. K. September
Francesco Sanna Passino
Leonie Goldmann
Anton Hinel
AI4TS
27
3
0
23 Oct 2023
Cross-Domain HAR: Few Shot Transfer Learning for Human Activity
  Recognition
Cross-Domain HAR: Few Shot Transfer Learning for Human Activity Recognition
Megha Thukral
H. Haresamudram
Thomas Ploetz
97
8
0
22 Oct 2023
Toward Flare-Free Images: A Survey
Toward Flare-Free Images: A Survey
Yousef Kotp
Marwan Torki
81
3
0
22 Oct 2023
Neural Multi-Objective Combinatorial Optimization with Diversity
  Enhancement
Neural Multi-Objective Combinatorial Optimization with Diversity Enhancement
Jinbiao Chen
Zizhen Zhang
Zhiguang Cao
Yaoxin Wu
Yining Ma
Te Ye
Jiahai Wang
85
13
0
22 Oct 2023
Gradual Domain Adaptation: Theory and Algorithms
Gradual Domain Adaptation: Theory and Algorithms
Yifei He
Haoxiang Wang
Bo Li
Han Zhao
CLL
140
6
0
20 Oct 2023
Data-Free Knowledge Distillation Using Adversarially Perturbed OpenGL
  Shader Images
Data-Free Knowledge Distillation Using Adversarially Perturbed OpenGL Shader Images
Logan Frank
Jim Davis
77
1
0
20 Oct 2023
Learning with Unmasked Tokens Drives Stronger Vision Learners
Learning with Unmasked Tokens Drives Stronger Vision Learners
Taekyung Kim
Sanghyuk Chun
Byeongho Heo
Dongyoon Han
SSL
100
2
0
20 Oct 2023
ScaleLong: Towards More Stable Training of Diffusion Model via Scaling
  Network Long Skip Connection
ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection
Zhongzhan Huang
Pan Zhou
Shuicheng Yan
Liang Lin
93
27
0
20 Oct 2023
DeepFDR: A Deep Learning-based False Discovery Rate Control Method for
  Neuroimaging Data
DeepFDR: A Deep Learning-based False Discovery Rate Control Method for Neuroimaging Data
Taehyo Kim
Hai Shu
Qiran Jia
Mony de Leon
49
1
0
20 Oct 2023
To grok or not to grok: Disentangling generalization and memorization on
  corrupted algorithmic datasets
To grok or not to grok: Disentangling generalization and memorization on corrupted algorithmic datasets
Darshil Doshi
Aritra Das
Tianyu He
Andrey Gromov
OOD
112
7
0
19 Oct 2023
Improved Operator Learning by Orthogonal Attention
Improved Operator Learning by Orthogonal Attention
Zipeng Xiao
Zhongkai Hao
Bokai Lin
Zhijie Deng
Hang Su
126
21
0
19 Oct 2023
MTS-LOF: Medical Time-Series Representation Learning via
  Occlusion-Invariant Features
MTS-LOF: Medical Time-Series Representation Learning via Occlusion-Invariant Features
Huayu Li
Ana S. Carreon-Rascon
Xiwen Chen
Geng Yuan
Ao Li
AI4TS
38
5
0
19 Oct 2023
Differential Equation Scaling Limits of Shaped and Unshaped Neural
  Networks
Differential Equation Scaling Limits of Shaped and Unshaped Neural Networks
Mufan Li
Mihai Nica
79
2
0
18 Oct 2023
Recasting Continual Learning as Sequence Modeling
Recasting Continual Learning as Sequence Modeling
Soochan Lee
Jaehyeon Son
Gunhee Kim
CLL
61
10
0
18 Oct 2023
Learning to Generate Parameters of ConvNets for Unseen Image Data
Learning to Generate Parameters of ConvNets for Unseen Image Data
Shiye Wang
Kaituo Feng
Changsheng Li
Ye Yuan
Guoren Wang
100
1
0
18 Oct 2023
Video Super-Resolution Using a Grouped Residual in Residual Network
Video Super-Resolution Using a Grouped Residual in Residual Network
MohammadHossein Ashoori
Arash Amini
SupR
101
0
0
17 Oct 2023
USDC: Unified Static and Dynamic Compression for Visual Transformer
USDC: Unified Static and Dynamic Compression for Visual Transformer
Huan Yuan
Chao Liao
Jianchao Tan
Peng Yao
Jiyuan Jia
Bin Chen
Chengru Song
Di Zhang
ViT
44
0
0
17 Oct 2023
United We Stand: Using Epoch-wise Agreement of Ensembles to Combat
  Overfit
United We Stand: Using Epoch-wise Agreement of Ensembles to Combat Overfit
Uri Stern
Daniel Shwartz
D. Weinshall
80
1
0
17 Oct 2023
TEQ: Trainable Equivalent Transformation for Quantization of LLMs
TEQ: Trainable Equivalent Transformation for Quantization of LLMs
Wenhua Cheng
Yiyang Cai
Kaokao Lv
Haihao Shen
MQ
96
7
0
17 Oct 2023
Active Learning Framework for Cost-Effective TCR-Epitope Binding
  Affinity Prediction
Active Learning Framework for Cost-Effective TCR-Epitope Binding Affinity Prediction
Pengfei Zhang
Seojin Bang
Heewook Lee
109
1
0
16 Oct 2023
A representation learning approach to probe for dynamical dark energy in
  matter power spectra
A representation learning approach to probe for dynamical dark energy in matter power spectra
Davide Piras
Lucas Lombriser
65
2
0
16 Oct 2023
LocSelect: Target Speaker Localization with an Auditory Selective
  Hearing Mechanism
LocSelect: Target Speaker Localization with an Auditory Selective Hearing Mechanism
Yu Chen
Xinyuan Qian
Zexu Pan
Kainan Chen
Haizhou Li
60
3
0
16 Oct 2023
Passive Inference Attacks on Split Learning via Adversarial
  Regularization
Passive Inference Attacks on Split Learning via Adversarial Regularization
Xiaochen Zhu
Xinjian Luo
Yuncheng Wu
Yangfan Jiang
Xiaokui Xiao
Beng Chin Ooi
FedML
76
9
0
16 Oct 2023
Equivariant Matrix Function Neural Networks
Equivariant Matrix Function Neural Networks
Ilyes Batatia
Lars L. Schaaf
Huajie Chen
Gábor Csányi
Christoph Ortner
Felix A. Faber
87
6
0
16 Oct 2023
Previous
123...282930...223224225
Next