Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1805.11604
Cited By
How Does Batch Normalization Help Optimization?
29 May 2018
Shibani Santurkar
Dimitris Tsipras
Andrew Ilyas
A. Madry
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"How Does Batch Normalization Help Optimization?"
50 / 183 papers shown
Title
NetSight: Graph Attention Based Traffic Forecasting in Computer Networks
Jinming Xing
Guoheng Sun
Hui Sun
Linchao Pan
Shakir Mahmood
Xuanhao Luo
Muhammad Shahzad
28
0
0
11 May 2025
SPD Learning for Covariance-Based Neuroimaging Analysis: Perspectives, Methods, and Challenges
Ce Ju
Reinmar J. Kobler
Antoine Collas
M. Kawanabe
Cuntai Guan
Bertrand Thirion
43
0
0
26 Apr 2025
Decentralized Federated Domain Generalization with Style Sharing: A Formal Modeling and Convergence Analysis
Shahryar Zehtabi
Dong-Jun Han
Seyyedali Hosseinalipour
Christopher G. Brinton
FedML
AI4CE
50
0
0
08 Apr 2025
A Real-time Multimodal Transformer Neural Network-powered Wildfire Forecasting System
Qijun Chen
Shaofan Li
51
0
0
07 Mar 2025
Beyond R-barycenters: an effective averaging method on Stiefel and Grassmann manifolds
Florent Bouchard
Nils Laurent
Salem Said
N. L. Bihan
37
1
0
20 Jan 2025
Quantum Cognition-Inspired EEG-based Recommendation via Graph Neural Networks
Jinkun Han
Wei Li
Yong Li
Zhipeng Cai
45
2
0
05 Jan 2025
Data-Efficient Discovery of Hyperelastic TPMS Metamaterials with Extreme Energy Dissipation
Maxine Perroni-Scharf
Zachary Ferguson
Thomas Butrille
Carlos Portela
Mina Konaković Luković
32
0
0
29 May 2024
Hidden Synergy:
L
1
L_1
L
1
Weight Normalization and 1-Path-Norm Regularization
Aditya Biswas
41
0
0
29 Apr 2024
PillarTrack:Boosting Pillar Representation for Transformer-based 3D Single Object Tracking on Point Clouds
Weisheng Xu
Sifan Zhou
Jiaqi Xiong
Ziyu Zhao
Zhihang Yuan
45
0
0
11 Apr 2024
Linearly Constrained Weights: Reducing Activation Shift for Faster Training of Neural Networks
Takuro Kutsuna
LLMSV
32
1
0
08 Mar 2024
Overcoming Recency Bias of Normalization Statistics in Continual Learning: Balance and Adaptation
Yilin Lyu
Liyuan Wang
Xingxing Zhang
Zicheng Sun
Hang Su
Jun Zhu
Liping Jing
34
8
0
13 Oct 2023
Weakly Supervised Multi-Task Representation Learning for Human Activity Analysis Using Wearables
Taoran Sheng
M. Huber
SSL
HAI
24
20
0
06 Aug 2023
AC-Norm: Effective Tuning for Medical Image Analysis via Affine Collaborative Normalization
Chuyan Zhang
Yuncheng Yang
Hao Zheng
Yun Gu
32
0
0
28 Jul 2023
Reinterpreting survival analysis in the universal approximator age
Sören Dittmer
M. Roberts
J. Preller
AIX-COVNET Collaboration
James H. F. Rudd
J. Aston
Carola-Bibiane Schönlieb
32
0
0
25 Jul 2023
Adversarial Latent Autoencoder with Self-Attention for Structural Image Synthesis
Jiajie Fan
L. Vuaille
Hongya Wang
Thomas Bäck
AI4CE
22
5
0
19 Jul 2023
The Implicit Bias of Batch Normalization in Linear Models and Two-layer Linear Convolutional Neural Networks
Yuan Cao
Difan Zou
Yuan-Fang Li
Quanquan Gu
MLT
37
5
0
20 Jun 2023
Group channel pruning and spatial attention distilling for object detection
Yun Chu
Pu Li
Yong Bai
Zhuhua Hu
Yongqing Chen
Jiafeng Lu
VLM
24
13
0
02 Jun 2023
On the Weight Dynamics of Deep Normalized Networks
Christian H. X. Ali Mehmeti-Göpel
Michael Wand
38
1
0
01 Jun 2023
An End-to-End Vehicle Trajcetory Prediction Framework
Fuad Hasan
Hailong Huang
18
0
0
19 Apr 2023
Information Geometrically Generalized Covariate Shift Adaptation
Masanari Kimura
H. Hino
OOD
16
5
0
19 Apr 2023
End-to-end codesign of Hessian-aware quantized neural networks for FPGAs and ASICs
Javier Campos
Zhen Dong
Javier Mauricio Duarte
A. Gholami
Michael W. Mahoney
Jovan Mitrevski
Nhan Tran
MQ
32
3
0
13 Apr 2023
Inductive biases in deep learning models for weather prediction
Jannik Thümmel
Matthias Karlbauer
S. Otte
C. Zarfl
Georg Martius
...
Thomas Scholten
Ulrich Friedrich
V. Wulfmeyer
B. Goswami
Martin Volker Butz
AI4CE
43
5
0
06 Apr 2023
Self-Supervised learning for Neural Architecture Search (NAS)
Samuel Ducros
SSL
21
1
0
03 Apr 2023
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer
Sheng Xu
Yanjing Li
Mingbao Lin
Penglei Gao
Guodong Guo
Jinhu Lu
Baochang Zhang
MQ
29
23
0
01 Apr 2023
Understanding plasticity in neural networks
Clare Lyle
Zeyu Zheng
Evgenii Nikishin
Bernardo Avila-Pires
Razvan Pascanu
Will Dabney
AI4CE
35
97
0
02 Mar 2023
Combating Uncertainties in Wind and Distributed PV Energy Sources Using Integrated Reinforcement Learning and Time-Series Forecasting
Arman Ghasemi
Amin Shojaeighadikolaei
M. Hashemi
AI4TS
6
3
0
27 Feb 2023
NU-AIR -- A Neuromorphic Urban Aerial Dataset for Detection and Localization of Pedestrians and Vehicles
Craig Iaboni
Thomas Kelly
Pramod Abichandani
21
2
0
18 Feb 2023
Novel Building Detection and Location Intelligence Collection in Aerial Satellite Imagery
Sandeep Singh
Christian Wiles
A. Bilal
30
0
0
06 Feb 2023
Modality-Agnostic Variational Compression of Implicit Neural Representations
Jonathan Richard Schwarz
Jihoon Tack
Yee Whye Teh
Jaeho Lee
Jinwoo Shin
34
25
0
23 Jan 2023
Low PAPR MIMO-OFDM Design Based on Convolutional Autoencoder
Yara Huleihel
Haim Permuter
25
6
0
11 Jan 2023
Disentangled Explanations of Neural Network Predictions by Finding Relevant Subspaces
Pattarawat Chormai
J. Herrmann
Klaus-Robert Muller
G. Montavon
FAtt
48
17
0
30 Dec 2022
Stable Learning via Sparse Variable Independence
Han Yu
Peng Cui
Yue He
Zheyan Shen
Yong Lin
Renzhe Xu
Xingxuan Zhang
OOD
40
13
0
02 Dec 2022
Normal Transformer: Extracting Surface Geometry from LiDAR Points Enhanced by Visual Semantics
Ancheng Lin
Jun Yu Li
Yusheng Xiang
Wei Bian
Mukesh Prasad
3DPC
ViT
43
2
0
19 Nov 2022
We need to talk about random seeds
Steven Bethard
31
8
0
24 Oct 2022
Rethinking Normalization Methods in Federated Learning
Zhixu Du
Jingwei Sun
Ang Li
Pin-Yu Chen
Jianyi Zhang
H. Li
Yiran Chen
FedML
29
28
0
07 Oct 2022
Dynamical Isometry for Residual Networks
Advait Gadhikar
R. Burkholz
ODL
AI4CE
40
2
0
05 Oct 2022
RankMe: Assessing the downstream performance of pretrained self-supervised representations by their rank
Q. Garrido
Randall Balestriero
Laurent Najman
Yann LeCun
SSL
59
73
0
05 Oct 2022
Batch Normalization Explained
Randall Balestriero
Richard G. Baraniuk
AAML
33
16
0
29 Sep 2022
On the Pros and Cons of Momentum Encoder in Self-Supervised Visual Representation Learning
T. Pham
Chaoning Zhang
Axi Niu
Kang Zhang
Chang D. Yoo
36
11
0
11 Aug 2022
EMC2A-Net: An Efficient Multibranch Cross-channel Attention Network for SAR Target Classification
Xiang Yu
Zhe Geng
Xiaohua Huang
Qinglu Wang
Daiyin Zhu
38
5
0
03 Aug 2022
Continuous locomotion mode recognition and gait phase estimation based on a shank-mounted IMU with artificial neural networks
F. Weigand
Andreas Höhl
Julian Zeiss
U. Konigorski
M. Grimmer
22
3
0
01 Aug 2022
OCTAL: Graph Representation Learning for LTL Model Checking
Prasita Mukherjee
Haoteng Yin
Susheel Suresh
Tiark Rompf
24
4
0
24 Jul 2022
Generative Domain Adaptation for Face Anti-Spoofing
Qianyu Zhou
Ke-Yue Zhang
Taiping Yao
Ran Yi
Kekai Sheng
Shouhong Ding
Lizhuang Ma
CVBM
32
48
0
20 Jul 2022
Lipschitz Continuity Retained Binary Neural Network
Yuzhang Shang
Dan Xu
Bin Duan
Ziliang Zong
Liqiang Nie
Yan Yan
16
19
0
13 Jul 2022
PointNorm: Dual Normalization is All You Need for Point Cloud Analysis
Shen Zheng
Jinqian Pan
Chang-Tien Lu
Gaurav Gupta
3DPC
40
7
0
13 Jul 2022
Understanding and Improving Group Normalization
Agus Gunawan
Xu Yin
Kang Zhang
15
3
0
05 Jul 2022
Understanding the Generalization Benefit of Normalization Layers: Sharpness Reduction
Kaifeng Lyu
Zhiyuan Li
Sanjeev Arora
FAtt
40
70
0
14 Jun 2022
SmartGD: A GAN-Based Graph Drawing Framework for Diverse Aesthetic Goals
Xiaoqi Wang
Kevin Yen
Yifan Hu
Hang Shen
27
4
0
13 Jun 2022
SPD domain-specific batch normalization to crack interpretable unsupervised domain adaptation in EEG
Reinmar J. Kobler
J. Hirayama
Qibin Zhao
M. Kawanabe
19
53
0
02 Jun 2022
Batch Normalization Is Blind to the First and Second Derivatives of the Loss
Zhanpeng Zhou
Wen Shen
Huixin Chen
Ling Tang
Quanshi Zhang
34
2
0
30 May 2022
1
2
3
4
Next