Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.09820
Cited By
A disciplined approach to neural network hyper-parameters: Part 1 -- learning rate, batch size, momentum, and weight decay
26 March 2018
L. Smith
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A disciplined approach to neural network hyper-parameters: Part 1 -- learning rate, batch size, momentum, and weight decay"
50 / 86 papers shown
Title
Calibration and Uncertainty for multiRater Volume Assessment in multiorgan Segmentation (CURVAS) challenge results
Meritxell Riera-Marin
S. Ko
Julia Rodriguez-Comas
Matthias Stefan May
Zhaohong Pan
...
Anton Aubanell
Andreu Antolin
Javier Garcia-Lopez
M. A. G. Ballester
Adrian Galdran
UQCV
43
0
0
13 May 2025
Towards order of magnitude X-ray dose reduction in breast cancer imaging using phase contrast and deep denoising
Ashkan Pakzad
Robert Turnbull
Simon Mutch
Thomas A. Leatham
Darren Lockie
...
Amir Entezam
Seyedamir T. Taba
Patrick C. Brennan
Timur E. Gureyev
Harry M. Quiney
MedIm
28
0
0
09 May 2025
Enhancing Cell Counting through MLOps: A Structured Approach for Automated Cell Analysis
Matteo Testi
Luca Clissa
Matteo Ballabio
Salvatore Ricciardi
Federico Baldo
Emanuele Frontoni
S. Moccia
Gennario Vessio
74
0
0
28 Apr 2025
Compressibility Analysis for the differentiable shift-variant Filtered Backprojection Model
Chengze Ye
Linda-Sophie Schneider
Yipeng Sun
Mareike Thies
Andreas K. Maier
39
0
0
20 Jan 2025
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
Weihao Zeng
Yuzhen Huang
Lulu Zhao
Yijun Wang
Zifei Shan
Junxian He
LRM
35
7
0
23 Dec 2024
On the Performance Analysis of Momentum Method: A Frequency Domain Perspective
Xianliang Li
Jun Luo
Zhiwei Zheng
Hanxiao Wang
Li Luo
Lingkun Wen
Linlong Wu
Sheng Xu
72
0
0
29 Nov 2024
The Role of Deep Learning Regularizations on Actors in Offline RL
Denis Tarasov
Anja Surina
Çağlar Gülçehre
OffRL
AI4CE
48
1
0
11 Sep 2024
Label-free Monitoring of Self-Supervised Learning Progress
Isaac Xu
Scott Lowe
Thomas Trappenberg
32
1
0
10 Sep 2024
BenthicNet: A global compilation of seafloor images for deep learning applications
Scott C. Lowe
B. Misiuk
Isaac Xu
Shakhboz Abdulazizov
A. R. Baroi
...
Jordan A. Thomson
Brittany R. Wilson
Melisa C. Wong
Craig J. Brown
Thomas Trappenberg
49
3
0
08 May 2024
Image segmentation of treated and untreated tumor spheroids by Fully Convolutional Networks
Matthias Streller
S. Michlíková
Willy Ciecior
Katharina Lönnecke
L. Kunz-Schughart
Steffen Lange
Anja Voss-Böhme
46
1
0
02 May 2024
FisheyeDetNet: 360° Surround view Fisheye Camera based Object Detection System for Autonomous Driving
Ganesh Sistu
S. Yogamani
31
0
0
20 Apr 2024
A Comparison of Deep Learning Architectures for Spacecraft Anomaly Detection
Daniel Lakey
Tim Schlippe
29
2
0
19 Mar 2024
Better Schedules for Low Precision Training of Deep Neural Networks
Cameron R. Wolfe
Anastasios Kyrillidis
42
1
0
04 Mar 2024
Artificial Bee Colony optimization of Deep Convolutional Neural Networks in the context of Biomedical Imaging
Adri Gomez Martin
Carlos Fernandez del Cerro
Monica Abella Garcia
Manuel Desco Menendez
33
0
0
23 Feb 2024
Physics-informed Deep Learning to Solve Three-dimensional Terzaghi Consolidation Equation: Forward and Inverse Problems
Biao Yuan
Ana Heitor
He Wang
Xiaohui Chen
AI4CE
PINN
29
1
0
08 Jan 2024
VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data
Jian Shi
Peter Wonka
3DPC
40
0
0
11 Dec 2023
Using Learnable Physics for Real-Time Exercise Form Recommendations
Abhishek Jaiswal
Gautam Chauhan
Nisheeth Srivastava
16
1
0
11 Oct 2023
Reviewing 3D Object Detectors in the Context of High-Resolution 3+1D Radar
Patrick Palmer
Martin Krueger
R. Altendorfer
Ganesh Adam
Torsten Bertram
3DPC
21
9
0
10 Aug 2023
Robust Surgical Tools Detection in Endoscopic Videos with Noisy Data
Adnan Qayyum
Hassan Ali
Massimo Caputo
H. Vohra
Taofeek Akinosho
Sofiat Abioye
Ilhem Berrou
Paweł Capik
Junaid Qadir
Muhammad Bilal
27
0
0
03 Jul 2023
Phase transitions in the mini-batch size for sparse and dense two-layer neural networks
Raffaele Marino
F. Ricci-Tersenghi
27
14
0
10 May 2023
The R-mAtrIx Net
Shailesh Lal
Suvajit Majumder
E. Sobko
24
5
0
14 Apr 2023
Isolated Sign Language Recognition based on Tree Structure Skeleton Images
David Laines
G. Bejarano
M. González-Mendoza
Gilberto Ochoa-Ruiz
SLR
24
12
0
10 Apr 2023
Training Strategies for Vision Transformers for Object Detection
Apoorv Singh
23
4
0
05 Apr 2023
Enhanced detection of the presence and severity of COVID-19 from CT scans using lung segmentation
R. Turnbull
27
2
0
16 Mar 2023
Novel Building Detection and Location Intelligence Collection in Aerial Satellite Imagery
Sandeep Singh
Christian Wiles
A. Bilal
18
0
0
06 Feb 2023
TAME: Attention Mechanism Based Feature Fusion for Generating Explanation Maps of Convolutional Neural Networks
Mariano V. Ntrougkas
Nikolaos Gkalelis
Vasileios Mezaris
FAtt
13
8
0
18 Jan 2023
Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection
Xin Li
Botian Shi
Yuenan Hou
Xingjiao Wu
Tianlong Ma
Yikang Li
Liangbo He
3DPC
10
49
0
18 Oct 2022
Adaptive Smoothness-weighted Adversarial Training for Multiple Perturbations with Its Stability Analysis
Jiancong Xiao
Zeyu Qin
Yanbo Fan
Baoyuan Wu
Jue Wang
Zhimin Luo
AAML
31
7
0
02 Oct 2022
Combining Metric Learning and Attention Heads For Accurate and Efficient Multilabel Image Classification
K. Prokofiev
V. Sovrasov
VLM
20
9
0
14 Sep 2022
Efficient Augmentation for Imbalanced Deep Learning
Damien Dablain
C. Bellinger
Bartosz Krawczyk
Nitesh V. Chawla
24
7
0
13 Jul 2022
hmBERT: Historical Multilingual Language Models for Named Entity Recognition
Stefan Schweter
Luisa März
Katharina Schmid
Erion cCano
35
18
0
31 May 2022
Assessing Demographic Bias Transfer from Dataset to Model: A Case Study in Facial Expression Recognition
Iris Dominguez-Catena
D. Paternain
M. Galar
31
12
0
20 May 2022
Virtual Analog Modeling of Distortion Circuits Using Neural Ordinary Differential Equations
Jan Wilczek
Alec Wright
Vesa Valimaki
Emanuel Habets
19
4
0
04 May 2022
TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models
Joel Jang
Seonghyeon Ye
Changho Lee
Sohee Yang
Joongbo Shin
Janghoon Han
Gyeonghun Kim
Minjoon Seo
CLL
KELM
24
91
0
29 Apr 2022
Dense Voxel Fusion for 3D Object Detection
Anas Mahmoud
Jordan S. K. Hu
Steven L. Waslander
3DPC
20
45
0
02 Mar 2022
Does prior knowledge in the form of multiple low-dose PET images (at different dose levels) improve standard-dose PET prediction?
Behnoush Sanaei
R. Faghihi
Hossein ARABI
MedIm
13
8
0
22 Feb 2022
Multi-task UNet: Jointly Boosting Saliency Prediction and Disease Classification on Chest X-ray Images
Hongzhi Zhu
R. Rohling
Septimiu Salcudean
14
4
0
15 Feb 2022
Hyperparameter Optimization for COVID-19 Chest X-Ray Classification
I. Hamdi
Muhammad Ridzuan
Mohammad Yaqub
LM&MA
102
0
0
26 Jan 2022
AttentionHTR: Handwritten Text Recognition Based on Attention Encoder-Decoder Networks
Dmitrijs Kass
Ekta Vats
HAI
37
28
0
23 Jan 2022
Application of Machine Learning in understanding plant virus pathogenesis: Trends and perspectives on emergence, diagnosis, host-virus interplay and management
Dibyendu Ghosh
Srija Chakraborty
H. Kodamana
S. Chakraborty
16
18
0
03 Dec 2021
Implicit Equivariance in Convolutional Networks
Naman Khetan
Tushar Arora
S. U. Rehman
D. K. Gupta
31
4
0
28 Nov 2021
MiNet: A Convolutional Neural Network for Identifying and Categorising Minerals
Emmanuel Asiedu Brempong
M. Agangiba
Daniel C. Aikins
14
7
0
22 Nov 2021
An Analysis of the Influence of Transfer Learning When Measuring the Tortuosity of Blood Vessels
Matheus V. da Silva
Julie Ouellette
Baptiste Lacoste
C. H. Comin
11
7
0
19 Nov 2021
Logical Activation Functions: Logit-space equivalents of Probabilistic Boolean Operators
S. Lowe
Robert C. Earle
Jason dÉon
Thomas Trappenberg
Sageev Oore
20
1
0
22 Oct 2021
Multi-label Classification with Partial Annotations using Class-aware Selective Loss
Emanuel Ben-Baruch
T. Ridnik
Itamar Friedman
Avi Ben-Cohen
Nadav Zamir
Asaf Noy
Lihi Zelnik-Manor
30
38
0
21 Oct 2021
BNAS v2: Learning Architectures for Binary Networks with Empirical Improvements
Dahyun Kim
Kunal Pratap Singh
Jonghyun Choi
MQ
40
7
0
16 Oct 2021
Large Learning Rate Tames Homogeneity: Convergence and Balancing Effect
Yuqing Wang
Minshuo Chen
T. Zhao
Molei Tao
AI4CE
55
40
0
07 Oct 2021
Is Attention always needed? A Case Study on Language Identification from Speech
A. Mandal
Santanu Pal
Indranil Dutta
Mahidas Bhattacharya
S. Naskar
19
6
0
05 Oct 2021
Traffic-Net: 3D Traffic Monitoring Using a Single Camera
Mahdi Rezaei
Mohsen Azarmi
Farzam Mohammad Pour Mir
33
19
0
19 Sep 2021
Query2Label: A Simple Transformer Way to Multi-Label Classification
Shilong Liu
Lei Zhang
Xiao Yang
Hang Su
Jun Zhu
10
187
0
22 Jul 2021
1
2
Next