Large Batch Training of Convolutional Networks

13 August 2017

Yang You

Igor Gitman

Boris Ginsburg

ODL

ArXiv PDF HTML

Papers citing "Large Batch Training of Convolutional Networks"

50 / 545 papers shown

Title
AdaFisher: Adaptive Second Order Optimization via Fisher Information Damien Martins Gomes Yanlei Zhang Eugene Belilovsky Guy Wolf Mahdi S. Hosseini ODL 78 2 0 26 May 2024
Integrating Present and Past in Unsupervised Continual Learning Yipeng Zhang Laurent Charlin R. Zemel Mengye Ren CLL 43 3 0 29 Apr 2024
Pretraining Billion-scale Geospatial Foundational Models on Frontier A. Tsaris P. Dias Abhishek Potnis Junqi Yin Feiyi Wang D. Lunga AI4CE 38 4 0 17 Apr 2024
FairCLIP: Harnessing Fairness in Vision-Language Learning Yan Luo Minfei Shi Muhammad Osama Khan Muhammad Muneeb Afzal Hao Huang ... Luo Song Ava Kouhana T. Elze Yi Fang Mengyu Wang VLM 42 32 0 29 Mar 2024
Tiny Machine Learning: Progress and Futures Ji Lin Ligeng Zhu Wei-Ming Chen Wei-Chen Wang Song Han 52 51 0 28 Mar 2024
OrCo: Towards Better Generalization via Orthogonality and Contrast for Few-Shot Class-Incremental Learning Noor Ahmed Anna Kukleva Bernt Schiele CLL 40 12 0 27 Mar 2024
Branch-Tuning: Balancing Stability and Plasticity for Continual Self-Supervised Learning Wenzhuo Liu Fei Zhu Cheng-Lin Liu CLL 43 2 0 27 Mar 2024
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning Rui Pan Xiang Liu Shizhe Diao Renjie Pi Jipeng Zhang Chi Han Tong Zhang 46 37 0 26 Mar 2024
Self-Supervised Backbone Framework for Diverse Agricultural Vision Tasks Sudhir Sornapudi Rajhans Singh Corteva Agriscience SSL 25 1 0 22 Mar 2024
Can Generative Models Improve Self-Supervised Representation Learning? Sana Ayromlou Arash Afkanpour Vahid Reza Khazaie Fereshteh Forghani 40 3 0 09 Mar 2024
Augmentations vs Algorithms: What Works in Self-Supervised Learning Warren Morningstar Alex Bijamov Chris Duvarney Luke Friedman Neha Kalibhat ... Philip Mansfield Renan A. Rojas-Gomez Karan Singhal Bradley Green Sushant Prakash SSL 38 10 0 08 Mar 2024
Self-Supervised Multiple Instance Learning for Acute Myeloid Leukemia Classification Salome Kazeminia Max Joosten D. Bosnacki Carsten Marr 30 2 0 08 Mar 2024
Towards Calibrated Deep Clustering Network Yuheng Jia Jianhong Cheng Hui Liu Junhui Hou UQCV 53 1 0 04 Mar 2024
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks Xiangxiang Chu Jianlin Su Bo-Wen Zhang Chunhua Shen MLLM 44 10 0 01 Mar 2024
Learning and Leveraging World Models in Visual Representation Learning Q. Garrido Mahmoud Assran Nicolas Ballas Adrien Bardes Laurent Najman Yann LeCun SSL 46 24 0 01 Mar 2024
A Large-scale Evaluation of Pretraining Paradigms for the Detection of Defects in Electroluminescence Solar Cell Images David Torpey Lawrence Pratt Richard Klein 32 0 0 27 Feb 2024
Parallelized Midpoint Randomization for Langevin Monte Carlo Lu Yu A. Dalalyan 36 6 0 22 Feb 2024
Implicit Bias in Noisy-SGD: With Applications to Differentially Private Training Tom Sander Maxime Sylvestre Alain Durmus 31 1 0 13 Feb 2024
CochCeps-Augment: A Novel Self-Supervised Contrastive Learning Using Cochlear Cepstrum-based Masking for Speech Emotion Recognition Ioannis Ziogas Hessa Alfalahi A. Khandoker L. Hadjileontiadis 39 0 0 10 Feb 2024
Feature learning as alignment: a structural property of gradient descent in non-linear neural networks Daniel Beaglehole Ioannis Mitliagkas Atish Agarwala MLT 42 2 0 07 Feb 2024
Breaking MLPerf Training: A Case Study on Optimizing BERT Yongdeok Kim Jaehyung Ahn Myeongwoo Kim Changin Choi Heejae Kim ... Xiongzhan Linghu Jingkun Ma Lin Chen Yuehua Dai Sungjoo Yoo 25 0 0 04 Feb 2024
BECLR: Batch Enhanced Contrastive Few-Shot Learning Stylianos Poulakakis-Daktylidis Hadi Jamali Rad 28 5 0 04 Feb 2024
LDReg: Local Dimensionality Regularized Self-Supervised Learning Hanxun Huang R. Campello S. Erfani Xingjun Ma Michael E. Houle James Bailey 41 5 0 19 Jan 2024
Visual Robotic Manipulation with Depth-Aware Pretraining Wanying Wang Jinming Li Yichen Zhu Zhiyuan Xu Zhengping Che Yaxin Peng Chaomin Shen Dong Liu Feifei Feng Jian Tang MDE 32 3 0 17 Jan 2024
MADA: Meta-Adaptive Optimizers through hyper-gradient Descent Kaan Ozkara Can Karakus Parameswaran Raman Mingyi Hong Shoham Sabach B. Kveton V. Cevher 30 2 0 17 Jan 2024
Enhancing Contrastive Learning with Efficient Combinatorial Positive Pairing Jaeill Kim Duhun Hwang Eunjung Lee Jangwon Suh Jimyeong Kim Wonjong Rhee 33 0 0 11 Jan 2024
Interpreting Adaptive Gradient Methods by Parameter Scaling for Learning-Rate-Free Optimization Min-Kook Suh Seung-Woo Seo ODL 29 0 0 06 Jan 2024
Accelerated Convergence of Stochastic Heavy Ball Method under Anisotropic Gradient Noise Rui Pan Yuxing Liu Xiaoyu Wang Tong Zhang 26 5 0 22 Dec 2023
LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL Architectures Vimal Thilak Chen Huang Omid Saremi Laurent Dinh Hanlin Goh Preetum Nakkiran Josh Susskind Etai Littwin 25 9 0 07 Dec 2023
Analyzing and Improving the Training Dynamics of Diffusion Models Tero Karras M. Aittala J. Lehtinen Janne Hellsten Timo Aila S. Laine 42 158 0 05 Dec 2023
Guarding Barlow Twins Against Overfitting with Mixed Samples W. G. C. Bandara C. D. Melo Vishal M. Patel SSL 37 11 0 04 Dec 2023
Disentangling the Effects of Data Augmentation and Format Transform in Self-Supervised Learning of Image Representations Neha Kalibhat Warren Morningstar Alex Bijamov Luyang Liu Karan Singhal Philip Mansfield 33 2 0 02 Dec 2023
SASSL: Enhancing Self-Supervised Learning via Neural Style Transfer Renan A. Rojas-Gomez Karan Singhal Ali Etemad Alex Bijamov Warren Morningstar Philip Mansfield 32 1 0 02 Dec 2023
Generalisable Agents for Neural Network Optimisation Kale-ab Tessera C. Tilbury Sasha Abramowitz Ruan de Kock Omayma Mahjoub Benjamin Rosman Sara Hooker Arnu Pretorius AI4CE 20 0 0 30 Nov 2023
Lesion Search with Self-supervised Learning Kristin Qi Jiali Cheng D. Haehn SSL 9 0 0 18 Nov 2023
Self-Supervised Disentanglement by Leveraging Structure in Data Augmentations Cian Eastwood Julius von Kügelgen Linus Ericsson Diane Bouchacourt Pascal Vincent Bernhard Schölkopf Mark Ibrahim 39 10 0 15 Nov 2023
Osteoporosis Prediction from Hand and Wrist X-rays using Image Segmentation and Self-Supervised Learning Hyungeun Lee Ung Hwang Seungwon Yu Chang-Hun Lee Kijung Yoon 11 1 0 12 Nov 2023
Enhancing Instance-Level Image Classification with Set-Level Labels Renyu Zhang Aly A. Khan Yuxin Chen Robert L. Grossman 33 0 0 09 Nov 2023
Image Generation and Learning Strategy for Deep Document Forgery Detection Yamato Okamoto Osada Genki Iu Yahiro Rintaro Hasegawa Peifei Zhu Hirokatsu Kataoka AAML 36 0 0 07 Nov 2023
Group Robust Classification Without Any Group Information Christos Tsirigotis João Monteiro Pau Rodríguez David Vazquez Aaron Courville OOD 27 22 0 28 Oct 2023
Representation Learning via Consistent Assignment of Views over Random Partitions T. Silva Adín Ramirez Rivera SSL 27 2 0 19 Oct 2023
WeedCLR: Weed Contrastive Learning through Visual Representations with Class-Optimized Loss in Long-Tailed Datasets Alzayat Saleh A. Olsen Jake Wood B. Philippa M. R. Azghadi 25 0 0 19 Oct 2023
Two-Stage Deep Learning Framework for Quality Assessment of Left Atrial Late Gadolinium Enhanced MRI Images K. M. A. Sultan Benjamin A. Orkild Alan Morris E. Kholmovski E. Bieging Eugene Kwan Ravi Ranjan Ed DiBella Shireen Y. Elhabian MedIm 13 1 0 13 Oct 2023
FroSSL: Frobenius Norm Minimization for Efficient Multiview Self-Supervised Learning Oscar Skean Aayush Dhakal Nathan Jacobs Luis Gonzalo Sánchez Giraldo 37 0 0 04 Oct 2023
ScaleNet: An Unsupervised Representation Learning Method for Limited Information Huili Huang M. M. Roozbahani SSL 32 806 0 03 Oct 2023
Self-supervised Learning of Contextualized Local Visual Embeddings T. Silva Hélio Pedrini Adín Ramirez Rivera SSL 23 3 0 01 Oct 2023
Revisiting LARS for Large Batch Training Generalization of Neural Networks K. Do Duong Nguyen Hoa Nguyen Long Tran-Thanh Nguyen-Hoang Tran Viet Quoc Pham AI4CE ODL 31 0 0 25 Sep 2023
Accelerating Large Batch Training via Gradient Signal to Noise Ratio (GSNR) Guo-qing Jiang Jinlong Liu Zixiang Ding Lin Guo W. Lin AI4CE 26 1 0 24 Sep 2023
Investigating Efficient Deep Learning Architectures For Side-Channel Attacks on AES Yohai-Eliel Berreby L. Sauvage AAML 23 2 0 22 Sep 2023
Masking Improves Contrastive Self-Supervised Learning for ConvNets, and Saliency Tells You Where Zhi-Yi Chin Chieh-Ming Jiang Ching-Chun Huang Pin-Yu Chen Wei-Chen Chiu SSL 29 0 0 22 Sep 2023