ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1708.03888
  4. Cited By
Large Batch Training of Convolutional Networks

Large Batch Training of Convolutional Networks

13 August 2017
Yang You
Igor Gitman
Boris Ginsburg
    ODL
ArXivPDFHTML

Papers citing "Large Batch Training of Convolutional Networks"

50 / 545 papers shown
Title
AdaFisher: Adaptive Second Order Optimization via Fisher Information
AdaFisher: Adaptive Second Order Optimization via Fisher Information
Damien Martins Gomes
Yanlei Zhang
Eugene Belilovsky
Guy Wolf
Mahdi S. Hosseini
ODL
78
2
0
26 May 2024
Integrating Present and Past in Unsupervised Continual Learning
Integrating Present and Past in Unsupervised Continual Learning
Yipeng Zhang
Laurent Charlin
R. Zemel
Mengye Ren
CLL
43
3
0
29 Apr 2024
Pretraining Billion-scale Geospatial Foundational Models on Frontier
Pretraining Billion-scale Geospatial Foundational Models on Frontier
A. Tsaris
P. Dias
Abhishek Potnis
Junqi Yin
Feiyi Wang
D. Lunga
AI4CE
38
4
0
17 Apr 2024
FairCLIP: Harnessing Fairness in Vision-Language Learning
FairCLIP: Harnessing Fairness in Vision-Language Learning
Yan Luo
Minfei Shi
Muhammad Osama Khan
Muhammad Muneeb Afzal
Hao Huang
...
Luo Song
Ava Kouhana
T. Elze
Yi Fang
Mengyu Wang
VLM
42
32
0
29 Mar 2024
Tiny Machine Learning: Progress and Futures
Tiny Machine Learning: Progress and Futures
Ji Lin
Ligeng Zhu
Wei-Ming Chen
Wei-Chen Wang
Song Han
52
51
0
28 Mar 2024
OrCo: Towards Better Generalization via Orthogonality and Contrast for
  Few-Shot Class-Incremental Learning
OrCo: Towards Better Generalization via Orthogonality and Contrast for Few-Shot Class-Incremental Learning
Noor Ahmed
Anna Kukleva
Bernt Schiele
CLL
40
12
0
27 Mar 2024
Branch-Tuning: Balancing Stability and Plasticity for Continual
  Self-Supervised Learning
Branch-Tuning: Balancing Stability and Plasticity for Continual Self-Supervised Learning
Wenzhuo Liu
Fei Zhu
Cheng-Lin Liu
CLL
43
2
0
27 Mar 2024
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language
  Model Fine-Tuning
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning
Rui Pan
Xiang Liu
Shizhe Diao
Renjie Pi
Jipeng Zhang
Chi Han
Tong Zhang
46
37
0
26 Mar 2024
Self-Supervised Backbone Framework for Diverse Agricultural Vision Tasks
Self-Supervised Backbone Framework for Diverse Agricultural Vision Tasks
Sudhir Sornapudi
Rajhans Singh Corteva Agriscience
SSL
25
1
0
22 Mar 2024
Can Generative Models Improve Self-Supervised Representation Learning?
Can Generative Models Improve Self-Supervised Representation Learning?
Sana Ayromlou
Arash Afkanpour
Vahid Reza Khazaie
Fereshteh Forghani
40
3
0
09 Mar 2024
Augmentations vs Algorithms: What Works in Self-Supervised Learning
Augmentations vs Algorithms: What Works in Self-Supervised Learning
Warren Morningstar
Alex Bijamov
Chris Duvarney
Luke Friedman
Neha Kalibhat
...
Philip Mansfield
Renan A. Rojas-Gomez
Karan Singhal
Bradley Green
Sushant Prakash
SSL
38
10
0
08 Mar 2024
Self-Supervised Multiple Instance Learning for Acute Myeloid Leukemia
  Classification
Self-Supervised Multiple Instance Learning for Acute Myeloid Leukemia Classification
Salome Kazeminia
Max Joosten
D. Bosnacki
Carsten Marr
30
2
0
08 Mar 2024
Towards Calibrated Deep Clustering Network
Towards Calibrated Deep Clustering Network
Yuheng Jia
Jianhong Cheng
Hui Liu
Junhui Hou
UQCV
53
1
0
04 Mar 2024
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks
Xiangxiang Chu
Jianlin Su
Bo-Wen Zhang
Chunhua Shen
MLLM
44
10
0
01 Mar 2024
Learning and Leveraging World Models in Visual Representation Learning
Learning and Leveraging World Models in Visual Representation Learning
Q. Garrido
Mahmoud Assran
Nicolas Ballas
Adrien Bardes
Laurent Najman
Yann LeCun
SSL
46
24
0
01 Mar 2024
A Large-scale Evaluation of Pretraining Paradigms for the Detection of
  Defects in Electroluminescence Solar Cell Images
A Large-scale Evaluation of Pretraining Paradigms for the Detection of Defects in Electroluminescence Solar Cell Images
David Torpey
Lawrence Pratt
Richard Klein
32
0
0
27 Feb 2024
Parallelized Midpoint Randomization for Langevin Monte Carlo
Parallelized Midpoint Randomization for Langevin Monte Carlo
Lu Yu
A. Dalalyan
36
6
0
22 Feb 2024
Implicit Bias in Noisy-SGD: With Applications to Differentially Private
  Training
Implicit Bias in Noisy-SGD: With Applications to Differentially Private Training
Tom Sander
Maxime Sylvestre
Alain Durmus
31
1
0
13 Feb 2024
CochCeps-Augment: A Novel Self-Supervised Contrastive Learning Using
  Cochlear Cepstrum-based Masking for Speech Emotion Recognition
CochCeps-Augment: A Novel Self-Supervised Contrastive Learning Using Cochlear Cepstrum-based Masking for Speech Emotion Recognition
Ioannis Ziogas
Hessa Alfalahi
A. Khandoker
L. Hadjileontiadis
39
0
0
10 Feb 2024
Feature learning as alignment: a structural property of gradient descent
  in non-linear neural networks
Feature learning as alignment: a structural property of gradient descent in non-linear neural networks
Daniel Beaglehole
Ioannis Mitliagkas
Atish Agarwala
MLT
42
2
0
07 Feb 2024
Breaking MLPerf Training: A Case Study on Optimizing BERT
Breaking MLPerf Training: A Case Study on Optimizing BERT
Yongdeok Kim
Jaehyung Ahn
Myeongwoo Kim
Changin Choi
Heejae Kim
...
Xiongzhan Linghu
Jingkun Ma
Lin Chen
Yuehua Dai
Sungjoo Yoo
25
0
0
04 Feb 2024
BECLR: Batch Enhanced Contrastive Few-Shot Learning
BECLR: Batch Enhanced Contrastive Few-Shot Learning
Stylianos Poulakakis-Daktylidis
Hadi Jamali Rad
28
5
0
04 Feb 2024
LDReg: Local Dimensionality Regularized Self-Supervised Learning
LDReg: Local Dimensionality Regularized Self-Supervised Learning
Hanxun Huang
R. Campello
S. Erfani
Xingjun Ma
Michael E. Houle
James Bailey
41
5
0
19 Jan 2024
Visual Robotic Manipulation with Depth-Aware Pretraining
Visual Robotic Manipulation with Depth-Aware Pretraining
Wanying Wang
Jinming Li
Yichen Zhu
Zhiyuan Xu
Zhengping Che
Yaxin Peng
Chaomin Shen
Dong Liu
Feifei Feng
Jian Tang
MDE
32
3
0
17 Jan 2024
MADA: Meta-Adaptive Optimizers through hyper-gradient Descent
MADA: Meta-Adaptive Optimizers through hyper-gradient Descent
Kaan Ozkara
Can Karakus
Parameswaran Raman
Mingyi Hong
Shoham Sabach
B. Kveton
V. Cevher
30
2
0
17 Jan 2024
Enhancing Contrastive Learning with Efficient Combinatorial Positive
  Pairing
Enhancing Contrastive Learning with Efficient Combinatorial Positive Pairing
Jaeill Kim
Duhun Hwang
Eunjung Lee
Jangwon Suh
Jimyeong Kim
Wonjong Rhee
33
0
0
11 Jan 2024
Interpreting Adaptive Gradient Methods by Parameter Scaling for
  Learning-Rate-Free Optimization
Interpreting Adaptive Gradient Methods by Parameter Scaling for Learning-Rate-Free Optimization
Min-Kook Suh
Seung-Woo Seo
ODL
29
0
0
06 Jan 2024
Accelerated Convergence of Stochastic Heavy Ball Method under
  Anisotropic Gradient Noise
Accelerated Convergence of Stochastic Heavy Ball Method under Anisotropic Gradient Noise
Rui Pan
Yuxing Liu
Xiaoyu Wang
Tong Zhang
26
5
0
22 Dec 2023
LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL
  Architectures
LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL Architectures
Vimal Thilak
Chen Huang
Omid Saremi
Laurent Dinh
Hanlin Goh
Preetum Nakkiran
Josh Susskind
Etai Littwin
25
9
0
07 Dec 2023
Analyzing and Improving the Training Dynamics of Diffusion Models
Analyzing and Improving the Training Dynamics of Diffusion Models
Tero Karras
M. Aittala
J. Lehtinen
Janne Hellsten
Timo Aila
S. Laine
42
158
0
05 Dec 2023
Guarding Barlow Twins Against Overfitting with Mixed Samples
Guarding Barlow Twins Against Overfitting with Mixed Samples
W. G. C. Bandara
C. D. Melo
Vishal M. Patel
SSL
37
11
0
04 Dec 2023
Disentangling the Effects of Data Augmentation and Format Transform in
  Self-Supervised Learning of Image Representations
Disentangling the Effects of Data Augmentation and Format Transform in Self-Supervised Learning of Image Representations
Neha Kalibhat
Warren Morningstar
Alex Bijamov
Luyang Liu
Karan Singhal
Philip Mansfield
33
2
0
02 Dec 2023
SASSL: Enhancing Self-Supervised Learning via Neural Style Transfer
SASSL: Enhancing Self-Supervised Learning via Neural Style Transfer
Renan A. Rojas-Gomez
Karan Singhal
Ali Etemad
Alex Bijamov
Warren Morningstar
Philip Mansfield
32
1
0
02 Dec 2023
Generalisable Agents for Neural Network Optimisation
Generalisable Agents for Neural Network Optimisation
Kale-ab Tessera
C. Tilbury
Sasha Abramowitz
Ruan de Kock
Omayma Mahjoub
Benjamin Rosman
Sara Hooker
Arnu Pretorius
AI4CE
20
0
0
30 Nov 2023
Lesion Search with Self-supervised Learning
Lesion Search with Self-supervised Learning
Kristin Qi
Jiali Cheng
D. Haehn
SSL
9
0
0
18 Nov 2023
Self-Supervised Disentanglement by Leveraging Structure in Data
  Augmentations
Self-Supervised Disentanglement by Leveraging Structure in Data Augmentations
Cian Eastwood
Julius von Kügelgen
Linus Ericsson
Diane Bouchacourt
Pascal Vincent
Bernhard Schölkopf
Mark Ibrahim
39
10
0
15 Nov 2023
Osteoporosis Prediction from Hand and Wrist X-rays using Image
  Segmentation and Self-Supervised Learning
Osteoporosis Prediction from Hand and Wrist X-rays using Image Segmentation and Self-Supervised Learning
Hyungeun Lee
Ung Hwang
Seungwon Yu
Chang-Hun Lee
Kijung Yoon
11
1
0
12 Nov 2023
Enhancing Instance-Level Image Classification with Set-Level Labels
Enhancing Instance-Level Image Classification with Set-Level Labels
Renyu Zhang
Aly A. Khan
Yuxin Chen
Robert L. Grossman
33
0
0
09 Nov 2023
Image Generation and Learning Strategy for Deep Document Forgery
  Detection
Image Generation and Learning Strategy for Deep Document Forgery Detection
Yamato Okamoto
Osada Genki
Iu Yahiro
Rintaro Hasegawa
Peifei Zhu
Hirokatsu Kataoka
AAML
36
0
0
07 Nov 2023
Group Robust Classification Without Any Group Information
Group Robust Classification Without Any Group Information
Christos Tsirigotis
João Monteiro
Pau Rodríguez
David Vazquez
Aaron Courville
OOD
27
22
0
28 Oct 2023
Representation Learning via Consistent Assignment of Views over Random
  Partitions
Representation Learning via Consistent Assignment of Views over Random Partitions
T. Silva
Adín Ramirez Rivera
SSL
27
2
0
19 Oct 2023
WeedCLR: Weed Contrastive Learning through Visual Representations with
  Class-Optimized Loss in Long-Tailed Datasets
WeedCLR: Weed Contrastive Learning through Visual Representations with Class-Optimized Loss in Long-Tailed Datasets
Alzayat Saleh
A. Olsen
Jake Wood
B. Philippa
M. R. Azghadi
25
0
0
19 Oct 2023
Two-Stage Deep Learning Framework for Quality Assessment of Left Atrial
  Late Gadolinium Enhanced MRI Images
Two-Stage Deep Learning Framework for Quality Assessment of Left Atrial Late Gadolinium Enhanced MRI Images
K. M. A. Sultan
Benjamin A. Orkild
Alan Morris
E. Kholmovski
E. Bieging
Eugene Kwan
Ravi Ranjan
Ed DiBella
Shireen Y. Elhabian
MedIm
13
1
0
13 Oct 2023
FroSSL: Frobenius Norm Minimization for Efficient Multiview
  Self-Supervised Learning
FroSSL: Frobenius Norm Minimization for Efficient Multiview Self-Supervised Learning
Oscar Skean
Aayush Dhakal
Nathan Jacobs
Luis Gonzalo Sánchez Giraldo
37
0
0
04 Oct 2023
ScaleNet: An Unsupervised Representation Learning Method for Limited
  Information
ScaleNet: An Unsupervised Representation Learning Method for Limited Information
Huili Huang
M. M. Roozbahani
SSL
32
806
0
03 Oct 2023
Self-supervised Learning of Contextualized Local Visual Embeddings
Self-supervised Learning of Contextualized Local Visual Embeddings
T. Silva
Hélio Pedrini
Adín Ramirez Rivera
SSL
23
3
0
01 Oct 2023
Revisiting LARS for Large Batch Training Generalization of Neural
  Networks
Revisiting LARS for Large Batch Training Generalization of Neural Networks
K. Do
Duong Nguyen
Hoa Nguyen
Long Tran-Thanh
Nguyen-Hoang Tran
Viet Quoc Pham
AI4CE
ODL
31
0
0
25 Sep 2023
Accelerating Large Batch Training via Gradient Signal to Noise Ratio
  (GSNR)
Accelerating Large Batch Training via Gradient Signal to Noise Ratio (GSNR)
Guo-qing Jiang
Jinlong Liu
Zixiang Ding
Lin Guo
W. Lin
AI4CE
26
1
0
24 Sep 2023
Investigating Efficient Deep Learning Architectures For Side-Channel
  Attacks on AES
Investigating Efficient Deep Learning Architectures For Side-Channel Attacks on AES
Yohai-Eliel Berreby
L. Sauvage
AAML
23
2
0
22 Sep 2023
Masking Improves Contrastive Self-Supervised Learning for ConvNets, and
  Saliency Tells You Where
Masking Improves Contrastive Self-Supervised Learning for ConvNets, and Saliency Tells You Where
Zhi-Yi Chin
Chieh-Ming Jiang
Ching-Chun Huang
Pin-Yu Chen
Wei-Chen Chiu
SSL
29
0
0
22 Sep 2023
Previous
12345...91011
Next