ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.02677
  4. Cited By
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
v1v2 (latest)

Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

8 June 2017
Priya Goyal
Piotr Dollár
Ross B. Girshick
P. Noordhuis
Lukasz Wesolowski
Aapo Kyrola
Andrew Tulloch
Yangqing Jia
Kaiming He
    3DH
ArXiv (abs)PDFHTML

Papers citing "Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour"

50 / 2,054 papers shown
Title
GLEAM: Greedy Learning for Large-Scale Accelerated MRI Reconstruction
GLEAM: Greedy Learning for Large-Scale Accelerated MRI Reconstruction
Batu Mehmet Ozturkler
Arda Sahiner
Tolga Ergen
Arjun D Desai
Christopher M. Sandino
S. Vasanawala
John M. Pauly
Morteza Mardani
Mert Pilanci
55
4
0
18 Jul 2022
TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
Yuqi Liu
Pengfei Xiong
Luhui Xu
Shengming Cao
Qin Jin
95
122
0
16 Jul 2022
On the Strong Correlation Between Model Invariance and Generalization
On the Strong Correlation Between Model Invariance and Generalization
Weijian Deng
Stephen Gould
Liang Zheng
OOD
91
19
0
14 Jul 2022
Scene Text Recognition with Permuted Autoregressive Sequence Models
Scene Text Recognition with Permuted Autoregressive Sequence Models
Darwin Bautista
Rowel Atienza
108
174
0
14 Jul 2022
TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent
  Kernels
TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels
Yaodong Yu
Alexander Wei
Sai Praneeth Karimireddy
Yi-An Ma
Michael I. Jordan
FedML
80
31
0
13 Jul 2022
DSPNet: Towards Slimmable Pretrained Networks based on Discriminative
  Self-supervised Learning
DSPNet: Towards Slimmable Pretrained Networks based on Discriminative Self-supervised Learning
Shaoru Wang
Zeming Li
Jin Gao
Liang Li
Weiming Hu
66
0
0
13 Jul 2022
Towards understanding how momentum improves generalization in deep
  learning
Towards understanding how momentum improves generalization in deep learning
Samy Jelassi
Yuanzhi Li
ODLMLTAI4CE
90
38
0
13 Jul 2022
Long-term Leap Attention, Short-term Periodic Shift for Video
  Classification
Long-term Leap Attention, Short-term Periodic Shift for Video Classification
Huatian Zhang
Lechao Cheng
Y. Hao
Chong-Wah Ngo
ViT
77
10
0
12 Jul 2022
Instance Shadow Detection with A Single-Stage Detector
Instance Shadow Detection with A Single-Stage Detector
Tianyu Wang
Xiaowei Hu
Pheng-Ann Heng
Chi-Wing Fu
85
28
0
11 Jul 2022
Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments
Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments
Daniel Seichter
Söhnke Benedikt Fischedick
Mona Köhler
H. Groß
89
40
0
10 Jul 2022
A Mask Attention Interaction and Scale Enhancement Network for SAR Ship
  Instance Segmentation
A Mask Attention Interaction and Scale Enhancement Network for SAR Ship Instance Segmentation
Tianwen Zhang
Xiaoling Zhang
58
54
0
08 Jul 2022
An Embedding-Dynamic Approach to Self-supervised Learning
An Embedding-Dynamic Approach to Self-supervised Learning
Suhong Moon
Domas Buracas
Seunghyun Park
Jinkyu Kim
John F. Canny
OCL
126
4
0
07 Jul 2022
PoF: Post-Training of Feature Extractor for Improving Generalization
PoF: Post-Training of Feature Extractor for Improving Generalization
Ikuro Sato
Ryota Yamada
Masayuki Tanaka
Nakamasa Inoue
Rei Kawakami
39
4
0
05 Jul 2022
End-to-end Learning for Image-based Detection of Molecular Alterations
  in Digital Pathology
End-to-end Learning for Image-based Detection of Molecular Alterations in Digital Pathology
Marvin Teichmann
A. Aichert
H. Bohnenberger
P. Ströbel
T. Heimann
MedIm
18
3
0
30 Jun 2022
On-Device Training Under 256KB Memory
On-Device Training Under 256KB Memory
Ji Lin
Ligeng Zhu
Wei-Ming Chen
Wei-Chen Wang
Chuang Gan
Song Han
MQ
144
213
0
30 Jun 2022
Scalable K-FAC Training for Deep Neural Networks with Distributed
  Preconditioning
Scalable K-FAC Training for Deep Neural Networks with Distributed Preconditioning
Lin Zhang
Shaoshuai Shi
Wei Wang
Yue Liu
70
10
0
30 Jun 2022
Exploring Temporally Dynamic Data Augmentation for Video Recognition
Exploring Temporally Dynamic Data Augmentation for Video Recognition
Taeoh Kim
Jinhyung Kim
Minho Shim
Sangdoo Yun
Myunggu Kang
Dongyoon Wee
Sangyoun Lee
AI4TS
118
10
0
30 Jun 2022
QTI Submission to DCASE 2021: residual normalization for
  device-imbalanced acoustic scene classification with efficient design
QTI Submission to DCASE 2021: residual normalization for device-imbalanced acoustic scene classification with efficient design
Byeonggeun Kim
Seunghan Yang
Jangho Kim
Simyung Chang
81
58
0
28 Jun 2022
Studying Generalization Through Data Averaging
Studying Generalization Through Data Averaging
C. Gomez-Uribe
FedML
138
0
0
28 Jun 2022
AutoInit: Automatic Initialization via Jacobian Tuning
AutoInit: Automatic Initialization via Jacobian Tuning
Tianyu He
Darshil Doshi
Andrey Gromov
68
4
0
27 Jun 2022
Zero Stability Well Predicts Performance of Convolutional Neural
  Networks
Zero Stability Well Predicts Performance of Convolutional Neural Networks
Liangming Chen
Long Jin
Mingsheng Shang
MLT
79
8
0
27 Jun 2022
Multi-Scale Spatial Temporal Graph Convolutional Network for
  Skeleton-Based Action Recognition
Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
Zhan Chen
Sicheng Li
Bing Yang
Qinghan Li
Hong Liu
79
268
0
27 Jun 2022
p-Meta: Towards On-device Deep Model Adaptation
p-Meta: Towards On-device Deep Model Adaptation
Zhongnan Qu
Zimu Zhou
Yongxin Tong
Lothar Thiele
75
13
0
25 Jun 2022
Domain Generalization with Relaxed Instance Frequency-wise Normalization
  for Multi-device Acoustic Scene Classification
Domain Generalization with Relaxed Instance Frequency-wise Normalization for Multi-device Acoustic Scene Classification
Byeonggeun Kim
Seunghan Yang
Jangho Kim
Hyunsin Park
Juntae Lee
Simyung Chang
97
29
0
24 Jun 2022
MaskViT: Masked Visual Pre-Training for Video Prediction
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta
Stephen Tian
Yunzhi Zhang
Jiajun Wu
Roberto Martín-Martín
Li Fei-Fei
190
121
0
23 Jun 2022
TiCo: Transformation Invariance and Covariance Contrast for
  Self-Supervised Visual Representation Learning
TiCo: Transformation Invariance and Covariance Contrast for Self-Supervised Visual Representation Learning
Jiachen Zhu
R. M. Moraes
Serkan Karakulak
Vlad Sobol
A. Canziani
Yann LeCun
SSL
61
22
0
21 Jun 2022
On the Maximum Hessian Eigenvalue and Generalization
On the Maximum Hessian Eigenvalue and Generalization
Simran Kaur
Jérémy E. Cohen
Zachary Chase Lipton
115
43
0
21 Jun 2022
Shifted Compression Framework: Generalizations and Improvements
Shifted Compression Framework: Generalizations and Improvements
Egor Shulgin
Peter Richtárik
63
6
0
21 Jun 2022
An Efficient Industrial Federated Learning Framework for AIoT: A Face
  Recognition Application
An Efficient Industrial Federated Learning Framework for AIoT: A Face Recognition Application
Youlong Ding
Xueyang Wu
Zhitao Li
Zeheng Wu
S. Tan
Qian Xu
Weike Pan
Qiang Yang
FedML
70
4
0
21 Jun 2022
Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS
Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS
K. Udagawa
Yuki Saito
Hiroshi Saruwatari
28
6
0
21 Jun 2022
SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders
SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders
Gang Li
Heliang Zheng
Daqing Liu
Chaoyue Wang
Fuchun Sun
Changwen Zheng
119
130
0
21 Jun 2022
Out-of-distribution Detection by Cross-class Vicinity Distribution of
  In-distribution Data
Out-of-distribution Detection by Cross-class Vicinity Distribution of In-distribution Data
Zhilin Zhao
LongBing Cao
Kun-Yu Lin
OOD
48
2
0
19 Jun 2022
Open-Sampling: Exploring Out-of-Distribution data for Re-balancing
  Long-tailed datasets
Open-Sampling: Exploring Out-of-Distribution data for Re-balancing Long-tailed datasets
Hongxin Wei
Lue Tao
Renchunzi Xie
Lei Feng
Bo An
OODD
64
39
0
17 Jun 2022
The State of Sparse Training in Deep Reinforcement Learning
The State of Sparse Training in Deep Reinforcement Learning
L. Graesser
Utku Evci
Erich Elsen
Pablo Samuel Castro
OffRL
75
40
0
17 Jun 2022
iBoot: Image-bootstrapped Self-Supervised Video Representation Learning
iBoot: Image-bootstrapped Self-Supervised Video Representation Learning
F. Saleh
Fuwen Tan
Adrian Bulat
Georgios Tzimiropoulos
Brais Martínez
SSL
99
1
0
16 Jun 2022
Adapting Self-Supervised Vision Transformers by Probing
  Attention-Conditioned Masking Consistency
Adapting Self-Supervised Vision Transformers by Probing Attention-Conditioned Masking Consistency
Viraj Prabhu
Sriram Yenamandra
Aaditya K. Singh
Judy Hoffman
69
15
0
16 Jun 2022
Patch-level Representation Learning for Self-supervised Vision
  Transformers
Patch-level Representation Learning for Self-supervised Vision Transformers
Sukmin Yun
Hankook Lee
Jaehyung Kim
Jinwoo Shin
ViT
120
68
0
16 Jun 2022
Identifying Electrocardiogram Abnormalities Using a
  Handcrafted-Rule-Enhanced Neural Network
Identifying Electrocardiogram Abnormalities Using a Handcrafted-Rule-Enhanced Neural Network
Yu Bian
Jintai Chen
Xiaojun Chen
Xiaoxian Yang
Da Chen
Jian Wu
48
10
0
16 Jun 2022
Write and Paint: Generative Vision-Language Models are Unified Modal
  Learners
Write and Paint: Generative Vision-Language Models are Unified Modal Learners
Shizhe Diao
Wangchunshu Zhou
Xinsong Zhang
Jiawei Wang
MLLMAI4CE
97
17
0
15 Jun 2022
A Simple Data Mixing Prior for Improving Self-Supervised Learning
A Simple Data Mixing Prior for Improving Self-Supervised Learning
Sucheng Ren
Huiyu Wang
Zhengqi Gao
Shengfeng He
Alan Yuille
Yuyin Zhou
Cihang Xie
51
35
0
15 Jun 2022
Asynchronous SGD Beats Minibatch SGD Under Arbitrary Delays
Asynchronous SGD Beats Minibatch SGD Under Arbitrary Delays
Konstantin Mishchenko
Francis R. Bach
Mathieu Even
Blake E. Woodworth
83
61
0
15 Jun 2022
Self-Supervised Representation Learning With MUlti-Segmental
  Informational Coding (MUSIC)
Self-Supervised Representation Learning With MUlti-Segmental Informational Coding (MUSIC)
Chuang Niu
Ge Wang
SSL
62
6
0
13 Jun 2022
Distributed Adversarial Training to Robustify Deep Neural Networks at
  Scale
Distributed Adversarial Training to Robustify Deep Neural Networks at Scale
Gaoyuan Zhang
Songtao Lu
Yihua Zhang
Xiangyi Chen
Pin-Yu Chen
Quanfu Fan
Lee Martie
L. Horesh
Min-Fong Hong
Sijia Liu
OOD
73
12
0
13 Jun 2022
Towards Understanding Sharpness-Aware Minimization
Towards Understanding Sharpness-Aware Minimization
Maksym Andriushchenko
Nicolas Flammarion
AAML
124
142
0
13 Jun 2022
2nd Place Solution for ICCV 2021 VIPriors Image Classification
  Challenge: An Attract-and-Repulse Learning Approach
2nd Place Solution for ICCV 2021 VIPriors Image Classification Challenge: An Attract-and-Repulse Learning Approach
Yilu Guo
Shicai Yang
Weijie Chen
Liang Ma
Di Xie
Shiliang Pu
65
1
0
13 Jun 2022
Modeling the Machine Learning Multiverse
Modeling the Machine Learning Multiverse
Samuel J. Bell
Onno P. Kampman
Jesse Dodge
Neil D. Lawrence
80
18
0
13 Jun 2022
Anchor Sampling for Federated Learning with Partial Client Participation
Anchor Sampling for Federated Learning with Partial Client Participation
Feijie Wu
Song Guo
Zhihao Qu
Shiqi He
Ziming Liu
Jing Gao
FedML
83
14
0
13 Jun 2022
MLLess: Achieving Cost Efficiency in Serverless Machine Learning
  Training
MLLess: Achieving Cost Efficiency in Serverless Machine Learning Training
Pablo Gimeno Sarroca
Marc Sánchez Artigas
55
16
0
12 Jun 2022
Narrowing the Gap: Improved Detector Training with Noisy Location
  Annotations
Narrowing the Gap: Improved Detector Training with Noisy Location Annotations
Shaoru Wang
Jin Gao
Bing Li
Weiming Hu
ObjDNoLa
73
9
0
12 Jun 2022
Learning Imbalanced Datasets with Maximum Margin Loss
Learning Imbalanced Datasets with Maximum Margin Loss
Haeyong Kang
Thang Vu
Chang D. Yoo
118
18
0
11 Jun 2022
Previous
123...131415...404142
Next