Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

8 June 2017
Priya Goyal
Piotr Dollár
Ross B. Girshick
P. Noordhuis
Lukasz Wesolowski
Aapo Kyrola
Andrew Tulloch
Yangqing Jia
Kaiming He
    3DH

Papers citing "Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour"

50 / 2,054 papers shown
Title
Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss
Xingyi Cheng
Hezheng Lin
Xiangyu Wu
Fan Yang
Dong Shen
93
155
0
09 Sep 2021
ISyNet: Convolutional Neural Networks design for AI accelerator
Alexey Letunovskiy
Vladimir Korviakov
V. Polovnikov
Anastasiia Kargapoltseva
I. Mazurenko
Yepan Xiong
104
1
0
04 Sep 2021
LIGAR: Lightweight General-purpose Action Recognition
Evgeny Izutov
46
3
0
30 Aug 2021
Exploring and Improving Mobile Level Vision Transformers
Pengguang Chen
Yixin Chen
Shu Liu
Ming-Hsuan Yang
Jiaya Jia
ViT
104
4
0
30 Aug 2021
4-bit Quantization of LSTM-based Speech Recognition Models
A. Fasoli
Chia-Yu Chen
Mauricio Serrano
Xiao Sun
Naigang Wang
...
Xiaodong Cui
Brian Kingsbury
Wei Zhang
Zoltán Tüske
K. Gopalakrishnan
MQ
75
23
0
27 Aug 2021
EmoBERTa: Speaker-Aware Emotion Recognition in Conversation with RoBERTa
Taewoon Kim
Piek Vossen
98
102
0
26 Aug 2021
Unsupervised domain adaptation for clinician pose estimation and instance segmentation in the operating room
V. Srivastav
A. Gangi
N. Padoy
OOD
83
11
0
26 Aug 2021
Multi-Task Self-Training for Learning General Representations
Golnaz Ghiasi
Barret Zoph
E. D. Cubuk
Quoc V. Le
Nayeon Lee
SSL
91
102
0
25 Aug 2021
A Scaling Law for Synthetic-to-Real Transfer: How Much Is Your Pre-training Effective?
Hiroaki Mikami
Kenji Fukumizu
Shogo Murai
Shuji Suzuki
Yuta Kikuchi
Taiji Suzuki
S. Maeda
Kohei Hayashi
92
12
0
25 Aug 2021
CE-Dedup: Cost-Effective Convolutional Neural Nets Training based on Image Deduplication
Xuan Li
Liqiong Chang
Xue Liu
44
8
0
23 Aug 2021
Shift-Curvature, SGD, and Generalization
Arwen V. Bradley
C. Gomez-Uribe
Manish Reddy Vuyyuru
62
3
0
21 Aug 2021
Online Continual Learning with Natural Distribution Shifts: An Empirical Study with Visual Data
Zhipeng Cai
Ozan Sener
V. Koltun
CLL
65
87
0
20 Aug 2021
Deep MRI Reconstruction with Radial Subsampling
George Yiasemis
Chaoping Zhang
C. Sánchez
Jan-Jakob Sonke
Jonas Teuwen
128
9
0
17 Aug 2021
Contextual Convolutional Neural Networks
Ionut Cosmin Duta
Mariana-Iuliana Georgescu
Radu Tudor Ionescu
62
7
0
17 Aug 2021
Masked Face Recognition Challenge: The WebFace260M Track Report
Zheng Zhu
Guan Huang
Jiankang Deng
Yun Ye
Junjie Huang
...
Tian Yang
Jia Guo
Jiwen Lu
Dalong Du
Jie Zhou
88
26
0
16 Aug 2021
Implicit Regularization of Bregman Proximal Point Algorithm and Mirror Descent on Separable Data
Yan Li
Caleb Ju
Ethan X. Fang
T. Zhao
69
9
0
15 Aug 2021
GeoCLR: Georeference Contrastive Learning for Efficient Seafloor Image Interpretation
Takaki Yamada
Adam Prugel-Bennett
Stefan B. Williams
Oscar Pizarro
B. Thornton
67
14
0
13 Aug 2021
A Distributed SGD Algorithm with Global Sketching for Deep Learning Training Acceleration
Lingfei Dai
Boyu Diao
Chao Li
Yongjun Xu
68
5
0
13 Aug 2021
Logit Attenuating Weight Normalization
Aman Gupta
R. Ramanath
Jun Shi
Anika Ramachandran
Sirou Zhou
Mingzhou Zhou
S. Keerthi
75
1
0
12 Aug 2021
Representation Learning for Remote Sensing: An Unsupervised Sensor Fusion Approach
Aidan M. Swope
X. Rudelis
Kyle T. Story
SSL
130
20
0
11 Aug 2021
FedPAGE: A Fast Local Stochastic Gradient Method for Communication-Efficient Federated Learning
Haoyu Zhao
Zhize Li
Peter Richtárik
FedML
79
29
0
10 Aug 2021
Tensor Yard: One-Shot Algorithm of Hardware-Friendly Tensor-Train Decomposition for Convolutional Neural Networks
Anuar Taskynov
Vladimir Korviakov
I. Mazurenko
Yepan Xiong
61
2
0
09 Aug 2021
BenchENAS: A Benchmarking Platform for Evolutionary Neural Architecture Search
Xiangning Xie
Yuqiao Liu
Yanan Sun
Gary G. Yen
Bing Xue
Mengjie Zhang
117
19
0
09 Aug 2021
Online Evolutionary Batch Size Orchestration for Scheduling Deep Learning Workloads in GPU Clusters
Chen Sun
Shenggui Li
Jinyue Wang
Jun Yu
114
48
0
08 Aug 2021
Impact of Aliasing on Generalization in Deep Convolutional Networks
C. N. Vasconcelos
Hugo Larochelle
Vincent Dumoulin
Rob Romijnders
Nicolas Le Roux
Ross Goroshin
OOD
124
36
0
07 Aug 2021
Toward Efficient Online Scheduling for Distributed Machine Learning Systems
Menglu Yu
Jia Liu
Chuan Wu
Bo Ji
Elizabeth S. Bentley
89
6
0
06 Aug 2021
Learning to Elect
Cem Anil
Xuchan Bao
27
8
0
05 Aug 2021
Token Shift Transformer for Video Classification
Hao Zhang
Y. Hao
Chong-Wah Ngo
ViT
87
119
0
05 Aug 2021
ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot
Jiarui Cai
Yizhou Wang
Lei Li
OODD
119
140
0
05 Aug 2021
Domain Adaptor Networks for Hyperspectral Image Recognition
Gustavo Pérez
Subhransu Maji
31
0
0
03 Aug 2021
AASAE: Augmentation-Augmented Stochastic Autoencoders
William Falcon
A. Jha
Teddy Koker
Kyunghyun Cho
102
0
0
26 Jul 2021
Adaptive Recursive Circle Framework for Fine-grained Action Recognition
Hanxi Lin
Xinxiao Wu
Jiebo Luo
65
2
0
25 Jul 2021
Compressing Neural Networks: Towards Determining the Optimal Layer-wise Decomposition
Lucas Liebenwein
Alaa Maalouf
O. Gal
Dan Feldman
Daniela Rus
75
47
0
23 Jul 2021
Taxonomizing local versus global structure in neural network loss landscapes
Yaoqing Yang
Liam Hodgkinson
Ryan Theisen
Joe Zou
Joseph E. Gonzalez
Kannan Ramchandran
Michael W. Mahoney
121
37
0
23 Jul 2021
Bias Loss for Mobile Neural Networks
L. Abrahamyan
Valentin Ziatchin
Yiming Chen
Nikos Deligiannis
45
14
0
23 Jul 2021
OODformer: Out-Of-Distribution Detection Transformer
Rajat Koner
Poulami Sinhamahapatra
Karsten Roscher
Stephan Günnemann
Volker Tresp
ViT
64
40
0
19 Jul 2021
Face.evoLVe: A High-Performance Face Recognition Library
Qingzhong Wang
Pengfei Zhang
Haoyi Xiong
Jian-jun Zhao
CVBM
110
63
0
19 Jul 2021
YOLOX: Exceeding YOLO Series in 2021
Zheng Ge
Songtao Liu
Feng Wang
Zeming Li
Jian Sun
ObjD
245
4,135
0
18 Jul 2021
Chimera: Efficiently Training Large-Scale Neural Networks with Bidirectional Pipelines
Shigang Li
Torsten Hoefler
GNN AI4CE LRM
130
138
0
14 Jul 2021
Accelerating Distributed K-FAC with Smart Parallelism of Computing and Communication Tasks
Shaoshuai Shi
Lin Zhang
Yue Liu
123
9
0
14 Jul 2021
Automated Learning Rate Scheduler for Large-batch Training
Chiheon Kim
Saehoon Kim
Jongmin Kim
Donghoon Lee
Sungwoong Kim
54
20
0
13 Jul 2021
Joint Matrix Decomposition for Deep Convolutional Neural Networks Compression
Shaowu Chen
Jihao Zhou
Weize Sun
Lei Huang
45
21
0
09 Jul 2021
REX: Revisiting Budgeted Training with an Improved Schedule
John Chen
Cameron R. Wolfe
Anastasios Kyrillidis
59
9
0
09 Jul 2021
Poly-NL: Linear Complexity Non-local Layers with Polynomials
F. Babiloni
Ioannis Marras
Filippos Kokkinos
Jiankang Deng
Grigorios G. Chrysos
Stefanos Zafeiriou
63
6
0
06 Jul 2021
Simpler, Faster, Stronger: Breaking The log-K Curse On Contrastive Learners With FlatNCE
Junya Chen
Zhe Gan
Xuan Li
Qing Guo
Liqun Chen
...
Belinda Zeng
Wenlian Lu
Fan Li
Lawrence Carin
Chenyang Tao
96
28
0
02 Jul 2021
ResIST: Layer-Wise Decomposition of ResNets for Distributed Training
Chen Dun
Cameron R. Wolfe
C. Jermaine
Anastasios Kyrillidis
87
21
0
02 Jul 2021
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
169
101
0
01 Jul 2021
JUWELS Booster -- A Supercomputer for Large-Scale AI Research
Stefan Kesselheim
A. Herten
K. Krajsek
J. Ebert
J. Jitsev
...
A. Strube
Roshni Kamath
Martin G. Schultz
M. Riedel
T. Lippert
GNN
78
16
0
30 Jun 2021
Self-Contrastive Learning: Single-viewed Supervised Contrastive Framework using Sub-network
Sangmin Bae
Sungnyun Kim
Jongwoo Ko
Gihun Lee
SeungJong Noh
Se-Young Yun
SSL
81
6
0
29 Jun 2021
Early Convolutions Help Transformers See Better
Tete Xiao
Mannat Singh
Eric Mintun
Trevor Darrell
Piotr Dollár
Ross B. Girshick
86
778
0
28 Jun 2021