ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.02677
  4. Cited By
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
v1v2 (latest)

Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

8 June 2017
Priya Goyal
Piotr Dollár
Ross B. Girshick
P. Noordhuis
Lukasz Wesolowski
Aapo Kyrola
Andrew Tulloch
Yangqing Jia
Kaiming He
    3DH
ArXiv (abs)PDFHTML

Papers citing "Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour"

50 / 2,054 papers shown
Title
Margin-Based Regularization and Selective Sampling in Deep Neural
  Networks
Margin-Based Regularization and Selective Sampling in Deep Neural Networks
Berry Weinstein
Shai Fine
Y. Hel-Or
MQ
32
2
0
13 Sep 2020
Low-Rank Training of Deep Neural Networks for Emerging Memory Technology
Low-Rank Training of Deep Neural Networks for Emerging Memory Technology
Albert Gural
P. Nadeau
M. Tikekar
B. Murmann
68
5
0
08 Sep 2020
Improved Modeling of 3D Shapes with Multi-view Depth Maps
Improved Modeling of 3D Shapes with Multi-view Depth Maps
Kamal Gupta
Susmija Jabbireddy
Ketul Shah
Abhinav Shrivastava
Matthias Zwicker
3DV
43
5
0
07 Sep 2020
S-SGD: Symmetrical Stochastic Gradient Descent with Weight Noise
  Injection for Reaching Flat Minima
S-SGD: Symmetrical Stochastic Gradient Descent with Weight Noise Injection for Reaching Flat Minima
Wonyong Sung
Iksoo Choi
Jinhwan Park
Seokhyun Choi
Sungho Shin
ODL
58
7
0
05 Sep 2020
LiftFormer: 3D Human Pose Estimation using attention models
LiftFormer: 3D Human Pose Estimation using attention models
Adrian Llopart
47
9
0
01 Sep 2020
VarifocalNet: An IoU-aware Dense Object Detector
VarifocalNet: An IoU-aware Dense Object Detector
Haoyang Zhang
Ying Wang
Feras Dayoub
Niko Sünderhauf
ObjD
131
696
0
31 Aug 2020
Puzzle-AE: Novelty Detection in Images through Solving Puzzles
Puzzle-AE: Novelty Detection in Images through Solving Puzzles
Mohammadreza Salehi
Ainaz Eftekhar
Niousha Sadjadi
M. Rohban
Hamid R. Rabiee
AAML
183
44
0
29 Aug 2020
Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep
  Learning
Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning
Aurick Qiao
Sang Keun Choe
Suhas Jayaram Subramanya
Willie Neiswanger
Qirong Ho
Hao Zhang
G. Ganger
Eric Xing
VLM
77
183
0
27 Aug 2020
Relation/Entity-Centric Reading Comprehension
Relation/Entity-Centric Reading Comprehension
Takeshi Onishi
28
0
0
27 Aug 2020
HydaLearn: Highly Dynamic Task Weighting for Multi-task Learning with
  Auxiliary Tasks
HydaLearn: Highly Dynamic Task Weighting for Multi-task Learning with Auxiliary Tasks
Sam Verboven
M. H. Chaudhary
Jeroen Berrevoets
Wouter Verbeke
57
7
0
26 Aug 2020
Improving Semi-supervised Federated Learning by Reducing the Gradient
  Diversity of Models
Improving Semi-supervised Federated Learning by Reducing the Gradient Diversity of Models
Zhengming Zhang
Yaoqing Yang
Z. Yao
Yujun Yan
Joseph E. Gonzalez
Michael W. Mahoney
FedML
131
36
0
26 Aug 2020
Precision Health Data: Requirements, Challenges and Existing Techniques
  for Data Security and Privacy
Precision Health Data: Requirements, Challenges and Existing Techniques for Data Security and Privacy
Chandra Thapa
S. Çamtepe
47
213
0
24 Aug 2020
Memory-based Jitter: Improving Visual Recognition on Long-tailed Data
  with Diversity In Memory
Memory-based Jitter: Improving Visual Recognition on Long-tailed Data with Diversity In Memory
Jialun Liu
Jingwei Zhang
Yi yang
Wenhui Li
Fangqiu Yi
Yifan Sun
84
41
0
22 Aug 2020
A(DP)$^2$SGD: Asynchronous Decentralized Parallel Stochastic Gradient
  Descent with Differential Privacy
A(DP)2^22SGD: Asynchronous Decentralized Parallel Stochastic Gradient Descent with Differential Privacy
Jie Xu
Wei Zhang
Fei Wang
FedML
71
8
0
21 Aug 2020
Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged
  Fraudsters
Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged Fraudsters
Yingtong Dou
Zhiwei Liu
Li Sun
Yutong Deng
Hao Peng
Philip S. Yu
AAML
132
483
0
19 Aug 2020
A Computational-Graph Partitioning Method for Training
  Memory-Constrained DNNs
A Computational-Graph Partitioning Method for Training Memory-Constrained DNNs
Fareed Qararyah
Mohamed Wahib
Douga Dikbayir
M. E. Belviranli
Didem Unat
71
10
0
19 Aug 2020
CosyPose: Consistent multi-view multi-object 6D pose estimation
CosyPose: Consistent multi-view multi-object 6D pose estimation
Yann Labbé
Justin Carpentier
Mathieu Aubry
Josef Sivic
107
443
0
19 Aug 2020
Training Deep Neural Networks Without Batch Normalization
Training Deep Neural Networks Without Batch Normalization
D. Gaur
Joachim Folz
Andreas Dengel
ODL
54
10
0
18 Aug 2020
AP-Loss for Accurate One-Stage Object Detection
AP-Loss for Accurate One-Stage Object Detection
Kean Chen
Weiyao Lin
Jianguo Li
John See
Ji Wang
Junni Zou
ObjD
97
67
0
17 Aug 2020
Domain-specific Communication Optimization for Distributed DNN Training
Domain-specific Communication Optimization for Distributed DNN Training
Hao Wang
Jingrong Chen
Xinchen Wan
Han Tian
Jiacheng Xia
Gaoxiong Zeng
Weiyan Wang
Kai Chen
Wei Bai
Junchen Jiang
AI4CE
26
16
0
16 Aug 2020
BroadFace: Looking at Tens of Thousands of People at Once for Face
  Recognition
BroadFace: Looking at Tens of Thousands of People at Once for Face Recognition
Y. Kim
Wonpyo Park
Jongju Shin
CVBM
141
51
0
15 Aug 2020
Can weight sharing outperform random architecture search? An
  investigation with TuNAS
Can weight sharing outperform random architecture search? An investigation with TuNAS
Gabriel Bender
Hanxiao Liu
Bo Chen
Grace Chu
Shuyang Cheng
Pieter-Jan Kindermans
Quoc V. Le
OOD
85
123
0
13 Aug 2020
Learning Temporally Invariant and Localizable Features via Data
  Augmentation for Video Recognition
Learning Temporally Invariant and Localizable Features via Data Augmentation for Video Recognition
Taeoh Kim
Hyeongmin Lee
Myeongah Cho
Hankook Lee
Dong Heon Cho
Sangyoun Lee
88
26
0
13 Aug 2020
PROFIT: A Novel Training Method for sub-4-bit MobileNet Models
PROFIT: A Novel Training Method for sub-4-bit MobileNet Models
Eunhyeok Park
S. Yoo
MQ
64
85
0
11 Aug 2020
MHSA-Net: Multi-Head Self-Attention Network for Occluded Person
  Re-Identification
MHSA-Net: Multi-Head Self-Attention Network for Occluded Person Re-Identification
Hongchen Tan
Xiuping Liu
Baocai Yin
Xin Li
100
84
0
10 Aug 2020
Incomplete Descriptor Mining with Elastic Loss for Person
  Re-Identification
Incomplete Descriptor Mining with Elastic Loss for Person Re-Identification
Hongchen Tan
Yuhao Bian
Huasheng Wang
Xiuping Liu
Baocai Yin
128
71
0
10 Aug 2020
A Survey on Large-scale Machine Learning
A Survey on Large-scale Machine Learning
Meng Wang
Weijie Fu
Xiangnan He
Shijie Hao
Xindong Wu
84
112
0
10 Aug 2020
Spatiotemporal Contrastive Video Representation Learning
Spatiotemporal Contrastive Video Representation Learning
Rui Qian
Tianjian Meng
Boqing Gong
Ming-Hsuan Yang
Haoran Wang
Serge J. Belongie
Huayu Chen
SSLAI4TS
144
502
0
09 Aug 2020
DurIAN-SC: Duration Informed Attention Network based Singing Voice
  Conversion System
DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System
Liqiang Zhang
Chengzhu Yu
Heng Lu
Chao Weng
Chunlei Zhang
Yusong Wu
Xiang Xie
Zijin Li
Dong Yu
68
34
0
07 Aug 2020
1st Place Solutions of Waymo Open Dataset Challenge 2020 -- 2D Object
  Detection Track
1st Place Solutions of Waymo Open Dataset Challenge 2020 -- 2D Object Detection Track
Zehao Huang
Zehui Chen
Qiaofei Li
Hongkai Zhang
Naiyan Wang
76
13
0
04 Aug 2020
Making Coherence Out of Nothing At All: Measuring the Evolution of
  Gradient Alignment
Making Coherence Out of Nothing At All: Measuring the Evolution of Gradient Alignment
S. Chatterjee
Piotr Zielinski
54
8
0
03 Aug 2020
Generalized Zero-Shot Domain Adaptation via Coupled Conditional
  Variational Autoencoders
Generalized Zero-Shot Domain Adaptation via Coupled Conditional Variational Autoencoders
Qian Wang
T. Breckon
70
12
0
03 Aug 2020
Evolving Multi-Resolution Pooling CNN for Monaural Singing Voice
  Separation
Evolving Multi-Resolution Pooling CNN for Monaural Singing Voice Separation
Weitao Yuan
Bofei Dong
Shengbei Wang
M. Unoki
Wenwu Wang
58
12
0
03 Aug 2020
Augmented Skeleton Based Contrastive Action Learning with Momentum LSTM
  for Unsupervised Action Recognition
Augmented Skeleton Based Contrastive Action Learning with Momentum LSTM for Unsupervised Action Recognition
Haocong Rao
Shihao Xu
Xiping Hu
Jun Cheng
Bin Hu
106
195
0
01 Aug 2020
Multi-node Bert-pretraining: Cost-efficient Approach
Multi-node Bert-pretraining: Cost-efficient Approach
Jiahuang Lin
Xuelong Li
Gennady Pekhimenko
54
13
0
01 Aug 2020
LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task
  Activities
LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task Activities
Baoxiong Jia
Yixin Chen
Siyuan Huang
Yixin Zhu
Song-Chun Zhu
42
54
0
31 Jul 2020
Growing Efficient Deep Networks by Structured Continuous Sparsification
Growing Efficient Deep Networks by Structured Continuous Sparsification
Xin Yuan
Pedro H. P. Savarese
Michael Maire
3DPC
61
45
0
30 Jul 2020
Stochastic Normalized Gradient Descent with Momentum for Large-Batch
  Training
Stochastic Normalized Gradient Descent with Momentum for Large-Batch Training
Shen-Yi Zhao
Chang-Wei Shi
Yin-Peng Xie
Wu-Jun Li
ODL
87
10
0
28 Jul 2020
CSER: Communication-efficient SGD with Error Reset
CSER: Communication-efficient SGD with Error Reset
Cong Xie
Shuai Zheng
Oluwasanmi Koyejo
Indranil Gupta
Mu Li
Yanghua Peng
108
40
0
26 Jul 2020
The Case for Strong Scaling in Deep Learning: Training Large 3D CNNs
  with Hybrid Parallelism
The Case for Strong Scaling in Deep Learning: Training Large 3D CNNs with Hybrid Parallelism
Yosuke Oyama
N. Maruyama
Nikoli Dryden
Erin McCarthy
P. Harrington
J. Balewski
Satoshi Matsuoka
Peter Nugent
B. Van Essen
3DVAI4CE
71
37
0
25 Jul 2020
Improving compute efficacy frontiers with SliceOut
Improving compute efficacy frontiers with SliceOut
Pascal Notin
Aidan Gomez
Joanna Yoo
Y. Gal
21
1
0
21 Jul 2020
Hierarchical Contrastive Motion Learning for Video Action Recognition
Hierarchical Contrastive Motion Learning for Video Action Recognition
Xitong Yang
Xiaodong Yang
Sifei Liu
Deqing Sun
L. Davis
Jan Kautz
SSL
110
13
0
20 Jul 2020
DBQ: A Differentiable Branch Quantizer for Lightweight Deep Neural
  Networks
DBQ: A Differentiable Branch Quantizer for Lightweight Deep Neural Networks
Hassan Dbouk
Hetul Sanghvi
M. Mehendale
Naresh R Shanbhag
MQ
51
9
0
19 Jul 2020
Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed
  Datasets
Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets
Tong Wu
Qingqiu Huang
Ziwei Liu
Yu Wang
Dahua Lin
94
231
0
19 Jul 2020
Boundary-preserving Mask R-CNN
Boundary-preserving Mask R-CNN
Tianheng Cheng
Xinggang Wang
Lichao Huang
Wenyu Liu
ISeg
104
207
0
17 Jul 2020
Progressive Multi-stage Feature Mix for Person Re-Identification
Progressive Multi-stage Feature Mix for Person Re-Identification
Yan Zhang
Binyu He
Li Sun
34
0
0
17 Jul 2020
A Technical Report for VIPriors Image Classification Challenge
A Technical Report for VIPriors Image Classification Challenge
Zhipeng Luo
Ge Li
Zhiguang Zhang
VLM
68
3
0
17 Jul 2020
A New Look at Ghost Normalization
A New Look at Ghost Normalization
Neofytos Dimitriou
Ognjen Arandjelovic
138
8
0
16 Jul 2020
CSI: Novelty Detection via Contrastive Learning on Distributionally
  Shifted Instances
CSI: Novelty Detection via Contrastive Learning on Distributionally Shifted Instances
Jihoon Tack
Sangwoo Mo
Jongheon Jeong
Jinwoo Shin
OODD
85
608
0
16 Jul 2020
Probabilistic Anchor Assignment with IoU Prediction for Object Detection
Probabilistic Anchor Assignment with IoU Prediction for Object Detection
Kang-jik Kim
Hee Seok Lee
189
403
0
16 Jul 2020
Previous
123...262728...404142
Next