Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.02677
Cited By
v1
v2 (latest)
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
8 June 2017
Priya Goyal
Piotr Dollár
Ross B. Girshick
P. Noordhuis
Lukasz Wesolowski
Aapo Kyrola
Andrew Tulloch
Yangqing Jia
Kaiming He
3DH
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour"
50 / 2,054 papers shown
Title
Margin-Based Regularization and Selective Sampling in Deep Neural Networks
Berry Weinstein
Shai Fine
Y. Hel-Or
MQ
32
2
0
13 Sep 2020
Low-Rank Training of Deep Neural Networks for Emerging Memory Technology
Albert Gural
P. Nadeau
M. Tikekar
B. Murmann
68
5
0
08 Sep 2020
Improved Modeling of 3D Shapes with Multi-view Depth Maps
Kamal Gupta
Susmija Jabbireddy
Ketul Shah
Abhinav Shrivastava
Matthias Zwicker
3DV
43
5
0
07 Sep 2020
S-SGD: Symmetrical Stochastic Gradient Descent with Weight Noise Injection for Reaching Flat Minima
Wonyong Sung
Iksoo Choi
Jinhwan Park
Seokhyun Choi
Sungho Shin
ODL
58
7
0
05 Sep 2020
LiftFormer: 3D Human Pose Estimation using attention models
Adrian Llopart
47
9
0
01 Sep 2020
VarifocalNet: An IoU-aware Dense Object Detector
Haoyang Zhang
Ying Wang
Feras Dayoub
Niko Sünderhauf
ObjD
131
696
0
31 Aug 2020
Puzzle-AE: Novelty Detection in Images through Solving Puzzles
Mohammadreza Salehi
Ainaz Eftekhar
Niousha Sadjadi
M. Rohban
Hamid R. Rabiee
AAML
183
44
0
29 Aug 2020
Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning
Aurick Qiao
Sang Keun Choe
Suhas Jayaram Subramanya
Willie Neiswanger
Qirong Ho
Hao Zhang
G. Ganger
Eric Xing
VLM
77
183
0
27 Aug 2020
Relation/Entity-Centric Reading Comprehension
Takeshi Onishi
28
0
0
27 Aug 2020
HydaLearn: Highly Dynamic Task Weighting for Multi-task Learning with Auxiliary Tasks
Sam Verboven
M. H. Chaudhary
Jeroen Berrevoets
Wouter Verbeke
57
7
0
26 Aug 2020
Improving Semi-supervised Federated Learning by Reducing the Gradient Diversity of Models
Zhengming Zhang
Yaoqing Yang
Z. Yao
Yujun Yan
Joseph E. Gonzalez
Michael W. Mahoney
FedML
131
36
0
26 Aug 2020
Precision Health Data: Requirements, Challenges and Existing Techniques for Data Security and Privacy
Chandra Thapa
S. Çamtepe
47
213
0
24 Aug 2020
Memory-based Jitter: Improving Visual Recognition on Long-tailed Data with Diversity In Memory
Jialun Liu
Jingwei Zhang
Yi yang
Wenhui Li
Fangqiu Yi
Yifan Sun
84
41
0
22 Aug 2020
A(DP)
2
^2
2
SGD: Asynchronous Decentralized Parallel Stochastic Gradient Descent with Differential Privacy
Jie Xu
Wei Zhang
Fei Wang
FedML
71
8
0
21 Aug 2020
Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged Fraudsters
Yingtong Dou
Zhiwei Liu
Li Sun
Yutong Deng
Hao Peng
Philip S. Yu
AAML
132
483
0
19 Aug 2020
A Computational-Graph Partitioning Method for Training Memory-Constrained DNNs
Fareed Qararyah
Mohamed Wahib
Douga Dikbayir
M. E. Belviranli
Didem Unat
71
10
0
19 Aug 2020
CosyPose: Consistent multi-view multi-object 6D pose estimation
Yann Labbé
Justin Carpentier
Mathieu Aubry
Josef Sivic
107
443
0
19 Aug 2020
Training Deep Neural Networks Without Batch Normalization
D. Gaur
Joachim Folz
Andreas Dengel
ODL
54
10
0
18 Aug 2020
AP-Loss for Accurate One-Stage Object Detection
Kean Chen
Weiyao Lin
Jianguo Li
John See
Ji Wang
Junni Zou
ObjD
97
67
0
17 Aug 2020
Domain-specific Communication Optimization for Distributed DNN Training
Hao Wang
Jingrong Chen
Xinchen Wan
Han Tian
Jiacheng Xia
Gaoxiong Zeng
Weiyan Wang
Kai Chen
Wei Bai
Junchen Jiang
AI4CE
26
16
0
16 Aug 2020
BroadFace: Looking at Tens of Thousands of People at Once for Face Recognition
Y. Kim
Wonpyo Park
Jongju Shin
CVBM
141
51
0
15 Aug 2020
Can weight sharing outperform random architecture search? An investigation with TuNAS
Gabriel Bender
Hanxiao Liu
Bo Chen
Grace Chu
Shuyang Cheng
Pieter-Jan Kindermans
Quoc V. Le
OOD
85
123
0
13 Aug 2020
Learning Temporally Invariant and Localizable Features via Data Augmentation for Video Recognition
Taeoh Kim
Hyeongmin Lee
Myeongah Cho
Hankook Lee
Dong Heon Cho
Sangyoun Lee
88
26
0
13 Aug 2020
PROFIT: A Novel Training Method for sub-4-bit MobileNet Models
Eunhyeok Park
S. Yoo
MQ
64
85
0
11 Aug 2020
MHSA-Net: Multi-Head Self-Attention Network for Occluded Person Re-Identification
Hongchen Tan
Xiuping Liu
Baocai Yin
Xin Li
100
84
0
10 Aug 2020
Incomplete Descriptor Mining with Elastic Loss for Person Re-Identification
Hongchen Tan
Yuhao Bian
Huasheng Wang
Xiuping Liu
Baocai Yin
128
71
0
10 Aug 2020
A Survey on Large-scale Machine Learning
Meng Wang
Weijie Fu
Xiangnan He
Shijie Hao
Xindong Wu
84
112
0
10 Aug 2020
Spatiotemporal Contrastive Video Representation Learning
Rui Qian
Tianjian Meng
Boqing Gong
Ming-Hsuan Yang
Haoran Wang
Serge J. Belongie
Huayu Chen
SSL
AI4TS
144
502
0
09 Aug 2020
DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System
Liqiang Zhang
Chengzhu Yu
Heng Lu
Chao Weng
Chunlei Zhang
Yusong Wu
Xiang Xie
Zijin Li
Dong Yu
68
34
0
07 Aug 2020
1st Place Solutions of Waymo Open Dataset Challenge 2020 -- 2D Object Detection Track
Zehao Huang
Zehui Chen
Qiaofei Li
Hongkai Zhang
Naiyan Wang
76
13
0
04 Aug 2020
Making Coherence Out of Nothing At All: Measuring the Evolution of Gradient Alignment
S. Chatterjee
Piotr Zielinski
54
8
0
03 Aug 2020
Generalized Zero-Shot Domain Adaptation via Coupled Conditional Variational Autoencoders
Qian Wang
T. Breckon
70
12
0
03 Aug 2020
Evolving Multi-Resolution Pooling CNN for Monaural Singing Voice Separation
Weitao Yuan
Bofei Dong
Shengbei Wang
M. Unoki
Wenwu Wang
58
12
0
03 Aug 2020
Augmented Skeleton Based Contrastive Action Learning with Momentum LSTM for Unsupervised Action Recognition
Haocong Rao
Shihao Xu
Xiping Hu
Jun Cheng
Bin Hu
106
195
0
01 Aug 2020
Multi-node Bert-pretraining: Cost-efficient Approach
Jiahuang Lin
Xuelong Li
Gennady Pekhimenko
54
13
0
01 Aug 2020
LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task Activities
Baoxiong Jia
Yixin Chen
Siyuan Huang
Yixin Zhu
Song-Chun Zhu
42
54
0
31 Jul 2020
Growing Efficient Deep Networks by Structured Continuous Sparsification
Xin Yuan
Pedro H. P. Savarese
Michael Maire
3DPC
61
45
0
30 Jul 2020
Stochastic Normalized Gradient Descent with Momentum for Large-Batch Training
Shen-Yi Zhao
Chang-Wei Shi
Yin-Peng Xie
Wu-Jun Li
ODL
87
10
0
28 Jul 2020
CSER: Communication-efficient SGD with Error Reset
Cong Xie
Shuai Zheng
Oluwasanmi Koyejo
Indranil Gupta
Mu Li
Yanghua Peng
108
40
0
26 Jul 2020
The Case for Strong Scaling in Deep Learning: Training Large 3D CNNs with Hybrid Parallelism
Yosuke Oyama
N. Maruyama
Nikoli Dryden
Erin McCarthy
P. Harrington
J. Balewski
Satoshi Matsuoka
Peter Nugent
B. Van Essen
3DV
AI4CE
71
37
0
25 Jul 2020
Improving compute efficacy frontiers with SliceOut
Pascal Notin
Aidan Gomez
Joanna Yoo
Y. Gal
21
1
0
21 Jul 2020
Hierarchical Contrastive Motion Learning for Video Action Recognition
Xitong Yang
Xiaodong Yang
Sifei Liu
Deqing Sun
L. Davis
Jan Kautz
SSL
110
13
0
20 Jul 2020
DBQ: A Differentiable Branch Quantizer for Lightweight Deep Neural Networks
Hassan Dbouk
Hetul Sanghvi
M. Mehendale
Naresh R Shanbhag
MQ
51
9
0
19 Jul 2020
Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets
Tong Wu
Qingqiu Huang
Ziwei Liu
Yu Wang
Dahua Lin
94
231
0
19 Jul 2020
Boundary-preserving Mask R-CNN
Tianheng Cheng
Xinggang Wang
Lichao Huang
Wenyu Liu
ISeg
104
207
0
17 Jul 2020
Progressive Multi-stage Feature Mix for Person Re-Identification
Yan Zhang
Binyu He
Li Sun
34
0
0
17 Jul 2020
A Technical Report for VIPriors Image Classification Challenge
Zhipeng Luo
Ge Li
Zhiguang Zhang
VLM
68
3
0
17 Jul 2020
A New Look at Ghost Normalization
Neofytos Dimitriou
Ognjen Arandjelovic
138
8
0
16 Jul 2020
CSI: Novelty Detection via Contrastive Learning on Distributionally Shifted Instances
Jihoon Tack
Sangwoo Mo
Jongheon Jeong
Jinwoo Shin
OODD
85
608
0
16 Jul 2020
Probabilistic Anchor Assignment with IoU Prediction for Object Detection
Kang-jik Kim
Hee Seok Lee
189
403
0
16 Jul 2020
Previous
1
2
3
...
26
27
28
...
40
41
42
Next