Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.02677
Cited By
v1
v2 (latest)
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
8 June 2017
Priya Goyal
Piotr Dollár
Ross B. Girshick
P. Noordhuis
Lukasz Wesolowski
Aapo Kyrola
Andrew Tulloch
Yangqing Jia
Kaiming He
3DH
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour"
50 / 2,054 papers shown
Title
Drawing Multiple Augmentation Samples Per Image During Training Efficiently Decreases Test Error
Stanislav Fort
Andrew Brock
Razvan Pascanu
Soham De
Samuel L. Smith
64
32
0
27 May 2021
SSAN: Separable Self-Attention Network for Video Representation Learning
Xudong Guo
Xun Guo
Yan Lu
ViT
AI4TS
55
26
0
27 May 2021
Rethinking InfoNCE: How Many Negative Samples Do You Need?
Chuhan Wu
Fangzhao Wu
Yongfeng Huang
65
45
0
27 May 2021
Joint-DetNAS: Upgrade Your Detector with NAS, Pruning and Dynamic Distillation
Lewei Yao
Renjie Pi
Hang Xu
Wei Zhang
Zhenguo Li
Tong Zhang
142
41
0
27 May 2021
Blurs Behave Like Ensembles: Spatial Smoothings to Improve Accuracy, Uncertainty, and Robustness
Namuk Park
S. Kim
UQCV
AAML
93
21
0
26 May 2021
Estimating the Uncertainty of Neural Network Forecasts for Influenza Prevalence Using Web Search Activity
Michael Morris
Peter A. Hayes
Ingemar J. Cox
Vasileios Lampos
115
1
0
26 May 2021
Large-Scale Attribute-Object Compositions
Filip Radenovic
Animesh Sinha
Albert Gordo
Tamara L. Berg
D. Mahajan
OCL
117
7
0
24 May 2021
Dorylus: Affordable, Scalable, and Accurate GNN Training with Distributed CPU Servers and Serverless Threads
John Thorpe
Yifan Qiao
Jon Eyolfson
Shen Teng
Guanzhou Hu
...
Jinliang Wei
Keval Vora
Ravi Netravali
Miryung Kim
G. Xu
GNN
66
144
0
24 May 2021
Fast Federated Learning by Balancing Communication Trade-Offs
Milad Khademi Nori
Sangseok Yun
Il-Min Kim
FedML
77
57
0
23 May 2021
PLM: Partial Label Masking for Imbalanced Multi-label Classification
Kevin Duarte
Yogesh S Rawat
M. Shah
76
15
0
22 May 2021
AutoLRS: Automatic Learning-Rate Schedule by Bayesian Optimization on the Fly
Yuchen Jin
Dinesh Manocha
Liangyu Zhao
Yibo Zhu
Chuanxiong Guo
Marco Canini
Arvind Krishnamurthy
95
19
0
22 May 2021
Content-Augmented Feature Pyramid Network with Light Linear Spatial Transformers for Object Detection
Yongxiang Gu
Xiaolin Qin
Yuncong Peng
Lu Li
ViT
20
7
0
20 May 2021
High performance and energy efficient inference for deep learning on ARM processors
Adrián Castelló
S. Barrachina
Manuel F. Dolz
Enrique S. Quintana-Ortí
Pau San Juan
3DH
BDL
16
1
0
19 May 2021
Accelerating Gossip SGD with Periodic Global Averaging
Yiming Chen
Kun Yuan
Yingya Zhang
Pan Pan
Yinghui Xu
W. Yin
79
44
0
19 May 2021
Exemplar-Based Open-Set Panoptic Segmentation Network
Jaedong Hwang
Seoung Wug Oh
Joon-Young Lee
Bohyung Han
VLM
140
51
0
18 May 2021
Divide and Contrast: Self-supervised Learning from Uncurated Data
Yonglong Tian
Olivier J. Hénaff
Aaron van den Oord
SSL
138
101
0
17 May 2021
Rethinking "Batch" in BatchNorm
Yuxin Wu
Justin Johnson
BDL
123
66
0
17 May 2021
Drill the Cork of Information Bottleneck by Inputting the Most Important Data
Xinyu Peng
Jiawei Zhang
Feiyue Wang
Li Li
43
6
0
15 May 2021
MutualNet: Adaptive ConvNet via Mutual Learning from Different Model Configurations
Taojiannan Yang
Sijie Zhu
Matías Mendieta
Pu Wang
Ravikumar Balakrishnan
Minwoo Lee
T. Han
M. Shah
Chong Chen
3DH
OOD
102
24
0
14 May 2021
Scaling Ensemble Distribution Distillation to Many Classes with Proxy Targets
Max Ryabinin
A. Malinin
Mark Gales
UQCV
53
18
0
14 May 2021
Disentangling Sampling and Labeling Bias for Learning in Large-Output Spaces
A. S. Rawat
A. Menon
Wittawat Jitkrittum
Sadeep Jayasumana
Felix X. Yu
Sashank J. Reddi
Sanjiv Kumar
62
9
0
12 May 2021
VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised Learning
Adrien Bardes
Jean Ponce
Yann LeCun
SSL
DML
234
947
0
11 May 2021
Contrastive Attraction and Contrastive Repulsion for Representation Learning
Huangjie Zheng
Xu Chen
Jiangchao Yao
Hongxia Yang
Chunyuan Li
Ya Zhang
Hao Zhang
Ivor Tsang
Jingren Zhou
Mingyuan Zhou
SSL
113
12
0
08 May 2021
Probabilistic Ranking-Aware Ensembles for Enhanced Object Detections
Mingyuan Mao
Baochang Zhang
David Doermann
Jie Guo
Shumin Han
Yuan Feng
Xiaodi Wang
Errui Ding
65
2
0
07 May 2021
Automated scoring of pre-REM sleep in mice with deep learning
Niklas Grieger
J. Schwabedal
Stefanie Wendel
Y. Ritze
Stephan Bialonski
34
20
0
05 May 2021
On the limit of English conversational speech recognition
Zoltán Tüske
G. Saon
Brian Kingsbury
94
50
0
03 May 2021
On Feature Decorrelation in Self-Supervised Learning
Tianyu Hua
Wenxiao Wang
Zihui Xue
Sucheng Ren
Yue Wang
Hang Zhao
SSL
OOD
198
197
0
02 May 2021
A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning
Christoph Feichtenhofer
Haoqi Fan
Bo Xiong
Ross B. Girshick
Kaiming He
SSL
AI4TS
110
263
0
29 Apr 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
1.0K
6,154
0
29 Apr 2021
Pseudo-IoU: Improving Label Assignment in Anchor-Free Object Detection
Jiachen Li
Bowen Cheng
Rogerio Feris
Jinjun Xiong
Thomas S.Huang
Wen-mei W. Hwu
Humphrey Shi
57
19
0
29 Apr 2021
SGNet: A Super-class Guided Network for Image Classification and Object Detection
Kaidong Li
Ningning Wang
Yiju Yang
Guanghui Wang
156
22
0
26 Apr 2021
DecentLaM: Decentralized Momentum SGD for Large-batch Deep Training
Kun Yuan
Yiming Chen
Xinmeng Huang
Yingya Zhang
Pan Pan
Yinghui Xu
W. Yin
MoE
115
64
0
24 Apr 2021
Partitioning sparse deep neural networks for scalable training and inference
G. Demirci
Hakan Ferhatosmanoglu
35
11
0
23 Apr 2021
Multiscale Vision Transformers
Haoqi Fan
Bo Xiong
K. Mangalam
Yanghao Li
Zhicheng Yan
Jitendra Malik
Christoph Feichtenhofer
ViT
143
1,274
0
22 Apr 2021
Analyzing Monotonic Linear Interpolation in Neural Network Loss Landscapes
James Lucas
Juhan Bae
Michael Ruogu Zhang
Stanislav Fort
R. Zemel
Roger C. Grosse
MoMe
244
28
0
22 Apr 2021
IB-DRR: Incremental Learning with Information-Back Discrete Representation Replay
Jian Jiang
Edoardo Cetin
Oya Celiktutan
56
9
0
21 Apr 2021
ScaleCom: Scalable Sparsified Gradient Compression for Communication-Efficient Distributed Training
Chia-Yu Chen
Jiamin Ni
Songtao Lu
Xiaodong Cui
Pin-Yu Chen
...
Naigang Wang
Swagath Venkataramani
Vijayalakshmi Srinivasan
Wei Zhang
K. Gopalakrishnan
79
67
0
21 Apr 2021
Parallel Physics-Informed Neural Networks via Domain Decomposition
K. Shukla
Ameya Dilip Jagtap
George Karniadakis
PINN
181
289
0
20 Apr 2021
Federated Word2Vec: Leveraging Federated Learning to Encourage Collaborative Representation Learning
Daniel Garcia Bernal
Lodovico Giaretta
Sarunas Girdzijauskas
Magnus Sahlgren
FedML
108
4
0
19 Apr 2021
Single-view robot pose and joint angle estimation via render & compare
Yann Labbé
Justin Carpentier
Mathieu Aubry
Josef Sivic
108
43
0
19 Apr 2021
Distilling Knowledge via Knowledge Review
Pengguang Chen
Shu Liu
Hengshuang Zhao
Jiaya Jia
220
453
0
19 Apr 2021
Writing in The Air: Unconstrained Text Recognition from Finger Movement Using Spatio-Temporal Convolution
Ue-Hwan Kim
Yewon Hwang
Sun-Kyung Lee
Jong-Hwan Kim
64
20
0
19 Apr 2021
Sync-Switch: Hybrid Parameter Synchronization for Distributed Deep Learning
Shijian Li
Oren Mangoubi
Lijie Xu
Tian Guo
101
15
0
16 Apr 2021
"BNN - BN = ?": Training Binary Neural Networks without Batch Normalization
Tianlong Chen
Zhenyu Zhang
Xu Ouyang
Zechun Liu
Zhiqiang Shen
Zhangyang Wang
MQ
102
37
0
16 Apr 2021
Reinforced Neighborhood Selection Guided Multi-Relational Graph Neural Networks
Hao Peng
Ruitong Zhang
Yingtong Dou
Renyu Yang
Jingyi Zhang
Philip S. Yu
143
119
0
16 Apr 2021
Points as Queries: Weakly Semi-supervised Object Detection by Points
Liangyu Chen
Tong Yang
Xinming Zhang
Wei Zhang
Jian Sun
100
86
0
15 Apr 2021
Learning Metrics from Mean Teacher: A Supervised Learning Method for Improving the Generalization of Speaker Verification System
Ju-ho Kim
Hye-jin Shim
Jee-weon Jung
Ha-Jin Yu
114
1
0
14 Apr 2021
Distributed Learning Systems with First-order Methods
Ji Liu
Ce Zhang
36
44
0
12 Apr 2021
Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LM
Deepak Narayanan
Mohammad Shoeybi
Jared Casper
P. LeGresley
M. Patwary
...
Prethvi Kashinkunti
J. Bernauer
Bryan Catanzaro
Amar Phanishayee
Matei A. Zaharia
MoE
232
716
0
09 Apr 2021
InAugment: Improving Classifiers via Internal Augmentation
Moab Arar
Ariel Shamir
Amit H. Bermano
31
2
0
08 Apr 2021
Previous
1
2
3
...
21
22
23
...
40
41
42
Next