Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.02677
Cited By
v1
v2 (latest)
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
8 June 2017
Priya Goyal
Piotr Dollár
Ross B. Girshick
P. Noordhuis
Lukasz Wesolowski
Aapo Kyrola
Andrew Tulloch
Yangqing Jia
Kaiming He
3DH
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour"
50 / 2,054 papers shown
Title
Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss
Xingyi Cheng
Hezheng Lin
Xiangyu Wu
Fan Yang
Dong Shen
93
155
0
09 Sep 2021
ISyNet: Convolutional Neural Networks design for AI accelerator
Alexey Letunovskiy
Vladimir Korviakov
V. Polovnikov
Anastasiia Kargapoltseva
I. Mazurenko
Yepan Xiong
104
1
0
04 Sep 2021
LIGAR: Lightweight General-purpose Action Recognition
Evgeny Izutov
46
3
0
30 Aug 2021
Exploring and Improving Mobile Level Vision Transformers
Pengguang Chen
Yixin Chen
Shu Liu
Ming-Hsuan Yang
Jiaya Jia
ViT
104
4
0
30 Aug 2021
4-bit Quantization of LSTM-based Speech Recognition Models
A. Fasoli
Chia-Yu Chen
Mauricio Serrano
Xiao Sun
Naigang Wang
...
Xiaodong Cui
Brian Kingsbury
Wei Zhang
Zoltán Tüske
K. Gopalakrishnan
MQ
75
23
0
27 Aug 2021
EmoBERTa: Speaker-Aware Emotion Recognition in Conversation with RoBERTa
Taewoon Kim
Piek Vossen
98
102
0
26 Aug 2021
Unsupervised domain adaptation for clinician pose estimation and instance segmentation in the operating room
V. Srivastav
A. Gangi
N. Padoy
OOD
83
11
0
26 Aug 2021
Multi-Task Self-Training for Learning General Representations
Golnaz Ghiasi
Barret Zoph
E. D. Cubuk
Quoc V. Le
Nayeon Lee
SSL
91
102
0
25 Aug 2021
A Scaling Law for Synthetic-to-Real Transfer: How Much Is Your Pre-training Effective?
Hiroaki Mikami
Kenji Fukumizu
Shogo Murai
Shuji Suzuki
Yuta Kikuchi
Taiji Suzuki
S. Maeda
Kohei Hayashi
92
12
0
25 Aug 2021
CE-Dedup: Cost-Effective Convolutional Neural Nets Training based on Image Deduplication
Xuan Li
Liqiong Chang
Xue Liu
44
8
0
23 Aug 2021
Shift-Curvature, SGD, and Generalization
Arwen V. Bradley
C. Gomez-Uribe
Manish Reddy Vuyyuru
62
3
0
21 Aug 2021
Online Continual Learning with Natural Distribution Shifts: An Empirical Study with Visual Data
Zhipeng Cai
Ozan Sener
V. Koltun
CLL
65
87
0
20 Aug 2021
Deep MRI Reconstruction with Radial Subsampling
George Yiasemis
Chaoping Zhang
C. Sánchez
Jan-Jakob Sonke
Jonas Teuwen
128
9
0
17 Aug 2021
Contextual Convolutional Neural Networks
Ionut Cosmin Duta
Mariana-Iuliana Georgescu
Radu Tudor Ionescu
62
7
0
17 Aug 2021
Masked Face Recognition Challenge: The WebFace260M Track Report
Zheng Zhu
Guan Huang
Jiankang Deng
Yun Ye
Junjie Huang
...
Tian Yang
Jia Guo
Jiwen Lu
Dalong Du
Jie Zhou
88
26
0
16 Aug 2021
Implicit Regularization of Bregman Proximal Point Algorithm and Mirror Descent on Separable Data
Yan Li
Caleb Ju
Ethan X. Fang
T. Zhao
69
9
0
15 Aug 2021
GeoCLR: Georeference Contrastive Learning for Efficient Seafloor Image Interpretation
Takaki Yamada
Adam Prugel-Bennett
Stefan B. Williams
Oscar Pizarro
B. Thornton
67
14
0
13 Aug 2021
A Distributed SGD Algorithm with Global Sketching for Deep Learning Training Acceleration
Lingfei Dai
Boyu Diao
Chao Li
Yongjun Xu
68
5
0
13 Aug 2021
Logit Attenuating Weight Normalization
Aman Gupta
R. Ramanath
Jun Shi
Anika Ramachandran
Sirou Zhou
Mingzhou Zhou
S. Keerthi
75
1
0
12 Aug 2021
Representation Learning for Remote Sensing: An Unsupervised Sensor Fusion Approach
Aidan M. Swope
X. Rudelis
Kyle T. Story
SSL
130
20
0
11 Aug 2021
FedPAGE: A Fast Local Stochastic Gradient Method for Communication-Efficient Federated Learning
Haoyu Zhao
Zhize Li
Peter Richtárik
FedML
79
29
0
10 Aug 2021
Tensor Yard: One-Shot Algorithm of Hardware-Friendly Tensor-Train Decomposition for Convolutional Neural Networks
Anuar Taskynov
Vladimir Korviakov
I. Mazurenko
Yepan Xiong
61
2
0
09 Aug 2021
BenchENAS: A Benchmarking Platform for Evolutionary Neural Architecture Search
Xiangning Xie
Yuqiao Liu
Yanan Sun
Gary G. Yen
Bing Xue
Mengjie Zhang
117
19
0
09 Aug 2021
Online Evolutionary Batch Size Orchestration for Scheduling Deep Learning Workloads in GPU Clusters
Chen Sun
Shenggui Li
Jinyue Wang
Jun Yu
114
48
0
08 Aug 2021
Impact of Aliasing on Generalization in Deep Convolutional Networks
C. N. Vasconcelos
Hugo Larochelle
Vincent Dumoulin
Rob Romijnders
Nicolas Le Roux
Ross Goroshin
OOD
124
36
0
07 Aug 2021
Toward Efficient Online Scheduling for Distributed Machine Learning Systems
Menglu Yu
Jia Liu
Chuan Wu
Bo Ji
Elizabeth S. Bentley
89
6
0
06 Aug 2021
Learning to Elect
Cem Anil
Xuchan Bao
27
8
0
05 Aug 2021
Token Shift Transformer for Video Classification
Hao Zhang
Y. Hao
Chong-Wah Ngo
ViT
87
119
0
05 Aug 2021
ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot
Jiarui Cai
Yizhou Wang
Lei Li
OODD
119
140
0
05 Aug 2021
Domain Adaptor Networks for Hyperspectral Image Recognition
Gustavo Pérez
Subhransu Maji
31
0
0
03 Aug 2021
AASAE: Augmentation-Augmented Stochastic Autoencoders
William Falcon
A. Jha
Teddy Koker
Kyunghyun Cho
102
0
0
26 Jul 2021
Adaptive Recursive Circle Framework for Fine-grained Action Recognition
Hanxi Lin
Xinxiao Wu
Jiebo Luo
65
2
0
25 Jul 2021
Compressing Neural Networks: Towards Determining the Optimal Layer-wise Decomposition
Lucas Liebenwein
Alaa Maalouf
O. Gal
Dan Feldman
Daniela Rus
75
47
0
23 Jul 2021
Taxonomizing local versus global structure in neural network loss landscapes
Yaoqing Yang
Liam Hodgkinson
Ryan Theisen
Joe Zou
Joseph E. Gonzalez
Kannan Ramchandran
Michael W. Mahoney
121
37
0
23 Jul 2021
Bias Loss for Mobile Neural Networks
L. Abrahamyan
Valentin Ziatchin
Yiming Chen
Nikos Deligiannis
45
14
0
23 Jul 2021
OODformer: Out-Of-Distribution Detection Transformer
Rajat Koner
Poulami Sinhamahapatra
Karsten Roscher
Stephan Günnemann
Volker Tresp
ViT
64
40
0
19 Jul 2021
Face.evoLVe: A High-Performance Face Recognition Library
Qingzhong Wang
Pengfei Zhang
Haoyi Xiong
Jian-jun Zhao
CVBM
110
63
0
19 Jul 2021
YOLOX: Exceeding YOLO Series in 2021
Zheng Ge
Songtao Liu
Feng Wang
Zeming Li
Jian Sun
ObjD
245
4,135
0
18 Jul 2021
Chimera: Efficiently Training Large-Scale Neural Networks with Bidirectional Pipelines
Shigang Li
Torsten Hoefler
GNN
AI4CE
LRM
130
138
0
14 Jul 2021
Accelerating Distributed K-FAC with Smart Parallelism of Computing and Communication Tasks
Shaoshuai Shi
Lin Zhang
Yue Liu
123
9
0
14 Jul 2021
Automated Learning Rate Scheduler for Large-batch Training
Chiheon Kim
Saehoon Kim
Jongmin Kim
Donghoon Lee
Sungwoong Kim
54
20
0
13 Jul 2021
Joint Matrix Decomposition for Deep Convolutional Neural Networks Compression
Shaowu Chen
Jihao Zhou
Weize Sun
Lei Huang
45
21
0
09 Jul 2021
REX: Revisiting Budgeted Training with an Improved Schedule
John Chen
Cameron R. Wolfe
Anastasios Kyrillidis
59
9
0
09 Jul 2021
Poly-NL: Linear Complexity Non-local Layers with Polynomials
F. Babiloni
Ioannis Marras
Filippos Kokkinos
Jiankang Deng
Grigorios G. Chrysos
Stefanos Zafeiriou
63
6
0
06 Jul 2021
Simpler, Faster, Stronger: Breaking The log-K Curse On Contrastive Learners With FlatNCE
Junya Chen
Zhe Gan
Xuan Li
Qing Guo
Liqun Chen
...
Belinda Zeng
Wenlian Lu
Fan Li
Lawrence Carin
Chenyang Tao
96
28
0
02 Jul 2021
ResIST: Layer-Wise Decomposition of ResNets for Distributed Training
Chen Dun
Cameron R. Wolfe
C. Jermaine
Anastasios Kyrillidis
87
21
0
02 Jul 2021
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
169
101
0
01 Jul 2021
JUWELS Booster -- A Supercomputer for Large-Scale AI Research
Stefan Kesselheim
A. Herten
K. Krajsek
J. Ebert
J. Jitsev
...
A. Strube
Roshni Kamath
Martin G. Schultz
M. Riedel
T. Lippert
GNN
78
16
0
30 Jun 2021
Self-Contrastive Learning: Single-viewed Supervised Contrastive Framework using Sub-network
Sangmin Bae
Sungnyun Kim
Jongwoo Ko
Gihun Lee
SeungJong Noh
Se-Young Yun
SSL
81
6
0
29 Jun 2021
Early Convolutions Help Transformers See Better
Tete Xiao
Mannat Singh
Eric Mintun
Trevor Darrell
Piotr Dollár
Ross B. Girshick
86
778
0
28 Jun 2021
Previous
1
2
3
...
19
20
21
...
40
41
42
Next