Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.02677
Cited By
v1
v2 (latest)
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
8 June 2017
Priya Goyal
Piotr Dollár
Ross B. Girshick
P. Noordhuis
Lukasz Wesolowski
Aapo Kyrola
Andrew Tulloch
Yangqing Jia
Kaiming He
3DH
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour"
50 / 2,054 papers shown
Title
GLEAM: Greedy Learning for Large-Scale Accelerated MRI Reconstruction
Batu Mehmet Ozturkler
Arda Sahiner
Tolga Ergen
Arjun D Desai
Christopher M. Sandino
S. Vasanawala
John M. Pauly
Morteza Mardani
Mert Pilanci
55
4
0
18 Jul 2022
TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
Yuqi Liu
Pengfei Xiong
Luhui Xu
Shengming Cao
Qin Jin
95
122
0
16 Jul 2022
On the Strong Correlation Between Model Invariance and Generalization
Weijian Deng
Stephen Gould
Liang Zheng
OOD
91
19
0
14 Jul 2022
Scene Text Recognition with Permuted Autoregressive Sequence Models
Darwin Bautista
Rowel Atienza
108
174
0
14 Jul 2022
TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels
Yaodong Yu
Alexander Wei
Sai Praneeth Karimireddy
Yi-An Ma
Michael I. Jordan
FedML
80
31
0
13 Jul 2022
DSPNet: Towards Slimmable Pretrained Networks based on Discriminative Self-supervised Learning
Shaoru Wang
Zeming Li
Jin Gao
Liang Li
Weiming Hu
66
0
0
13 Jul 2022
Towards understanding how momentum improves generalization in deep learning
Samy Jelassi
Yuanzhi Li
ODL
MLT
AI4CE
90
38
0
13 Jul 2022
Long-term Leap Attention, Short-term Periodic Shift for Video Classification
Huatian Zhang
Lechao Cheng
Y. Hao
Chong-Wah Ngo
ViT
77
10
0
12 Jul 2022
Instance Shadow Detection with A Single-Stage Detector
Tianyu Wang
Xiaowei Hu
Pheng-Ann Heng
Chi-Wing Fu
85
28
0
11 Jul 2022
Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments
Daniel Seichter
Söhnke Benedikt Fischedick
Mona Köhler
H. Groß
89
40
0
10 Jul 2022
A Mask Attention Interaction and Scale Enhancement Network for SAR Ship Instance Segmentation
Tianwen Zhang
Xiaoling Zhang
58
54
0
08 Jul 2022
An Embedding-Dynamic Approach to Self-supervised Learning
Suhong Moon
Domas Buracas
Seunghyun Park
Jinkyu Kim
John F. Canny
OCL
126
4
0
07 Jul 2022
PoF: Post-Training of Feature Extractor for Improving Generalization
Ikuro Sato
Ryota Yamada
Masayuki Tanaka
Nakamasa Inoue
Rei Kawakami
39
4
0
05 Jul 2022
End-to-end Learning for Image-based Detection of Molecular Alterations in Digital Pathology
Marvin Teichmann
A. Aichert
H. Bohnenberger
P. Ströbel
T. Heimann
MedIm
18
3
0
30 Jun 2022
On-Device Training Under 256KB Memory
Ji Lin
Ligeng Zhu
Wei-Ming Chen
Wei-Chen Wang
Chuang Gan
Song Han
MQ
144
213
0
30 Jun 2022
Scalable K-FAC Training for Deep Neural Networks with Distributed Preconditioning
Lin Zhang
Shaoshuai Shi
Wei Wang
Yue Liu
70
10
0
30 Jun 2022
Exploring Temporally Dynamic Data Augmentation for Video Recognition
Taeoh Kim
Jinhyung Kim
Minho Shim
Sangdoo Yun
Myunggu Kang
Dongyoon Wee
Sangyoun Lee
AI4TS
118
10
0
30 Jun 2022
QTI Submission to DCASE 2021: residual normalization for device-imbalanced acoustic scene classification with efficient design
Byeonggeun Kim
Seunghan Yang
Jangho Kim
Simyung Chang
81
58
0
28 Jun 2022
Studying Generalization Through Data Averaging
C. Gomez-Uribe
FedML
138
0
0
28 Jun 2022
AutoInit: Automatic Initialization via Jacobian Tuning
Tianyu He
Darshil Doshi
Andrey Gromov
68
4
0
27 Jun 2022
Zero Stability Well Predicts Performance of Convolutional Neural Networks
Liangming Chen
Long Jin
Mingsheng Shang
MLT
79
8
0
27 Jun 2022
Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
Zhan Chen
Sicheng Li
Bing Yang
Qinghan Li
Hong Liu
79
268
0
27 Jun 2022
p-Meta: Towards On-device Deep Model Adaptation
Zhongnan Qu
Zimu Zhou
Yongxin Tong
Lothar Thiele
75
13
0
25 Jun 2022
Domain Generalization with Relaxed Instance Frequency-wise Normalization for Multi-device Acoustic Scene Classification
Byeonggeun Kim
Seunghan Yang
Jangho Kim
Hyunsin Park
Juntae Lee
Simyung Chang
97
29
0
24 Jun 2022
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta
Stephen Tian
Yunzhi Zhang
Jiajun Wu
Roberto Martín-Martín
Li Fei-Fei
190
121
0
23 Jun 2022
TiCo: Transformation Invariance and Covariance Contrast for Self-Supervised Visual Representation Learning
Jiachen Zhu
R. M. Moraes
Serkan Karakulak
Vlad Sobol
A. Canziani
Yann LeCun
SSL
61
22
0
21 Jun 2022
On the Maximum Hessian Eigenvalue and Generalization
Simran Kaur
Jérémy E. Cohen
Zachary Chase Lipton
115
43
0
21 Jun 2022
Shifted Compression Framework: Generalizations and Improvements
Egor Shulgin
Peter Richtárik
63
6
0
21 Jun 2022
An Efficient Industrial Federated Learning Framework for AIoT: A Face Recognition Application
Youlong Ding
Xueyang Wu
Zhitao Li
Zeheng Wu
S. Tan
Qian Xu
Weike Pan
Qiang Yang
FedML
70
4
0
21 Jun 2022
Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS
K. Udagawa
Yuki Saito
Hiroshi Saruwatari
28
6
0
21 Jun 2022
SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders
Gang Li
Heliang Zheng
Daqing Liu
Chaoyue Wang
Fuchun Sun
Changwen Zheng
119
130
0
21 Jun 2022
Out-of-distribution Detection by Cross-class Vicinity Distribution of In-distribution Data
Zhilin Zhao
LongBing Cao
Kun-Yu Lin
OOD
48
2
0
19 Jun 2022
Open-Sampling: Exploring Out-of-Distribution data for Re-balancing Long-tailed datasets
Hongxin Wei
Lue Tao
Renchunzi Xie
Lei Feng
Bo An
OODD
64
39
0
17 Jun 2022
The State of Sparse Training in Deep Reinforcement Learning
L. Graesser
Utku Evci
Erich Elsen
Pablo Samuel Castro
OffRL
75
40
0
17 Jun 2022
iBoot: Image-bootstrapped Self-Supervised Video Representation Learning
F. Saleh
Fuwen Tan
Adrian Bulat
Georgios Tzimiropoulos
Brais Martínez
SSL
99
1
0
16 Jun 2022
Adapting Self-Supervised Vision Transformers by Probing Attention-Conditioned Masking Consistency
Viraj Prabhu
Sriram Yenamandra
Aaditya K. Singh
Judy Hoffman
69
15
0
16 Jun 2022
Patch-level Representation Learning for Self-supervised Vision Transformers
Sukmin Yun
Hankook Lee
Jaehyung Kim
Jinwoo Shin
ViT
120
68
0
16 Jun 2022
Identifying Electrocardiogram Abnormalities Using a Handcrafted-Rule-Enhanced Neural Network
Yu Bian
Jintai Chen
Xiaojun Chen
Xiaoxian Yang
Da Chen
Jian Wu
48
10
0
16 Jun 2022
Write and Paint: Generative Vision-Language Models are Unified Modal Learners
Shizhe Diao
Wangchunshu Zhou
Xinsong Zhang
Jiawei Wang
MLLM
AI4CE
97
17
0
15 Jun 2022
A Simple Data Mixing Prior for Improving Self-Supervised Learning
Sucheng Ren
Huiyu Wang
Zhengqi Gao
Shengfeng He
Alan Yuille
Yuyin Zhou
Cihang Xie
51
35
0
15 Jun 2022
Asynchronous SGD Beats Minibatch SGD Under Arbitrary Delays
Konstantin Mishchenko
Francis R. Bach
Mathieu Even
Blake E. Woodworth
83
61
0
15 Jun 2022
Self-Supervised Representation Learning With MUlti-Segmental Informational Coding (MUSIC)
Chuang Niu
Ge Wang
SSL
62
6
0
13 Jun 2022
Distributed Adversarial Training to Robustify Deep Neural Networks at Scale
Gaoyuan Zhang
Songtao Lu
Yihua Zhang
Xiangyi Chen
Pin-Yu Chen
Quanfu Fan
Lee Martie
L. Horesh
Min-Fong Hong
Sijia Liu
OOD
73
12
0
13 Jun 2022
Towards Understanding Sharpness-Aware Minimization
Maksym Andriushchenko
Nicolas Flammarion
AAML
124
142
0
13 Jun 2022
2nd Place Solution for ICCV 2021 VIPriors Image Classification Challenge: An Attract-and-Repulse Learning Approach
Yilu Guo
Shicai Yang
Weijie Chen
Liang Ma
Di Xie
Shiliang Pu
65
1
0
13 Jun 2022
Modeling the Machine Learning Multiverse
Samuel J. Bell
Onno P. Kampman
Jesse Dodge
Neil D. Lawrence
80
18
0
13 Jun 2022
Anchor Sampling for Federated Learning with Partial Client Participation
Feijie Wu
Song Guo
Zhihao Qu
Shiqi He
Ziming Liu
Jing Gao
FedML
83
14
0
13 Jun 2022
MLLess: Achieving Cost Efficiency in Serverless Machine Learning Training
Pablo Gimeno Sarroca
Marc Sánchez Artigas
55
16
0
12 Jun 2022
Narrowing the Gap: Improved Detector Training with Noisy Location Annotations
Shaoru Wang
Jin Gao
Bing Li
Weiming Hu
ObjD
NoLa
73
9
0
12 Jun 2022
Learning Imbalanced Datasets with Maximum Margin Loss
Haeyong Kang
Thang Vu
Chang D. Yoo
118
18
0
11 Jun 2022
Previous
1
2
3
...
13
14
15
...
40
41
42
Next