Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

8 June 2017
Priya Goyal
Piotr Dollár
Ross B. Girshick
P. Noordhuis
Lukasz Wesolowski
Aapo Kyrola
Andrew Tulloch
Yangqing Jia
Kaiming He
    3DH

Papers citing "Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour"

50 / 2,054 papers shown
Efficient Progressive High Dynamic Range Image Restoration via Attention and Alignment Network
G. Yu
Jin Zhang
Zhe Ma
Hongbin Wang
63
7
0
20 Apr 2022
The Devil is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-Training
Hao Liu
Xinghua Jiang
Xin Li
Antai Guo
Deqiang Jiang
Bo Ren
88
39
0
18 Apr 2022
Data-heterogeneity-aware Mixing for Decentralized Learning
Yatin Dandi
Anastasia Koloskova
Martin Jaggi
Sebastian U. Stich
88
19
0
13 Apr 2022
Distributionally Robust Models with Parametric Likelihood Ratios
Paul Michel
Tatsunori Hashimoto
Graham Neubig
OOD
91
18
0
13 Apr 2022
CowClip: Reducing CTR Prediction Model Training Time from 12 hours to 10 minutes on 1 GPU
Zangwei Zheng
Peng Xu
Xuan Zou
Da Tang
Zhen Li
...
Xiangzhuo Ding
Fuzhao Xue
Ziheng Qing
Youlong Cheng
Yang You
VLM
86
7
0
13 Apr 2022
Few-shot Forgery Detection via Guided Adversarial Interpolation
Haonan Qiu
Siyu Chen
Bei Gan
Kunze Wang
Huafeng Shi
Jing Shao
Ziwei Liu
AAML
118
6
0
12 Apr 2022
Towards Open-Set Object Detection and Discovery
Jiyang Zheng
Weihao Li
Jie Hong
L. Petersson
Nick Barnes
ObjD
96
67
0
12 Apr 2022
An Empirical Study of End-to-End Temporal Action Detection
Xiaolong Liu
S. Bai
Xiang Bai
92
59
0
06 Apr 2022
Complex-Valued Autoencoders for Object Discovery
Sindy Löwe
Phillip Lippe
Maja R. Rudolph
Max Welling
BDL
OCL
173
39
0
05 Apr 2022
Joint Hand Motion and Interaction Hotspots Prediction from Egocentric Videos
Shao-Wei Liu
Subarna Tripathi
Somdeb Majumdar
Xiaolong Wang
EgoV
112
97
0
04 Apr 2022
MultiMAE: Multi-modal Multi-task Masked Autoencoders
Roman Bachmann
David Mizrahi
Andrei Atanov
Amir Zamir
146
281
0
04 Apr 2022
Question-Driven Graph Fusion Network For Visual Question Answering
Yuxi Qian
Yuncong Hu
Ruonan Wang
Fangxiang Feng
Xiaojie Wang
GNN
138
10
0
03 Apr 2022
Co-VQA : Answering by Interactive Sub Question Sequence
Ruonan Wang
Yuxi Qian
Fangxiang Feng
Xiaojie Wang
Huixing Jiang
LRM
75
17
0
02 Apr 2022
On the Importance of Asymmetry for Siamese Representation Learning
Tianlin Li
Haoqi Fan
Yuandong Tian
Daisuke Kihara
Xinlei Chen
SSL
128
52
0
01 Apr 2022
NC-DRE: Leveraging Non-entity Clue Information for Document-level Relation Extraction
Li Zhang
Yidong Cheng
63
2
0
01 Apr 2022
GALA: Toward Geometry-and-Lighting-Aware Object Search for Compositing
Sijie Zhu
Zhe Lin
Scott D. Cohen
Jason Kuen
Zhifei Zhang
Chen Chen
44
6
0
31 Mar 2022
Exploring Plain Vision Transformer Backbones for Object Detection
Yanghao Li
Hanzi Mao
Ross B. Girshick
Kaiming He
ViT
108
820
0
30 Mar 2022
Dual Temperature Helps Contrastive Learning Without Many Negative Samples: Towards Understanding and Simplifying MoCo
Chaoning Zhang
Kang Zhang
T. Pham
Axi Niu
Zhinan Qiao
Chang D. Yoo
In So Kweon
117
57
0
30 Mar 2022
PP-YOLOE: An evolved version of YOLO
Shangliang Xu
Xinxin Wang
Wenyu Lv
Qinyao Chang
Cheng Cui
...
Guanzhong Wang
Qingqing Dang
Shengyun Wei
Yuning Du
Baohua Lai
ObjD
120
277
0
30 Mar 2022
Neural Inertial Localization
Sachini Herath
David Caruso
Chen Liu
Yufan Chen
Yasutaka Furukawa
57
30
0
29 Mar 2022
In-N-Out Generative Learning for Dense Unsupervised Video Segmentation
Xiaomiao Pan
Peike Li
Zongxin Yang
Huiling Zhou
Chang Zhou
Hongxia Yang
Jingren Zhou
Yi Yang
VOS
86
12
0
29 Mar 2022
TGL: A General Framework for Temporal GNN Training on Billion-Scale Graphs
Hongkuan Zhou
Da Zheng
Israt Nisa
Vasileios Ioannidis
Xiang Song
George Karypis
AI4CE
95
89
0
28 Mar 2022
A Densely Connected Criss-Cross Attention Network for Document-level Relation Extraction
Li Zhang
Yidong Cheng
3DV
47
3
0
26 Mar 2022
Locally Asynchronous Stochastic Gradient Descent for Decentralised Deep Learning
Tomer Avidor
Nadav Tal-Israel
28
2
0
24 Mar 2022
Learning Dense Correspondence from Synthetic Environments
Mithun Lal
Anthony Paproki
N. Habili
L. Petersson
Olivier Salvado
Clinton Fookes
3DH
3DV
81
0
0
24 Mar 2022
Real-time Object Detection for Streaming Perception
Jinrong Yang
Songtao Liu
Zeming Li
Xiaoping Li
Jian Sun
105
51
0
23 Mar 2022
Visual Prompt Tuning
Menglin Jia
Luming Tang
Bor-Chun Chen
Claire Cardie
Serge Belongie
Bharath Hariharan
Ser-Nam Lim
VLM
VPVLM
237
1,658
0
23 Mar 2022
FedDC: Federated Learning with Non-IID Data via Local Drift Decoupling and Correction
Liang Gao
Huazhu Fu
Li Li
Yingwen Chen
Minghua Xu
Chengzhong Xu
FedML
117
257
0
22 Mar 2022
Local Stochastic Factored Gradient Descent for Distributed Quantum State Tomography
Junhyung Lyle Kim
Taha Toghani
César A. Uribe
Anastasios Kyrillidis
56
3
0
22 Mar 2022
Dense Siamese Network for Dense Unsupervised Learning
Wenwei Zhang
Jiangmiao Pang
Kai-xiang Chen
Chen Change Loy
70
15
0
21 Mar 2022
Continual Spatio-Temporal Graph Convolutional Networks
Lukas Hedegaard
Negar Heidari
Alexandros Iosifidis
3DH
GNN
54
25
0
21 Mar 2022
A Local Convergence Theory for the Stochastic Gradient Descent Method in Non-Convex Optimization With Non-isolated Local Minima
Tae-Eon Ko
Xiantao Li
72
2
0
21 Mar 2022
Small Batch Sizes Improve Training of Low-Resource Neural MT
Àlex R. Atrio
Andrei Popescu-Belis
64
6
0
20 Mar 2022
Read Top News First: A Document Reordering Approach for Multi-Document News Summarization
Chao Zhao
Tenghao Huang
Somnath Basu Roy Chowdhury
Muthu Kumar Chandrasekaran
Kathleen McKeown
Snigdha Chaturvedi
MoMe
44
17
0
19 Mar 2022
Discovering Objects that Can Move
Zhipeng Bao
P. Tokmakov
Allan Jabri
Yu-Xiong Wang
Adrien Gaidon
M. Hebert
OCL
110
44
0
18 Mar 2022
On the Generalization Mystery in Deep Learning
S. Chatterjee
Piotr Zielinski
OOD
77
35
0
18 Mar 2022
Attribute Surrogates Learning and Spectral Tokens Pooling in Transformers for Few-shot Learning
Yang He
Weihan Liang
Dongyang Zhao
Hong-Yu Zhou
Weifeng Ge
Yizhou Yu
Wenqiang Zhang
ViT
100
46
0
17 Mar 2022
On Redundancy and Diversity in Cell-based Neural Architecture Search
Xingchen Wan
Binxin Ru
Pedro M. Esperança
Zhenguo Li
105
21
0
16 Mar 2022
Hardware Approximate Techniques for Deep Neural Network Accelerators: A Survey
Giorgos Armeniakos
Georgios Zervakis
Dimitrios Soudris
J. Henkel
284
98
0
16 Mar 2022
Towards understanding deep learning with the natural clustering prior
Simon Carbonnelle
54
0
0
15 Mar 2022
Interspace Pruning: Using Adaptive Filter Representations to Improve Training of Sparse CNNs
Paul Wimmer
Jens Mehnert
Alexandru Paul Condurache
CVBM
64
20
0
15 Mar 2022
Scaling the Wild: Decentralizing Hogwild!-style Shared-memory SGD
Bapi Chatterjee
Vyacheslav Kungurtsev
Dan Alistarh
FedML
54
2
0
13 Mar 2022
G$^3$SR: Global Graph Guided Session-based Recommendation
Zhiwei Deng
Changdong Wang
Ling Huang
Jianhuang Lai
Philip S. Yu
72
14
0
12 Mar 2022
GRAND+: Scalable Graph Random Neural Networks
Wenzheng Feng
Yuxiao Dong
Tinglin Huang
Ziqi Yin
Xu Cheng
Evgeny Kharlamov
Jie Tang
GNN
68
43
0
12 Mar 2022
DHEN: A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction
Buyun Zhang
Liangchen Luo
Xi Liu
Jay Li
Zeliang Chen
...
Yasmine Badr
Jongsoo Park
Jiyan Yang
Dheevatsa Mudigere
Ellie Wen
3DV
52
12
0
11 Mar 2022
Masked Visual Pre-training for Motor Control
Tete Xiao
Ilija Radosavovic
Trevor Darrell
Jitendra Malik
SSL
119
250
0
11 Mar 2022
TFCNet: Temporal Fully Connected Networks for Static Unbiased Temporal Reasoning
Shiwen Zhang
AI4TS
91
12
0
11 Mar 2022
Part-level Action Parsing via a Pose-guided Coarse-to-Fine Framework
Xiaodong Chen
Xinchen Liu
Wu Liu
Kun Liu
Dong Wu
Yongdong Zhang
Tao Mei
48
4
0
09 Mar 2022
Data-Efficient and Interpretable Tabular Anomaly Detection
C. Chang
Jinsung Yoon
Sercan O. Arik
Madeleine Udell
Tomas Pfister
56
20
0
03 Mar 2022
A Multimodal German Dataset for Automatic Lip Reading Systems and Transfer Learning
Gerald Schwiebert
C. Weber
Leyuan Qu
Henrique Siqueira
S. Wermter
68
12
0
27 Feb 2022