1706.02677
Cited By
v1
v2 (latest)
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
8 June 2017
Priya Goyal
Piotr Dollár
Ross B. Girshick
P. Noordhuis
Lukasz Wesolowski
Aapo Kyrola
Andrew Tulloch
Yangqing Jia
Kaiming He
3DH
Papers citing
"Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour"
50 / 2,054 papers shown
Title
Efficient Progressive High Dynamic Range Image Restoration via Attention and Alignment Network
G. Yu
Jin Zhang
Zhe Ma
Hongbin Wang
63
7
0
20 Apr 2022
The Devil is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-Training
Hao Liu
Xinghua Jiang
Xin Li
Antai Guo
Deqiang Jiang
Bo Ren
88
39
0
18 Apr 2022
Data-heterogeneity-aware Mixing for Decentralized Learning
Yatin Dandi
Anastasia Koloskova
Martin Jaggi
Sebastian U. Stich
88
19
0
13 Apr 2022
Distributionally Robust Models with Parametric Likelihood Ratios
Paul Michel
Tatsunori Hashimoto
Graham Neubig
OOD
91
18
0
13 Apr 2022
CowClip: Reducing CTR Prediction Model Training Time from 12 hours to 10 minutes on 1 GPU
Zangwei Zheng
Peng Xu
Xuan Zou
Da Tang
Zhen Li
...
Xiangzhuo Ding
Fuzhao Xue
Ziheng Qing
Youlong Cheng
Yang You
VLM
86
7
0
13 Apr 2022
Few-shot Forgery Detection via Guided Adversarial Interpolation
Haonan Qiu
Siyu Chen
Bei Gan
Kunze Wang
Huafeng Shi
Jing Shao
Ziwei Liu
AAML
118
6
0
12 Apr 2022
Towards Open-Set Object Detection and Discovery
Jiyang Zheng
Weihao Li
Jie Hong
L. Petersson
Nick Barnes
ObjD
96
67
0
12 Apr 2022
An Empirical Study of End-to-End Temporal Action Detection
Xiaolong Liu
S. Bai
Xiang Bai
92
59
0
06 Apr 2022
Complex-Valued Autoencoders for Object Discovery
Sindy Löwe
Phillip Lippe
Maja R. Rudolph
Max Welling
BDL
OCL
173
39
0
05 Apr 2022
Joint Hand Motion and Interaction Hotspots Prediction from Egocentric Videos
Shao-Wei Liu
Subarna Tripathi
Somdeb Majumdar
Xiaolong Wang
EgoV
112
97
0
04 Apr 2022
MultiMAE: Multi-modal Multi-task Masked Autoencoders
Roman Bachmann
David Mizrahi
Andrei Atanov
Amir Zamir
146
281
0
04 Apr 2022
Question-Driven Graph Fusion Network For Visual Question Answering
Yuxi Qian
Yuncong Hu
Ruonan Wang
Fangxiang Feng
Xiaojie Wang
GNN
138
10
0
03 Apr 2022
Co-VQA : Answering by Interactive Sub Question Sequence
Ruonan Wang
Yuxi Qian
Fangxiang Feng
Xiaojie Wang
Huixing Jiang
LRM
75
17
0
02 Apr 2022
On the Importance of Asymmetry for Siamese Representation Learning
Tianlin Li
Haoqi Fan
Yuandong Tian
Daisuke Kihara
Xinlei Chen
SSL
128
52
0
01 Apr 2022
NC-DRE: Leveraging Non-entity Clue Information for Document-level Relation Extraction
Li Zhang
Yidong Cheng
63
2
0
01 Apr 2022
GALA: Toward Geometry-and-Lighting-Aware Object Search for Compositing
Sijie Zhu
Zhe Lin
Scott D. Cohen
Jason Kuen
Zhifei Zhang
Chen Chen
44
6
0
31 Mar 2022
Exploring Plain Vision Transformer Backbones for Object Detection
Yanghao Li
Hanzi Mao
Ross B. Girshick
Kaiming He
ViT
108
820
0
30 Mar 2022
Dual Temperature Helps Contrastive Learning Without Many Negative Samples: Towards Understanding and Simplifying MoCo
Chaoning Zhang
Kang Zhang
T. Pham
Axi Niu
Zhinan Qiao
Chang D. Yoo
In So Kweon
117
57
0
30 Mar 2022
PP-YOLOE: An evolved version of YOLO
Shangliang Xu
Xinxin Wang
Wenyu Lv
Qinyao Chang
Cheng Cui
...
Guanzhong Wang
Qingqing Dang
Shengyun Wei
Yuning Du
Baohua Lai
ObjD
120
277
0
30 Mar 2022
Neural Inertial Localization
Sachini Herath
David Caruso
Chen Liu
Yufan Chen
Yasutaka Furukawa
57
30
0
29 Mar 2022
In-N-Out Generative Learning for Dense Unsupervised Video Segmentation
Xiaomiao Pan
Peike Li
Zongxin Yang
Huiling Zhou
Chang Zhou
Hongxia Yang
Jingren Zhou
Yi Yang
VOS
86
12
0
29 Mar 2022
TGL: A General Framework for Temporal GNN Training on Billion-Scale Graphs
Hongkuan Zhou
Da Zheng
Israt Nisa
Vasileios Ioannidis
Xiang Song
George Karypis
AI4CE
95
89
0
28 Mar 2022
A Densely Connected Criss-Cross Attention Network for Document-level Relation Extraction
Li Zhang
Yidong Cheng
3DV
47
3
0
26 Mar 2022
Locally Asynchronous Stochastic Gradient Descent for Decentralised Deep Learning
Tomer Avidor
Nadav Tal-Israel
28
2
0
24 Mar 2022
Learning Dense Correspondence from Synthetic Environments
Mithun Lal
Anthony Paproki
N. Habili
L. Petersson
Olivier Salvado
Clinton Fookes
3DH
3DV
81
0
0
24 Mar 2022
Real-time Object Detection for Streaming Perception
Jinrong Yang
Songtao Liu
Zeming Li
Xiaoping Li
Jian Sun
105
51
0
23 Mar 2022
Visual Prompt Tuning
Menglin Jia
Luming Tang
Bor-Chun Chen
Claire Cardie
Serge Belongie
Bharath Hariharan
Ser-Nam Lim
VLM
VPVLM
237
1,658
0
23 Mar 2022
FedDC: Federated Learning with Non-IID Data via Local Drift Decoupling and Correction
Liang Gao
Huazhu Fu
Li Li
Yingwen Chen
Minghua Xu
Chengzhong Xu
FedML
117
257
0
22 Mar 2022
Local Stochastic Factored Gradient Descent for Distributed Quantum State Tomography
Junhyung Lyle Kim
Taha Toghani
César A. Uribe
Anastasios Kyrillidis
56
3
0
22 Mar 2022
Dense Siamese Network for Dense Unsupervised Learning
Wenwei Zhang
Jiangmiao Pang
Kai-xiang Chen
Chen Change Loy
70
15
0
21 Mar 2022
Continual Spatio-Temporal Graph Convolutional Networks
Lukas Hedegaard
Negar Heidari
Alexandros Iosifidis
3DH
GNN
54
25
0
21 Mar 2022
A Local Convergence Theory for the Stochastic Gradient Descent Method in Non-Convex Optimization With Non-isolated Local Minima
Tae-Eon Ko
Xiantao Li
72
2
0
21 Mar 2022
Small Batch Sizes Improve Training of Low-Resource Neural MT
Àlex R. Atrio
Andrei Popescu-Belis
64
6
0
20 Mar 2022
Read Top News First: A Document Reordering Approach for Multi-Document News Summarization
Chao Zhao
Tenghao Huang
Somnath Basu Roy Chowdhury
Muthu Kumar Chandrasekaran
Kathleen McKeown
Snigdha Chaturvedi
MoMe
44
17
0
19 Mar 2022
Discovering Objects that Can Move
Zhipeng Bao
P. Tokmakov
Allan Jabri
Yu-Xiong Wang
Adrien Gaidon
M. Hebert
OCL
110
44
0
18 Mar 2022
On the Generalization Mystery in Deep Learning
S. Chatterjee
Piotr Zielinski
OOD
77
35
0
18 Mar 2022
Attribute Surrogates Learning and Spectral Tokens Pooling in Transformers for Few-shot Learning
Yang He
Weihan Liang
Dongyang Zhao
Hong-Yu Zhou
Weifeng Ge
Yizhou Yu
Wenqiang Zhang
ViT
100
46
0
17 Mar 2022
On Redundancy and Diversity in Cell-based Neural Architecture Search
Xingchen Wan
Binxin Ru
Pedro M. Esperança
Zhenguo Li
105
21
0
16 Mar 2022
Hardware Approximate Techniques for Deep Neural Network Accelerators: A Survey
Giorgos Armeniakos
Georgios Zervakis
Dimitrios Soudris
J. Henkel
284
98
0
16 Mar 2022
Towards understanding deep learning with the natural clustering prior
Simon Carbonnelle
54
0
0
15 Mar 2022
Interspace Pruning: Using Adaptive Filter Representations to Improve Training of Sparse CNNs
Paul Wimmer
Jens Mehnert
Alexandru Paul Condurache
CVBM
64
20
0
15 Mar 2022
Scaling the Wild: Decentralizing Hogwild!-style Shared-memory SGD
Bapi Chatterjee
Vyacheslav Kungurtsev
Dan Alistarh
FedML
54
2
0
13 Mar 2022
G$^3$SR: Global Graph Guided Session-based Recommendation
Zhiwei Deng
Changdong Wang
Ling Huang
Jianhuang Lai
Philip S. Yu
72
14
0
12 Mar 2022
GRAND+: Scalable Graph Random Neural Networks
Wenzheng Feng
Yuxiao Dong
Tinglin Huang
Ziqi Yin
Xu Cheng
Evgeny Kharlamov
Jie Tang
GNN
68
43
0
12 Mar 2022
DHEN: A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction
Buyun Zhang
Liangchen Luo
Xi Liu
Jay Li
Zeliang Chen
...
Yasmine Badr
Jongsoo Park
Jiyan Yang
Dheevatsa Mudigere
Ellie Wen
3DV
52
12
0
11 Mar 2022
Masked Visual Pre-training for Motor Control
Tete Xiao
Ilija Radosavovic
Trevor Darrell
Jitendra Malik
SSL
119
250
0
11 Mar 2022
TFCNet: Temporal Fully Connected Networks for Static Unbiased Temporal Reasoning
Shiwen Zhang
AI4TS
91
12
0
11 Mar 2022
Part-level Action Parsing via a Pose-guided Coarse-to-Fine Framework
Xiaodong Chen
Xinchen Liu
Wu Liu
Kun Liu
Dong Wu
Yongdong Zhang
Tao Mei
48
4
0
09 Mar 2022
Data-Efficient and Interpretable Tabular Anomaly Detection
C. Chang
Jinsung Yoon
Sercan O. Arik
Madeleine Udell
Tomas Pfister
56
20
0
03 Mar 2022
A Multimodal German Dataset for Automatic Lip Reading Systems and Transfer Learning
Gerald Schwiebert
C. Weber
Leyuan Qu
Henrique Siqueira
S. Wermter
68
12
0
27 Feb 2022