Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1703.03633
Cited By
v1
v2
v3 (latest)
Learning Gradient Descent: Better Generalization and Longer Horizons
10 March 2017
Kaifeng Lyu
Shunhua Jiang
Jian Li
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Learning Gradient Descent: Better Generalization and Longer Horizons"
11 / 11 papers shown
Title
From Learning to Optimize to Learning Optimization Algorithms
Camille Castera
Peter Ochs
168
1
0
28 May 2024
Learning to reinforcement learn
Jane X. Wang
Z. Kurth-Nelson
Dhruva Tirumala
Hubert Soyer
Joel Z Leibo
Rémi Munos
Charles Blundell
D. Kumaran
M. Botvinick
OffRL
97
983
0
17 Nov 2016
Learning to Learn without Gradient Descent by Gradient Descent
Yutian Chen
Matthew W. Hoffman
Sergio Gomez Colmenarejo
Misha Denil
Timothy Lillicrap
Matt Botvinick
Nando de Freitas
64
42
0
11 Nov 2016
Learning to learn by gradient descent by gradient descent
Marcin Andrychowicz
Misha Denil
Sergio Gomez Colmenarejo
Matthew W. Hoffman
David Pfau
Tom Schaul
Brendan Shillingford
Nando de Freitas
124
2,008
0
14 Jun 2016
Learning to Optimize
Ke Li
Jitendra Malik
63
257
0
06 Jun 2016
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Martín Abadi
Ashish Agarwal
P. Barham
E. Brevdo
Zhiwen Chen
...
Pete Warden
Martin Wattenberg
Martin Wicke
Yuan Yu
Xiaoqiang Zheng
289
11,150
0
14 Mar 2016
Using Deep Q-Learning to Control Optimization Hyperparameters
Samantha Hansen
46
40
0
12 Feb 2016
Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs)
Djork-Arné Clevert
Thomas Unterthiner
Sepp Hochreiter
307
5,536
0
23 Nov 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.1K
150,364
0
22 Dec 2014
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan
Andrew Zisserman
FAtt
MDE
1.7K
100,529
0
04 Sep 2014
ADADELTA: An Adaptive Learning Rate Method
Matthew D. Zeiler
ODL
165
6,632
0
22 Dec 2012
1