ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1406.2572
  4. Cited By
Identifying and attacking the saddle point problem in high-dimensional
  non-convex optimization

Identifying and attacking the saddle point problem in high-dimensional non-convex optimization

10 June 2014
Yann N. Dauphin
Razvan Pascanu
Çağlar Gülçehre
Kyunghyun Cho
Surya Ganguli
Yoshua Bengio
    ODL
ArXivPDFHTML

Papers citing "Identifying and attacking the saddle point problem in high-dimensional non-convex optimization"

33 / 233 papers shown
Title
Convexified Convolutional Neural Networks
Convexified Convolutional Neural Networks
Yuchen Zhang
Percy Liang
Martin J. Wainwright
26
64
0
04 Sep 2016
Mollifying Networks
Mollifying Networks
Çağlar Gülçehre
Marcin Moczulski
Francesco Visin
Yoshua Bengio
23
46
0
17 Aug 2016
TerpreT: A Probabilistic Programming Language for Program Induction
TerpreT: A Probabilistic Programming Language for Program Induction
Alexander L. Gaunt
Marc Brockschmidt
Rishabh Singh
Nate Kushman
Pushmeet Kohli
Jonathan Taylor
Daniel Tarlow
35
123
0
15 Aug 2016
Convolutional Neural Networks Analyzed via Convolutional Sparse Coding
Convolutional Neural Networks Analyzed via Convolutional Sparse Coding
Vardan Papyan
Yaniv Romano
Michael Elad
59
284
0
27 Jul 2016
On the Expressive Power of Deep Neural Networks
On the Expressive Power of Deep Neural Networks
M. Raghu
Ben Poole
Jon M. Kleinberg
Surya Ganguli
Jascha Narain Sohl-Dickstein
29
777
0
16 Jun 2016
Optimization Methods for Large-Scale Machine Learning
Optimization Methods for Large-Scale Machine Learning
Léon Bottou
Frank E. Curtis
J. Nocedal
96
3,176
0
15 Jun 2016
Discovery of Latent Factors in High-dimensional Data Using Tensor
  Methods
Discovery of Latent Factors in High-dimensional Data Using Tensor Methods
Furong Huang
30
6
0
10 Jun 2016
CaMKII activation supports reward-based neural network optimization
  through Hamiltonian sampling
CaMKII activation supports reward-based neural network optimization through Hamiltonian sampling
Zhaofei Yu
David Kappel
Robert Legenstein
Sen Song
Feng Chen
Wolfgang Maass
23
1
0
01 Jun 2016
No bad local minima: Data independent training error guarantees for
  multilayer neural networks
No bad local minima: Data independent training error guarantees for multilayer neural networks
Daniel Soudry
Y. Carmon
19
235
0
26 May 2016
Deep Learning without Poor Local Minima
Deep Learning without Poor Local Minima
Kenji Kawaguchi
ODL
24
917
0
23 May 2016
Training Neural Networks Without Gradients: A Scalable ADMM Approach
Training Neural Networks Without Gradients: A Scalable ADMM Approach
Gavin Taylor
R. Burmeister
Zheng Xu
Bharat Singh
Ankit B. Patel
Tom Goldstein
ODL
18
272
0
06 May 2016
Deep Learning in Bioinformatics
Deep Learning in Bioinformatics
Seonwoo Min
Byunghan Lee
Sungroh Yoon
AI4CE
3DV
36
1,351
0
21 Mar 2016
DeepSpark: A Spark-Based Distributed Deep Learning Framework for
  Commodity Clusters
DeepSpark: A Spark-Based Distributed Deep Learning Framework for Commodity Clusters
Hanjoo Kim
Jaehong Park
Jaehee Jang
Sungroh Yoon
BDL
32
37
0
26 Feb 2016
Stuck in a What? Adventures in Weight Space
Stuck in a What? Adventures in Weight Space
Zachary Chase Lipton
18
18
0
23 Feb 2016
Communication-Efficient Learning of Deep Networks from Decentralized
  Data
Communication-Efficient Learning of Deep Networks from Decentralized Data
H. B. McMahan
Eider Moore
Daniel Ramage
S. Hampson
Blaise Agüera y Arcas
FedML
26
17,032
0
17 Feb 2016
Gradient Descent Converges to Minimizers
Gradient Descent Converges to Minimizers
J. Lee
Max Simchowitz
Michael I. Jordan
Benjamin Recht
32
212
0
16 Feb 2016
Preconditioned Stochastic Gradient Langevin Dynamics for Deep Neural
  Networks
Preconditioned Stochastic Gradient Langevin Dynamics for Deep Neural Networks
Chunyuan Li
Changyou Chen
David Carlson
Lawrence Carin
ODL
BDL
27
320
0
23 Dec 2015
On the energy landscape of deep networks
On the energy landscape of deep networks
Pratik Chaudhari
Stefano Soatto
ODL
40
27
0
20 Nov 2015
Online Batch Selection for Faster Training of Neural Networks
Online Batch Selection for Faster Training of Neural Networks
I. Loshchilov
Frank Hutter
ODL
37
296
0
19 Nov 2015
On the interplay of network structure and gradient convergence in deep
  learning
On the interplay of network structure and gradient convergence in deep learning
V. Ithapu
Sathya Ravi
Vikas Singh
23
3
0
17 Nov 2015
On the Quality of the Initial Basin in Overspecified Neural Networks
On the Quality of the Initial Basin in Overspecified Neural Networks
Itay Safran
Ohad Shamir
19
127
0
13 Nov 2015
When Are Nonconvex Problems Not Scary?
When Are Nonconvex Problems Not Scary?
Ju Sun
Qing Qu
John N. Wright
24
166
0
21 Oct 2015
$\ell_1$-regularized Neural Networks are Improperly Learnable in
  Polynomial Time
ℓ1\ell_1ℓ1​-regularized Neural Networks are Improperly Learnable in Polynomial Time
Yuchen Zhang
J. Lee
Michael I. Jordan
30
101
0
13 Oct 2015
A Primer on Neural Network Models for Natural Language Processing
A Primer on Neural Network Models for Natural Language Processing
Yoav Goldberg
AI4CE
50
1,128
0
02 Oct 2015
What is Holding Back Convnets for Detection?
What is Holding Back Convnets for Detection?
Bojan Pepik
Rodrigo Benenson
Tobias Ritschel
Bernt Schiele
ObjD
24
64
0
12 Aug 2015
Training Very Deep Networks
Training Very Deep Networks
R. Srivastava
Klaus Greff
Jürgen Schmidhuber
21
1,671
0
22 Jul 2015
Adaptive Normalized Risk-Averting Training For Deep Neural Networks
Adaptive Normalized Risk-Averting Training For Deep Neural Networks
Zhiguang Wang
Tim Oates
J. Lo
33
6
0
08 Jun 2015
Deep Neural Networks with Random Gaussian Weights: A Universal
  Classification Strategy?
Deep Neural Networks with Random Gaussian Weights: A Universal Classification Strategy?
Raja Giryes
Guillermo Sapiro
A. Bronstein
43
187
0
30 Apr 2015
Escaping From Saddle Points --- Online Stochastic Gradient for Tensor
  Decomposition
Escaping From Saddle Points --- Online Stochastic Gradient for Tensor Decomposition
Rong Ge
Furong Huang
Chi Jin
Yang Yuan
58
1,056
0
06 Mar 2015
Explorations on high dimensional landscapes
Explorations on high dimensional landscapes
Levent Sagun
V. U. Güney
Gerard Ben Arous
Yann LeCun
24
65
0
20 Dec 2014
On the Stability of Deep Networks
On the Stability of Deep Networks
Raja Giryes
Guillermo Sapiro
A. Bronstein
48
14
0
18 Dec 2014
The Loss Surfaces of Multilayer Networks
The Loss Surfaces of Multilayer Networks
A. Choromańska
Mikael Henaff
Michaël Mathieu
Gerard Ben Arous
Yann LeCun
ODL
183
1,185
0
30 Nov 2014
Fast, Robust and Non-convex Subspace Recovery
Fast, Robust and Non-convex Subspace Recovery
Gilad Lerman
Tyler Maunu
38
77
0
24 Jun 2014
Previous
12345