ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.05297
  4. Cited By
On the interplay of network structure and gradient convergence in deep
  learning
v1v2v3v4v5v6v7v8 (latest)

On the interplay of network structure and gradient convergence in deep learning

17 November 2015
V. Ithapu
Sathya Ravi
Vikas Singh
ArXiv (abs)PDFHTML

Papers citing "On the interplay of network structure and gradient convergence in deep learning"

23 / 23 papers shown
Title
Network Morphism
Network Morphism
Tao Wei
Changhu Wang
Y. Rui
Chen Chen
98
177
0
05 Mar 2016
Why are deep nets reversible: A simple theory, with implications for
  training
Why are deep nets reversible: A simple theory, with implications for training
Sanjeev Arora
Yingyu Liang
Tengyu Ma
87
54
0
18 Nov 2015
A Survey: Time Travel in Deep Learning Space: An Introduction to Deep
  Learning Models and How Deep Learning Models Evolved from the Initial Ideas
A Survey: Time Travel in Deep Learning Space: An Introduction to Deep Learning Models and How Deep Learning Models Evolved from the Initial Ideas
Haohan Wang
Bhiksha Raj
AI4TS
71
34
0
16 Oct 2015
Train faster, generalize better: Stability of stochastic gradient
  descent
Train faster, generalize better: Stability of stochastic gradient descent
Moritz Hardt
Benjamin Recht
Y. Singer
120
1,243
0
03 Sep 2015
Constrained Convolutional Neural Networks for Weakly Supervised
  Segmentation
Constrained Convolutional Neural Networks for Weakly Supervised Segmentation
Deepak Pathak
Philipp Krahenbuhl
Trevor Darrell
SSeg
109
614
0
11 Jun 2015
Weight Uncertainty in Neural Networks
Weight Uncertainty in Neural Networks
Charles Blundell
Julien Cornebise
Koray Kavukcuoglu
Daan Wierstra
UQCVBDL
194
1,894
0
20 May 2015
A Probabilistic Theory of Deep Learning
A Probabilistic Theory of Deep Learning
Ankit B. Patel
M. T. Nguyen
Richard G. Baraniuk
BDLOODUQCV
89
89
0
02 Apr 2015
Batch Normalization: Accelerating Deep Network Training by Reducing
  Internal Covariate Shift
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe
Christian Szegedy
OOD
471
43,357
0
11 Feb 2015
Breaking the Curse of Dimensionality with Convex Neural Networks
Breaking the Curse of Dimensionality with Convex Neural Networks
Francis R. Bach
200
706
0
30 Dec 2014
FitNets: Hints for Thin Deep Nets
FitNets: Hints for Thin Deep Nets
Adriana Romero
Nicolas Ballas
Samira Ebrahimi Kahou
Antoine Chassang
C. Gatta
Yoshua Bengio
FedML
332
3,906
0
19 Dec 2014
How transferable are features in deep neural networks?
How transferable are features in deep neural networks?
J. Yosinski
Jeff Clune
Yoshua Bengio
Hod Lipson
OOD
238
8,363
0
06 Nov 2014
On the Computational Efficiency of Training Neural Networks
On the Computational Efficiency of Training Neural Networks
Roi Livni
Shai Shalev-Shwartz
Ohad Shamir
158
480
0
05 Oct 2014
Going Deeper with Convolutions
Going Deeper with Convolutions
Christian Szegedy
Wei Liu
Yangqing Jia
P. Sermanet
Scott E. Reed
Dragomir Anguelov
D. Erhan
Vincent Vanhoucke
Andrew Rabinovich
502
43,717
0
17 Sep 2014
Sequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
450
20,611
0
10 Sep 2014
Identifying and attacking the saddle point problem in high-dimensional
  non-convex optimization
Identifying and attacking the saddle point problem in high-dimensional non-convex optimization
Yann N. Dauphin
Razvan Pascanu
Çağlar Gülçehre
Kyunghyun Cho
Surya Ganguli
Yoshua Bengio
ODL
134
1,394
0
10 Jun 2014
Playing Atari with Deep Reinforcement Learning
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
134
12,272
0
19 Dec 2013
Provable Bounds for Learning Some Deep Representations
Provable Bounds for Learning Some Deep Representations
Sanjeev Arora
Aditya Bhaskara
Rong Ge
Tengyu Ma
BDL
109
335
0
23 Oct 2013
Stochastic First- and Zeroth-order Methods for Nonconvex Stochastic
  Programming
Stochastic First- and Zeroth-order Methods for Nonconvex Stochastic Programming
Saeed Ghadimi
Guanghui Lan
ODL
126
1,562
0
22 Sep 2013
Dropout Training as Adaptive Regularization
Dropout Training as Adaptive Regularization
Stefan Wager
Sida I. Wang
Percy Liang
133
600
0
04 Jul 2013
Maxout Networks
Maxout Networks
Ian Goodfellow
David Warde-Farley
M. Berk Mirza
Aaron Courville
Yoshua Bengio
OOD
283
2,179
0
18 Feb 2013
Deep Learning for Detecting Robotic Grasps
Deep Learning for Detecting Robotic Grasps
Ian Lenz
Honglak Lee
Ashutosh Saxena
122
1,648
0
16 Jan 2013
Representation Learning: A Review and New Perspectives
Representation Learning: A Review and New Perspectives
Yoshua Bengio
Aaron Courville
Pascal Vincent
OODSSL
296
12,467
0
24 Jun 2012
Practical recommendations for gradient-based training of deep
  architectures
Practical recommendations for gradient-based training of deep architectures
Yoshua Bengio
3DHODL
197
2,203
0
24 Jun 2012
1