ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2207.10430
  4. Cited By
The Neural Race Reduction: Dynamics of Abstraction in Gated Networks

The Neural Race Reduction: Dynamics of Abstraction in Gated Networks

21 July 2022
Andrew M. Saxe
Shagun Sodhani
Sam Lewallen
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "The Neural Race Reduction: Dynamics of Abstraction in Gated Networks"

24 / 74 papers shown
Title
Neural Tangent Kernel: Convergence and Generalization in Neural Networks
Neural Tangent Kernel: Convergence and Generalization in Neural Networks
Arthur Jacot
Franck Gabriel
Clément Hongler
273
3,223
0
20 Jun 2018
Implicit Bias of Gradient Descent on Linear Convolutional Networks
Implicit Bias of Gradient Descent on Linear Convolutional Networks
Suriya Gunasekar
Jason D. Lee
Daniel Soudry
Nathan Srebro
MDE
124
414
0
01 Jun 2018
A Mean Field View of the Landscape of Two-Layers Neural Networks
A Mean Field View of the Landscape of Two-Layers Neural Networks
Song Mei
Andrea Montanari
Phan-Minh Nguyen
MLT
105
862
0
18 Apr 2018
End-to-End Multi-Task Learning with Attention
End-to-End Multi-Task Learning with Attention
Shikun Liu
Edward Johns
Andrew J. Davison
CVBM
61
1,056
0
28 Mar 2018
On the Optimization of Deep Networks: Implicit Acceleration by
  Overparameterization
On the Optimization of Deep Networks: Implicit Acceleration by Overparameterization
Sanjeev Arora
Nadav Cohen
Elad Hazan
108
487
0
19 Feb 2018
Deep linear neural networks with arbitrary loss: All local minima are
  global
Deep linear neural networks with arbitrary loss: All local minima are global
T. Laurent
J. V. Brecht
ODL
76
137
0
05 Dec 2017
An Overview of Multi-Task Learning in Deep Neural Networks
An Overview of Multi-Task Learning in Deep Neural Networks
Sebastian Ruder
CVBM
159
2,831
0
15 Jun 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
795
132,454
0
12 Jun 2017
A simple neural network module for relational reasoning
A simple neural network module for relational reasoning
Adam Santoro
David Raposo
David Barrett
Mateusz Malinowski
Razvan Pascanu
Peter W. Battaglia
Timothy Lillicrap
GNNNAI
189
1,615
0
05 Jun 2017
Hard Mixtures of Experts for Large Scale Weakly Supervised Vision
Hard Mixtures of Experts for Large Scale Weakly Supervised Vision
Sam Gross
MarcÁurelio Ranzato
Arthur Szlam
MoE
63
102
0
20 Apr 2017
Outrageously Large Neural Networks: The Sparsely-Gated
  Mixture-of-Experts Layer
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer
Noam M. Shazeer
Azalia Mirhoseini
Krzysztof Maziarz
Andy Davis
Quoc V. Le
Geoffrey E. Hinton
J. Dean
MoE
253
2,692
0
23 Jan 2017
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary
  Visual Reasoning
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
Justin Johnson
B. Hariharan
Laurens van der Maaten
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
CoGe
316
2,391
0
20 Dec 2016
Modular Multitask Reinforcement Learning with Policy Sketches
Modular Multitask Reinforcement Learning with Policy Sketches
Jacob Andreas
Dan Klein
Sergey Levine
OffRL
171
463
0
06 Nov 2016
Tensor Switching Networks
Tensor Switching Networks
Chuan-Yung Tsai
Andrew M. Saxe
David D. Cox
40
10
0
31 Oct 2016
Opponent Modeling in Deep Reinforcement Learning
Opponent Modeling in Deep Reinforcement Learning
He He
Jordan L. Boyd-Graber
Kevin Kwok
Hal Daumé III
BDL
79
327
0
18 Sep 2016
UberNet: Training a `Universal' Convolutional Neural Network for Low-,
  Mid-, and High-Level Vision using Diverse Datasets and Limited Memory
UberNet: Training a `Universal' Convolutional Neural Network for Low-, Mid-, and High-Level Vision using Diverse Datasets and Limited Memory
Iasonas Kokkinos
SSegSSL
148
677
0
07 Sep 2016
On the Expressive Power of Deep Neural Networks
On the Expressive Power of Deep Neural Networks
M. Raghu
Ben Poole
Jon M. Kleinberg
Surya Ganguli
Jascha Narain Sohl-Dickstein
76
790
0
16 Jun 2016
Wide Residual Networks
Wide Residual Networks
Sergey Zagoruyko
N. Komodakis
353
8,002
0
23 May 2016
Network of Experts for Large-Scale Image Categorization
Network of Experts for Large-Scale Image Categorization
Karim Ahmed
M. H. Baig
Lorenzo Torresani
87
126
0
20 Apr 2016
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,510
0
10 Dec 2015
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
...
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
141
2,975
0
08 Dec 2015
A Probabilistic Theory of Deep Learning
A Probabilistic Theory of Deep Learning
Ankit B. Patel
M. T. Nguyen
Richard G. Baraniuk
BDLOODUQCV
85
89
0
02 Apr 2015
Exact solutions to the nonlinear dynamics of learning in deep linear
  neural networks
Exact solutions to the nonlinear dynamics of learning in deep linear neural networks
Andrew M. Saxe
James L. McClelland
Surya Ganguli
ODL
188
1,852
0
20 Dec 2013
Natural Language Processing (almost) from Scratch
Natural Language Processing (almost) from Scratch
R. Collobert
Jason Weston
Léon Bottou
Michael Karlen
Koray Kavukcuoglu
Pavel P. Kuksa
203
7,729
0
02 Mar 2011
Previous
12