Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.10430
Cited By
The Neural Race Reduction: Dynamics of Abstraction in Gated Networks
21 July 2022
Andrew M. Saxe
Shagun Sodhani
Sam Lewallen
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Neural Race Reduction: Dynamics of Abstraction in Gated Networks"
24 / 74 papers shown
Title
Neural Tangent Kernel: Convergence and Generalization in Neural Networks
Arthur Jacot
Franck Gabriel
Clément Hongler
273
3,223
0
20 Jun 2018
Implicit Bias of Gradient Descent on Linear Convolutional Networks
Suriya Gunasekar
Jason D. Lee
Daniel Soudry
Nathan Srebro
MDE
124
414
0
01 Jun 2018
A Mean Field View of the Landscape of Two-Layers Neural Networks
Song Mei
Andrea Montanari
Phan-Minh Nguyen
MLT
105
862
0
18 Apr 2018
End-to-End Multi-Task Learning with Attention
Shikun Liu
Edward Johns
Andrew J. Davison
CVBM
61
1,056
0
28 Mar 2018
On the Optimization of Deep Networks: Implicit Acceleration by Overparameterization
Sanjeev Arora
Nadav Cohen
Elad Hazan
108
487
0
19 Feb 2018
Deep linear neural networks with arbitrary loss: All local minima are global
T. Laurent
J. V. Brecht
ODL
76
137
0
05 Dec 2017
An Overview of Multi-Task Learning in Deep Neural Networks
Sebastian Ruder
CVBM
159
2,831
0
15 Jun 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
795
132,454
0
12 Jun 2017
A simple neural network module for relational reasoning
Adam Santoro
David Raposo
David Barrett
Mateusz Malinowski
Razvan Pascanu
Peter W. Battaglia
Timothy Lillicrap
GNN
NAI
189
1,615
0
05 Jun 2017
Hard Mixtures of Experts for Large Scale Weakly Supervised Vision
Sam Gross
MarcÁurelio Ranzato
Arthur Szlam
MoE
63
102
0
20 Apr 2017
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer
Noam M. Shazeer
Azalia Mirhoseini
Krzysztof Maziarz
Andy Davis
Quoc V. Le
Geoffrey E. Hinton
J. Dean
MoE
253
2,692
0
23 Jan 2017
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
Justin Johnson
B. Hariharan
Laurens van der Maaten
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
CoGe
316
2,391
0
20 Dec 2016
Modular Multitask Reinforcement Learning with Policy Sketches
Jacob Andreas
Dan Klein
Sergey Levine
OffRL
171
463
0
06 Nov 2016
Tensor Switching Networks
Chuan-Yung Tsai
Andrew M. Saxe
David D. Cox
40
10
0
31 Oct 2016
Opponent Modeling in Deep Reinforcement Learning
He He
Jordan L. Boyd-Graber
Kevin Kwok
Hal Daumé III
BDL
79
327
0
18 Sep 2016
UberNet: Training a `Universal' Convolutional Neural Network for Low-, Mid-, and High-Level Vision using Diverse Datasets and Limited Memory
Iasonas Kokkinos
SSeg
SSL
148
677
0
07 Sep 2016
On the Expressive Power of Deep Neural Networks
M. Raghu
Ben Poole
Jon M. Kleinberg
Surya Ganguli
Jascha Narain Sohl-Dickstein
76
790
0
16 Jun 2016
Wide Residual Networks
Sergey Zagoruyko
N. Komodakis
353
8,002
0
23 May 2016
Network of Experts for Large-Scale Image Categorization
Karim Ahmed
M. H. Baig
Lorenzo Torresani
87
126
0
20 Apr 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,510
0
10 Dec 2015
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
...
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
141
2,975
0
08 Dec 2015
A Probabilistic Theory of Deep Learning
Ankit B. Patel
M. T. Nguyen
Richard G. Baraniuk
BDL
OOD
UQCV
85
89
0
02 Apr 2015
Exact solutions to the nonlinear dynamics of learning in deep linear neural networks
Andrew M. Saxe
James L. McClelland
Surya Ganguli
ODL
188
1,852
0
20 Dec 2013
Natural Language Processing (almost) from Scratch
R. Collobert
Jason Weston
Léon Bottou
Michael Karlen
Koray Kavukcuoglu
Pavel P. Kuksa
203
7,729
0
02 Mar 2011
Previous
1
2