ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1710.04773
  4. Cited By
Residual Connections Encourage Iterative Inference

Residual Connections Encourage Iterative Inference

13 October 2017
Stanislaw Jastrzebski
Devansh Arpit
Nicolas Ballas
Vikas Verma
Tong Che
Yoshua Bengio
ArXivPDFHTML

Papers citing "Residual Connections Encourage Iterative Inference"

19 / 19 papers shown
Title
Decoding Vision Transformers: the Diffusion Steering Lens
Decoding Vision Transformers: the Diffusion Steering Lens
Ryota Takatsuki
Sonia Joseph
Ippei Fujisawa
Ryota Kanai
DiffM
80
0
0
18 Apr 2025
Shared Global and Local Geometry of Language Model Embeddings
Shared Global and Local Geometry of Language Model Embeddings
Andrew Lee
Melanie Weber
F. Viégas
Martin Wattenberg
FedML
94
6
0
27 Mar 2025
The Geometry of Tokens in Internal Representations of Large Language Models
The Geometry of Tokens in Internal Representations of Large Language Models
Karthik Viswanathan
Yuri Gardinazzi
Giada Panerai
Alberto Cazzaniga
Matteo Biagetti
AIFin
129
7
0
17 Jan 2025
Residual Stream Analysis with Multi-Layer SAEs
Residual Stream Analysis with Multi-Layer SAEs
Tim Lawson
Lucy Farnik
Conor Houghton
Laurence Aitchison
72
5
0
06 Sep 2024
A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models
A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models
Daking Rai
Yilun Zhou
Shi Feng
Abulhair Saparov
Ziyu Yao
147
32
0
02 Jul 2024
Emergence of a High-Dimensional Abstraction Phase in Language Transformers
Emergence of a High-Dimensional Abstraction Phase in Language Transformers
Emily Cheng
Diego Doimo
Corentin Kervadec
Iuri Macocco
Jade Yu
Alessandro Laio
Marco Baroni
131
14
0
24 May 2024
Learning Deep ResNet Blocks Sequentially using Boosting Theory
Learning Deep ResNet Blocks Sequentially using Boosting Theory
Furong Huang
Jordan T. Ash
John Langford
Robert Schapire
58
111
0
15 Jun 2017
Highway and Residual Networks learn Unrolled Iterative Estimation
Highway and Residual Networks learn Unrolled Iterative Estimation
Klaus Greff
R. Srivastava
Jürgen Schmidhuber
AI4TS
98
215
0
22 Dec 2016
The Loss Surface of Residual Networks: Ensembles and the Role of Batch
  Normalization
The Loss Surface of Residual Networks: Ensembles and the Role of Batch Normalization
Etai Littwin
Lior Wolf
UQCV
141
15
0
08 Nov 2016
Residual Networks Behave Like Ensembles of Relatively Shallow Networks
Residual Networks Behave Like Ensembles of Relatively Shallow Networks
Andreas Veit
Michael J. Wilber
Serge J. Belongie
UQCV
66
107
0
20 May 2016
Bridging the Gaps Between Residual Learning, Recurrent Neural Networks
  and Visual Cortex
Bridging the Gaps Between Residual Learning, Recurrent Neural Networks and Visual Cortex
Q. Liao
T. Poggio
242
257
0
13 Apr 2016
Recurrent Batch Normalization
Recurrent Batch Normalization
Tim Cooijmans
Nicolas Ballas
César Laurent
Çağlar Gülçehre
Aaron Courville
ODL
48
410
0
30 Mar 2016
Identity Mappings in Deep Residual Networks
Identity Mappings in Deep Residual Networks
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
354
10,184
0
16 Mar 2016
Normalization Propagation: A Parametric Technique for Removing Internal
  Covariate Shift in Deep Networks
Normalization Propagation: A Parametric Technique for Removing Internal Covariate Shift in Deep Networks
Devansh Arpit
Yingbo Zhou
Bhargava U. Kota
V. Govindaraju
62
127
0
04 Mar 2016
Recurrent Orthogonal Networks and Long-Memory Tasks
Recurrent Orthogonal Networks and Long-Memory Tasks
Mikael Henaff
Arthur Szlam
Yann LeCun
63
133
0
22 Feb 2016
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,020
0
10 Dec 2015
Batch Normalization: Accelerating Deep Network Training by Reducing
  Internal Covariate Shift
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe
Christian Szegedy
OOD
463
43,305
0
11 Feb 2015
Delving Deep into Rectifiers: Surpassing Human-Level Performance on
  ImageNet Classification
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
VLM
323
18,625
0
06 Feb 2015
Very Deep Convolutional Networks for Large-Scale Image Recognition
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan
Andrew Zisserman
FAtt
MDE
1.6K
100,386
0
04 Sep 2014
1