ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.04537
  4. Cited By
Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with
  Recurrent Networks

Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks

8 June 2021
Avi Schwarzschild
Eitan Borgnia
Arjun Gupta
Furong Huang
U. Vishkin
Micah Goldblum
Tom Goldstein
ArXivPDFHTML

Papers citing "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"

50 / 60 papers shown
Title
Fixed-Point RNNs: From Diagonal to Dense in a Few Iterations
Sajad Movahedi
Felix Sarnthein
Nicola Muca Cirone
Antonio Orvieto
48
2
0
13 Mar 2025
Distributional Scaling Laws for Emergent Capabilities
Distributional Scaling Laws for Emergent Capabilities
Rosie Zhao
Tian Qin
David Alvarez-Melis
Sham Kakade
Naomi Saphra
LRM
39
0
0
24 Feb 2025
Reasoning with Latent Thoughts: On the Power of Looped Transformers
Reasoning with Latent Thoughts: On the Power of Looped Transformers
Nikunj Saunshi
Nishanth Dikkala
Zhiyuan Li
Sanjiv Kumar
Sashank J. Reddi
OffRL
LRM
AI4CE
61
10
0
24 Feb 2025
Playing Hex and Counter Wargames using Reinforcement Learning and Recurrent Neural Networks
Playing Hex and Counter Wargames using Reinforcement Learning and Recurrent Neural Networks
Guilherme Palma
P. A. Santos
João Dias
46
0
0
20 Feb 2025
Hyperspherical Energy Transformer with Recurrent Depth
Yunzhe Hu
Difan Zou
Dong Xu
48
0
0
17 Feb 2025
MathGAP: Out-of-Distribution Evaluation on Problems with Arbitrarily Complex Proofs
MathGAP: Out-of-Distribution Evaluation on Problems with Arbitrarily Complex Proofs
Andreas Opedal
Haruki Shirakami
Bernhard Schölkopf
Abulhair Saparov
Mrinmaya Sachan
LRM
57
1
0
17 Feb 2025
Beyond Interpolation: Extrapolative Reasoning with Reinforcement Learning and Graph Neural Networks
Beyond Interpolation: Extrapolative Reasoning with Reinforcement Learning and Graph Neural Networks
Niccolò Grillo
Andrea Toccaceli
Joël Mathys
Benjamin Estermann
Stefania Fresca
Roger Wattenhofer
AI4CE
LRM
104
0
0
06 Feb 2025
Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges
Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges
Nayoung Lee
Ziyang Cai
Avi Schwarzschild
Kangwook Lee
Dimitris Papailiopoulos
ReLM
VLM
LRM
AI4CE
83
4
0
03 Feb 2025
SetLexSem Challenge: Using Set Operations to Evaluate the Lexical and
  Semantic Robustness of Language Models
SetLexSem Challenge: Using Set Operations to Evaluate the Lexical and Semantic Robustness of Language Models
Bardiya Akhbari
Manish Gawali
Nicholas A. Dronen
AAML
37
0
0
11 Nov 2024
Adaptive Length Image Tokenization via Recurrent Allocation
Adaptive Length Image Tokenization via Recurrent Allocation
Shivam Duggal
Phillip Isola
Antonio Torralba
William T. Freeman
VLM
37
5
0
04 Nov 2024
Rethinking Deep Thinking: Stable Learning of Algorithms using Lipschitz
  Constraints
Rethinking Deep Thinking: Stable Learning of Algorithms using Lipschitz Constraints
Jay Bear
Adam Prugel-Bennett
Jonathon S. Hare
29
1
0
30 Oct 2024
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Sangmin Bae
Adam Fisch
Hrayr Harutyunyan
Ziwei Ji
Seungyeon Kim
Tal Schuster
KELM
81
5
0
28 Oct 2024
Provable Weak-to-Strong Generalization via Benign Overfitting
Provable Weak-to-Strong Generalization via Benign Overfitting
David X. Wu
A. Sahai
70
6
0
06 Oct 2024
On Logical Extrapolation for Mazes with Recurrent and Implicit Networks
On Logical Extrapolation for Mazes with Recurrent and Implicit Networks
Brandon Knutson
Amandin Chyba Rabeendran
Michael I. Ivanitskiy
Jordan Pettyjohn
Cecilia G. Diniz Behn
Samy Wu Fung
Daniel McKenzie
LRM
44
2
0
03 Oct 2024
Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM
  Performance and Generalization
Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization
Mucong Ding
Chenghao Deng
Jocelyn Choo
Zichu Wu
Aakriti Agrawal
...
Dinesh Manocha
Tom Goldstein
John Langford
Anima Anandkumar
Furong Huang
59
5
0
27 Sep 2024
The Extrapolation Power of Implicit Models
The Extrapolation Power of Implicit Models
Juliette Decugis
Alicia Y. Tsai
Max Emerling
Ashwin Ganesh
L. Ghaoui
34
0
0
19 Jul 2024
Universal Length Generalization with Turing Programs
Universal Length Generalization with Turing Programs
Kaiying Hou
David Brandfonbrener
Sham Kakade
Samy Jelassi
Eran Malach
44
7
0
03 Jul 2024
Learning Iterative Reasoning through Energy Diffusion
Learning Iterative Reasoning through Energy Diffusion
Yilun Du
Jiayuan Mao
Joshua B. Tenenbaum
LRM
PINN
48
6
0
17 Jun 2024
Transformers Can Do Arithmetic with the Right Embeddings
Transformers Can Do Arithmetic with the Right Embeddings
Sean McLeish
Arpit Bansal
Alex Stein
Neel Jain
John Kirchenbauer
...
B. Kailkhura
A. Bhatele
Jonas Geiping
Avi Schwarzschild
Tom Goldstein
53
28
0
27 May 2024
Benchmarking ChatGPT on Algorithmic Reasoning
Benchmarking ChatGPT on Algorithmic Reasoning
Sean McLeish
Avi Schwarzschild
Tom Goldstein
AI4MH
LRM
37
4
0
04 Apr 2024
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
Zhiqing Sun
Longhui Yu
Yikang Shen
Weiyang Liu
Yiming Yang
Sean Welleck
Chuang Gan
36
54
0
14 Mar 2024
When in Doubt, Think Slow: Iterative Reasoning with Latent Imagination
When in Doubt, Think Slow: Iterative Reasoning with Latent Imagination
Martin A Benfeghoul
Umais Zahid
Qinghai Guo
Z. Fountas
OffRL
LRM
35
2
0
23 Feb 2024
Transformers Can Achieve Length Generalization But Not Robustly
Transformers Can Achieve Length Generalization But Not Robustly
Yongchao Zhou
Uri Alon
Xinyun Chen
Xuezhi Wang
Rishabh Agarwal
Denny Zhou
49
36
0
14 Feb 2024
The Unreasonable Effectiveness of Easy Training Data for Hard Tasks
The Unreasonable Effectiveness of Easy Training Data for Hard Tasks
Peter Hase
Mohit Bansal
Peter Clark
Sarah Wiegreffe
15
25
0
12 Jan 2024
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak
  Supervision
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Collin Burns
Pavel Izmailov
Jan Hendrik Kirchner
Bowen Baker
Leo Gao
...
Adrien Ecoffet
Manas Joglekar
Jan Leike
Ilya Sutskever
Jeff Wu
ELM
50
258
0
14 Dec 2023
Do Smaller Language Models Answer Contextualised Questions Through
  Memorisation Or Generalisation?
Do Smaller Language Models Answer Contextualised Questions Through Memorisation Or Generalisation?
Tim Hartill
Joshua Bensemann
Michael Witbrock
Patricia Riddle
KELM
22
0
0
21 Nov 2023
Adaptive recurrent vision performs zero-shot computation scaling to
  unseen difficulty levels
Adaptive recurrent vision performs zero-shot computation scaling to unseen difficulty levels
Vijay Veerabadran
Srinivas Ravishankar
Yuan Tang
Ritik Raina
Virginia R. de Sa
11
4
0
12 Nov 2023
Large Language Models Are Zero-Shot Time Series Forecasters
Large Language Models Are Zero-Shot Time Series Forecasters
Nate Gruver
Marc Finzi
Shikai Qiu
Andrew Gordon Wilson
AI4TS
33
319
0
11 Oct 2023
Flood and Echo Net: Algorithmically Aligned GNNs that Generalize
Flood and Echo Net: Algorithmically Aligned GNNs that Generalize
Joël Mathys
Florian Grötschla
K. Nadimpalli
Roger Wattenhofer
FedML
44
0
0
10 Oct 2023
SALSA-CLRS: A Sparse and Scalable Benchmark for Algorithmic Reasoning
SALSA-CLRS: A Sparse and Scalable Benchmark for Algorithmic Reasoning
Julian Minder
Florian Grötschla
Joël Mathys
Roger Wattenhofer
29
8
0
21 Sep 2023
A Configurable Library for Generating and Manipulating Maze Datasets
A Configurable Library for Generating and Manipulating Maze Datasets
Michael I. Ivanitskiy
Rusheb Shah
Alex F Spies
Tilman Rauker
Dan Valentine
...
Lucia Quirke
Chris Mathwin
Guillaume Corlouer
Cecilia G. Diniz Behn
Samy Wu Fung Colorado School of Mines
43
11
0
19 Sep 2023
Grounded Image Text Matching with Mismatched Relation Reasoning
Grounded Image Text Matching with Mismatched Relation Reasoning
Yu Wu
Yan-Tao Wei
Haozhe Jasper Wang
Yongfei Liu
Sibei Yang
Xuming He
31
6
0
02 Aug 2023
Skills-in-Context Prompting: Unlocking Compositionality in Large
  Language Models
Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models
Jiaao Chen
Xiaoman Pan
Dian Yu
Kaiqiang Song
Xiaoyang Wang
Dong Yu
Jianshu Chen
ReLM
LRM
21
24
0
01 Aug 2023
Faith and Fate: Limits of Transformers on Compositionality
Faith and Fate: Limits of Transformers on Compositionality
Nouha Dziri
Ximing Lu
Melanie Sclar
Xiang Lorraine Li
Liwei Jian
...
Sean Welleck
Xiang Ren
Allyson Ettinger
Zaïd Harchaoui
Yejin Choi
ReLM
LRM
30
329
0
29 May 2023
Banana: Banach Fixed-Point Network for Pointcloud Segmentation with
  Inter-Part Equivariance
Banana: Banach Fixed-Point Network for Pointcloud Segmentation with Inter-Part Equivariance
Congyue Deng
Jiahui Lei
Bokui Shen
Kostas Daniilidis
Leonidas J. Guibas
3DPC
27
16
0
25 May 2023
Iterative Forward Tuning Boosts In-Context Learning in Language Models
Iterative Forward Tuning Boosts In-Context Learning in Language Models
Jiaxi Yang
Binyuan Hui
Min Yang
Bailin Wang
Bowen Li
Binhua Li
Fei Huang
Yongbin Li
38
16
0
22 May 2023
Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning
  and Coding with LLMs
Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs
Pranjal Aggarwal
Aman Madaan
Yiming Yang
Mausam
LRM
28
36
0
19 May 2023
The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of
  Inductive Biases in Machine Learning
The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning
Micah Goldblum
Marc Finzi
K. Rowan
A. Wilson
UQCV
FedML
24
37
0
11 Apr 2023
Diffusion Probabilistic Models for Structured Node Classification
Diffusion Probabilistic Models for Structured Node Classification
Hyosoon Jang
Seonghyun Park
Sangwoo Mo
Sungsoo Ahn
DiffM
20
3
0
21 Feb 2023
Adaptive Computation with Elastic Input Sequence
Adaptive Computation with Elastic Input Sequence
Fuzhao Xue
Valerii Likhosherstov
Anurag Arnab
N. Houlsby
Mostafa Dehghani
Yang You
31
18
0
30 Jan 2023
Learning to solve arithmetic problems with a virtual abacus
Learning to solve arithmetic problems with a virtual abacus
Flavio Petruzzellis
Ling-Hao Chen
Alberto Testolin
34
1
0
17 Jan 2023
Learning Graph Algorithms With Recurrent Graph Neural Networks
Learning Graph Algorithms With Recurrent Graph Neural Networks
Florian Grötschla
Joël Mathys
Roger Wattenhofer
GNN
19
6
0
09 Dec 2022
Learning to design without prior data: Discovering generalizable design
  strategies using deep learning and tree search
Learning to design without prior data: Discovering generalizable design strategies using deep learning and tree search
Ayush Raina
Jonathan Cagan
Christopher McComb
AI4CE
25
9
0
28 Nov 2022
A Recursively Recurrent Neural Network (R2N2) Architecture for Learning
  Iterative Algorithms
A Recursively Recurrent Neural Network (R2N2) Architecture for Learning Iterative Algorithms
Danimir T. Doncevic
Alexander Mitsos
Yu Guo
Qianxiao Li
Felix Dietrich
Manuel Dahmen
Ioannis G. Kevrekidis
15
7
0
22 Nov 2022
Simplicity Bias in Transformers and their Ability to Learn Sparse
  Boolean Functions
Simplicity Bias in Transformers and their Ability to Learn Sparse Boolean Functions
S. Bhattamishra
Arkil Patel
Varun Kanade
Phil Blunsom
16
44
0
22 Nov 2022
Path Independent Equilibrium Models Can Better Exploit Test-Time
  Computation
Path Independent Equilibrium Models Can Better Exploit Test-Time Computation
Cem Anil
Ashwini Pokle
Kaiqu Liang
Johannes Treutlein
Yuhuai Wu
Shaojie Bai
Zico Kolter
Roger C. Grosse
29
16
0
18 Nov 2022
Logical Tasks for Measuring Extrapolation and Rule Comprehension
Logical Tasks for Measuring Extrapolation and Rule Comprehension
Ippei Fujisawa
Ryota Kanai
ELM
LRM
28
4
0
14 Nov 2022
Transformers Learn Shortcuts to Automata
Transformers Learn Shortcuts to Automata
Bingbin Liu
Jordan T. Ash
Surbhi Goel
A. Krishnamurthy
Cyril Zhang
OffRL
LRM
46
156
0
19 Oct 2022
Measuring and Narrowing the Compositionality Gap in Language Models
Measuring and Narrowing the Compositionality Gap in Language Models
Ofir Press
Muru Zhang
Sewon Min
Ludwig Schmidt
Noah A. Smith
M. Lewis
ReLM
KELM
LRM
52
557
0
07 Oct 2022
Recurrent Convolutional Neural Networks Learn Succinct Learning
  Algorithms
Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms
Surbhi Goel
Sham Kakade
Adam Tauman Kalai
Cyril Zhang
32
1
0
01 Sep 2022
12
Next