ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2107.12544
  4. Cited By
Human-Level Reinforcement Learning through Theory-Based Modeling,
  Exploration, and Planning

Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning

27 July 2021
Pedro Tsividis
J. Loula
Jake Burga
Nathan Foss
Andres Campero
Thomas Pouncy
S. Gershman
J. Tenenbaum
    LM&Ro
ArXivPDFHTML

Papers citing "Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning"

24 / 24 papers shown
Title
Possible principles for aligned structure learning agents
Possible principles for aligned structure learning agents
Lancelot Da Costa
Tomáš Gavenčiak
David Hyland
Mandana Samiei
Cristian Dragos-Manta
Candice Pattisapu
Adeel Razi
Karl J. Friston
20
1
0
30 Sep 2024
Toward Universal and Interpretable World Models for Open-ended Learning
  Agents
Toward Universal and Interpretable World Models for Open-ended Learning Agents
Lancelot Da Costa
TPM
26
1
0
27 Sep 2024
Learning to Play Video Games with Intuitive Physics Priors
Learning to Play Video Games with Intuitive Physics Priors
Abhishek Jaiswal
Nisheeth Srivastava
OCL
23
1
0
20 Sep 2024
Generating Code World Models with Large Language Models Guided by Monte
  Carlo Tree Search
Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search
Nicola Dainese
Matteo Merler
Minttu Alakuijala
Pekka Marttinen
LLMAG
36
7
0
24 May 2024
WorldCoder, a Model-Based LLM Agent: Building World Models by Writing
  Code and Interacting with the Environment
WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the Environment
Hao Tang
Darren Key
Kevin Ellis
LLMAG
20
27
0
19 Feb 2024
Personalizing Driver Safety Interfaces via Driver Cognitive Factors
  Inference
Personalizing Driver Safety Interfaces via Driver Cognitive Factors Inference
Emily S. Sumner
Jonathan A. DeCastro
Jean Costa
Deepak Gopinath
Everlyne Kimani
...
H. Yasuda
Katharine Sieck
Avinash Balachandran
Tiffany Chen
Guy Rosman
20
4
0
08 Feb 2024
Regularity as Intrinsic Reward for Free Play
Regularity as Intrinsic Reward for Free Play
Cansu Sancaktar
J. Piater
Georg Martius
42
2
0
03 Dec 2023
MoCa: Measuring Human-Language Model Alignment on Causal and Moral
  Judgment Tasks
MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks
Allen Nie
Yuhui Zhang
Atharva Amdekar
Chris Piech
Tatsunori Hashimoto
Tobias Gerstenberg
27
34
0
30 Oct 2023
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
Kai-Wen Zhao
Yi-An Ma
Jianye Hao
Jinyi Liu
Yan Zheng
Zhaopeng Meng
OffRL
OnRL
18
12
0
12 Jun 2023
Human-like Few-Shot Learning via Bayesian Reasoning over Natural
  Language
Human-like Few-Shot Learning via Bayesian Reasoning over Natural Language
Kevin Ellis
BDL
LRM
21
16
0
05 Jun 2023
Meta-Learned Models of Cognition
Meta-Learned Models of Cognition
Marcel Binz
Ishita Dasgupta
A. Jagadish
M. Botvinick
Jane X. Wang
Eric Schulz
30
25
0
12 Apr 2023
Does Deep Learning Learn to Abstract? A Systematic Probing Framework
Does Deep Learning Learn to Abstract? A Systematic Probing Framework
Shengnan An
Zeqi Lin
B. Chen
Qiang Fu
Nanning Zheng
Jian-Guang Lou
31
4
0
23 Feb 2023
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Haichao Zhang
Weiwen Xu
Haonan Yu
CLL
OffRL
OnRL
34
62
0
02 Feb 2023
Generalised agent for solving higher board states of tic tac toe using
  Reinforcement Learning
Generalised agent for solving higher board states of tic tac toe using Reinforcement Learning
Bhavuk Kalra
16
0
0
23 Dec 2022
Language Models as Agent Models
Language Models as Agent Models
Jacob Andreas
LLMAG
26
132
0
03 Dec 2022
Learning Latent Traits for Simulated Cooperative Driving Tasks
Learning Latent Traits for Simulated Cooperative Driving Tasks
Jonathan A. DeCastro
Deepak Gopinath
Guy Rosman
Emily S. Sumner
Shabnam Hakimi
Simon Stent
18
0
0
20 Jul 2022
Curious Exploration via Structured World Models Yields Zero-Shot Object
  Manipulation
Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation
Cansu Sancaktar
Sebastian Blaes
Georg Martius
LM&Ro
18
24
0
22 Jun 2022
Disentangling Abstraction from Statistical Pattern Matching in Human and
  Machine Learning
Disentangling Abstraction from Statistical Pattern Matching in Human and Machine Learning
Sreejan Kumar
Ishita Dasgupta
Nathaniel D. Daw
J. Cohen
Thomas L. Griffiths
27
10
0
04 Apr 2022
Learning Relational Rules from Rewards
Learning Relational Rules from Rewards
Guillermo Puebla
L. Doumas
14
0
0
25 Mar 2022
Target Languages (vs. Inductive Biases) for Learning to Act and Plan
Target Languages (vs. Inductive Biases) for Learning to Act and Plan
Hector Geffner
36
6
0
15 Sep 2021
Learning to solve complex tasks by growing knowledge culturally across
  generations
Learning to solve complex tasks by growing knowledge culturally across generations
Michael Henry Tessler
Jason Madeano
Pedro Tsividis
Brin Harper
Noah D. Goodman
J. Tenenbaum
20
7
0
28 Jul 2021
Alchemy: A benchmark and analysis toolkit for meta-reinforcement
  learning agents
Alchemy: A benchmark and analysis toolkit for meta-reinforcement learning agents
Jane X. Wang
Michael King
Nicolas Porcel
Z. Kurth-Nelson
Tina Zhu
...
Neil C. Rabinowitz
Loic Matthey
Demis Hassabis
Alexander Lerchner
M. Botvinick
OffRL
23
29
0
04 Feb 2021
Building machines that adapt and compute like brains
Building machines that adapt and compute like brains
Brenden Lake
J. Tenenbaum
AI4CE
FedML
NAI
AILaw
254
890
0
11 Nov 2017
A Compositional Object-Based Approach to Learning Physical Dynamics
A Compositional Object-Based Approach to Learning Physical Dynamics
Michael Chang
T. Ullman
Antonio Torralba
J. Tenenbaum
AI4CE
OCL
241
438
0
01 Dec 2016
1