Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.21713
Cited By
v1
v2 (latest)
Social Learning through Interactions with Other Agents: A Survey
31 July 2024
Dylan Hillier
Cheston Tan
Jing Jiang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Social Learning through Interactions with Other Agents: A Survey"
15 / 15 papers shown
Title
CoderAgent: Simulating Student Behavior for Personalized Programming Learning with Large Language Models
Yi Zhan
Qi Liu
Weibo Gao
Zheng Zhang
Tianfu Wang
Shuanghong Shen
Junyu Lu
Zhenya Huang
LLMAG
AI4CE
77
0
0
27 May 2025
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Alex Warstadt
Aaron Mueller
Leshem Choshen
E. Wilcox
Chengxu Zhuang
...
Rafael Mosquera
Bhargavi Paranjape
Adina Williams
Tal Linzen
Ryan Cotterell
184
120
0
10 Apr 2025
Social Skill Training with Large Language Models
Diyi Yang
Caleb Ziems
William B. Held
Omar Shaikh
Michael S. Bernstein
John C. Mitchell
LLMAG
71
11
0
05 Apr 2024
RT-H: Action Hierarchies Using Language
Suneel Belkhale
Tianli Ding
Ted Xiao
P. Sermanet
Quon Vuong
Jonathan Tompson
Yevgen Chebotar
Debidatta Dwibedi
Dorsa Sadigh
LM&Ro
102
89
0
04 Mar 2024
Theory of Mind for Multi-Agent Collaboration via Large Language Models
Huao Li
Yu Quan Chong
Simon Stepputtis
Joseph Campbell
Dana Hughes
Michael Lewis
Katia Sycara
LLMAG
99
76
0
16 Oct 2023
Building Cooperative Embodied Agents Modularly with Large Language Models
Hongxin Zhang
Weihua Du
Jiaming Shan
Qinhong Zhou
Yilun Du
J. Tenenbaum
Tianmin Shu
Chuang Gan
LLMAG
LM&Ro
121
175
0
05 Jul 2023
Language to Rewards for Robotic Skill Synthesis
Wenhao Yu
Nimrod Gileadi
Chuyuan Fu
Sean Kirmani
Kuang-Huei Lee
...
N. Heess
Dorsa Sadigh
Jie Tan
Yuval Tassa
F. Xia
LM&Ro
83
282
0
14 Jun 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Rafael Rafailov
Archit Sharma
E. Mitchell
Stefano Ermon
Christopher D. Manning
Chelsea Finn
ALM
387
4,139
0
29 May 2023
Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale
Ram Ramrakhya
Eric Undersander
Dhruv Batra
Abhishek Das
LM&Ro
120
119
0
07 Apr 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
883
13,176
0
04 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
823
9,644
0
28 Jan 2022
Social Neuro AI: Social Interaction as the "dark matter" of AI
Samuele Bolotta
G. Dumas
61
23
0
31 Dec 2021
Learning Rewards from Linguistic Feedback
T. Sumers
Mark K. Ho
Robert D. Hawkins
Karthik Narasimhan
Thomas Griffiths
118
54
0
30 Sep 2020
ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks
Mohit Shridhar
Jesse Thomason
Daniel Gordon
Yonatan Bisk
Winson Han
Roozbeh Mottaghi
Luke Zettlemoyer
Dieter Fox
LM&Ro
117
779
0
03 Dec 2019
A Review of Cooperative Multi-Agent Deep Reinforcement Learning
Afshin Oroojlooyjadid
Davood Hajinezhad
97
431
0
11 Aug 2019
1