Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.20573
Cited By
Collision- and Reachability-Aware Multi-Robot Control with Grounded LLM Planners
26 May 2025
Jiabao Ji
Yongchao Chen
Yang Zhang
Ramana Rao Kompella
Chuchu Fan
Gaowen Liu
Shiyu Chang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Collision- and Reachability-Aware Multi-Robot Control with Grounded LLM Planners"
32 / 32 papers shown
Title
DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition
Zhaochun Ren
Zhihong Shao
Junxiao Song
Huajian Xin
Haoyu Wang
...
Hongxuan Tang
Yuxuan Liu
Wenjun Gao
Daya Guo
Chong Ruan
AIMat
ReLM
LRM
66
19
0
30 Apr 2025
Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning
Joykirat Singh
Raghav Magazine
Yash Pandya
A. Nambi
LLMAG
KELM
OffRL
LRM
284
6
0
28 Apr 2025
Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo
João Loula
Benjamin LeBrun
Li Du
Ben Lipkin
Clemente Pasti
...
Ryan Cotterel
Vikash K. Mansinghka
Alexander K. Lew
Tim Vieira
Timothy J. O'Donnell
77
4
0
17 Apr 2025
Emotion Recognition Using Convolutional Neural Networks
Shaoyuan Xu
Yang Cheng
Qian Lin
J. Allebach
43
1
0
03 Apr 2025
ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning
Bairu Hou
Yang Zhang
Jiabao Ji
Yujian Liu
Kaizhi Qian
Jacob Andreas
Shiyu Chang
OffRL
LRM
80
20
0
02 Apr 2025
SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild
Weihao Zeng
Yuzhen Huang
Qian Liu
Wei Liu
Keqing He
Zejun Ma
Junxian He
OffRL
ReLM
LRM
113
85
0
24 Mar 2025
What Makes a Reward Model a Good Teacher? An Optimization Perspective
Noam Razin
Zixuan Wang
Hubert Strauss
Stanley Wei
Jason D. Lee
Sanjeev Arora
65
9
0
19 Mar 2025
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Bowen Jin
Hansi Zeng
Zhenrui Yue
Dong Wang
Sercan O. Arik
Dong Wang
Hamed Zamani
Jiawei Han
RALM
ReLM
KELM
OffRL
AI4TS
LRM
123
77
0
12 Mar 2025
Code-as-Symbolic-Planner: Foundation Model-Based Robot Planning via Symbolic Code Generation
Yuxiao Chen
Yilun Hao
Yang Zhang
Chuchu Fan
LRM
91
2
0
03 Mar 2025
MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning
Jiazhen Pan
Che Liu
Junde Wu
Fenglin Liu
Jiayuan Zhu
Hongwei Bran Li
Chen Chen
Cheng Ouyang
Daniel Rueckert
LRM
LM&MA
VLM
119
24
0
26 Feb 2025
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Tianzhe Chu
Yuexiang Zhai
Jihan Yang
Shengbang Tong
Saining Xie
Dale Schuurmans
Quoc V. Le
Sergey Levine
Yi-An Ma
OffRL
109
85
0
28 Jan 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM
VLM
OffRL
AI4TS
LRM
254
1,503
0
22 Jan 2025
HybridFlow: A Flexible and Efficient RLHF Framework
Guangming Sheng
Chi Zhang
Zilingfeng Ye
Xibin Wu
Wang Zhang
Ru Zhang
Size Zheng
Haibin Lin
Chuan Wu
AI4CE
90
171
0
28 Sep 2024
Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration
Yang Zhang
Shixin Yang
Chenjia Bai
Fei Wu
Xiu Li
Zhen Wang
Xuelong Li
LLMAG
82
29
0
23 May 2024
Embodied LLM Agents Learn to Cooperate in Organized Teams
Xudong Guo
Kaixuan Huang
Jiale Liu
Wenhui Fan
Natalia Vélez
Qingyun Wu
Huazheng Wang
Thomas L. Griffiths
Mengdi Wang
LM&Ro
LLMAG
77
42
0
19 Mar 2024
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
E. Zelikman
Georges Harik
Yijia Shao
Varuna Jayasiri
Nick Haber
Noah D. Goodman
LLMAG
ReLM
LRM
82
131
0
14 Mar 2024
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Lucas Lehnert
Sainbayar Sukhbaatar
DiJia Su
Qinqing Zheng
Paul Mcvay
Michael Rabbat
Yuandong Tian
55
56
0
21 Feb 2024
V-STaR: Training Verifiers for Self-Taught Reasoners
Arian Hosseini
Xingdi Yuan
Nikolay Malkin
Rameswar Panda
Alessandro Sordoni
Rishabh Agarwal
ReLM
LRM
66
119
0
09 Feb 2024
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Zhihong Shao
Peiyi Wang
Qihao Zhu
Runxin Xu
Jun-Mei Song
...
Haowei Zhang
Mingchuan Zhang
Yiming Li
Yu-Huan Wu
Daya Guo
ReLM
LRM
91
953
0
05 Feb 2024
GCBF+: A Neural Graph Control Barrier Function Framework for Distributed Safe Multi-Agent Control
Songyuan Zhang
Oswin So
Kunal Garg
Chuchu Fan
AI4CE
105
23
0
25 Jan 2024
Scalable Multi-Robot Collaboration with Large Language Models: Centralized or Decentralized Systems?
Yongchao Chen
Jacob Arkin
Yang Zhang
Nicholas Roy
Chuchu Fan
LLMAG
LM&Ro
47
78
0
27 Sep 2023
AutoTAMP: Autoregressive Task and Motion Planning with LLMs as Translators and Checkers
Yongchao Chen
Jacob Arkin
Charles Dawson
Yang Zhang
Nicholas Roy
Chuchu Fan
LRM
41
101
0
10 Jun 2023
Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning
L. Guan
Karthik Valmeekam
S. Sreedharan
Subbarao Kambhampati
LLMAG
38
171
0
24 May 2023
Errors are Useful Prompts: Instruction Guided Task Programming with Verifier-Assisted Iterative Prompting
Marta Skreta
Naruki Yoshikawa
Sebastian Arellano-Rubach
Zhi Ji
L. B. Kristensen
Kourosh Darvish
Alán Aspuru-Guzik
Florian Shkurti
Animesh Garg
98
58
0
24 Mar 2023
Text2Motion: From Natural Language Instructions to Feasible Plans
Kevin Qinghong Lin
Christopher Agia
Toki Migimatsu
Marco Pavone
Jeannette Bohg
LM&Ro
54
272
0
21 Mar 2023
ProgPrompt: Generating Situated Robot Task Plans using Large Language Models
Ishika Singh
Valts Blukis
Arsalan Mousavian
Ankit Goyal
Danfei Xu
Jonathan Tremblay
Dieter Fox
Jesse Thomason
Animesh Garg
LM&Ro
LLMAG
137
640
0
22 Sep 2022
Inner Monologue: Embodied Reasoning through Planning with Language Models
Wenlong Huang
F. Xia
Ted Xiao
Harris Chan
Jacky Liang
...
Tomas Jackson
Linda Luu
Sergey Levine
Karol Hausman
Brian Ichter
LLMAG
LM&Ro
LRM
78
880
0
12 Jul 2022
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Michael Ahn
Anthony Brohan
Noah Brown
Yevgen Chebotar
Omar Cortes
...
Ted Xiao
Peng Xu
Sichun Xu
Mengyuan Yan
Andy Zeng
LM&Ro
138
1,922
0
04 Apr 2022
Multi-agent Motion Planning from Signal Temporal Logic Specifications
Dawei Sun
Jingkai Chen
Sayan Mitra
Chuchu Fan
45
81
0
13 Jan 2022
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
248
18,685
0
20 Jul 2017
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
45
3,368
0
08 Jun 2015
PDDL2.1: An Extension to PDDL for Expressing Temporal Planning Domains
M. Fox
D. Long
74
2,168
0
22 Jun 2011
1