ResearchTrend.AI
arXiv: 2206.12984
Improving Policy Optimization with Generalist-Specialist Learning

26 June 2022
Zhiwei Jia
Xuanlin Li
Z. Ling
Shuang Liu
Yiran Wu
H. Su
    OffRL

Papers citing "Improving Policy Optimization with Generalist-Specialist Learning"

6 / 6 papers shown
PhysReaction: Physically Plausible Real-Time Humanoid Reaction Synthesis via Forward Dynamics Guided 4D Imitation
Yunze Liu
Changxi Chen
Chenjing Ding
Li Yi
31
6
0
01 Apr 2024
Perpetual Humanoid Control for Real-time Simulated Avatars
Zhengyi Luo
Jinkun Cao
Alexander W. Winkler
Kris M. Kitani
Weipeng Xu
49
88
0
10 May 2023
Chain-of-Thought Predictive Control
Zhiwei Jia
Vineet Thumuluri
Fangchen Liu
Ling-Hao Chen
Zhiao Huang
H. Su
LM&Ro
31
20
0
03 Apr 2023
UniDexGrasp++: Improving Dexterous Grasping Policy Learning via Geometry-aware Curriculum and Iterative Generalist-Specialist Learning
Weikang Wan
Haoran Geng
Yun-Hai Liu
Zikang Shan
Yaodong Yang
Li Yi
He-Nan Wang
44
94
0
02 Apr 2023
ManiSkill2: A Unified Benchmark for Generalizable Manipulation Skills
Jiayuan Gu
Fanbo Xiang
Xuanlin Li
Z. Ling
Xiqiang Liu
...
Xiao Yuan
P. Xie
Zhiao Huang
Rui Chen
Hao Su
34
174
0
09 Feb 2023
Improving generalization in reinforcement learning through forked agents
Olivier Moulin
Vincent François-Lavet
Mark Hoogendoorn
AI4CE
23
0
0
13 Dec 2022