Title |
---|
Adaptive teachers for amortized samplers. Minsu Kim, Sanghyeok Choi, Taeyoung Yun, Emmanuel Bengio, Leo Feng, Jarrid Rector-Brooks, Sungsoo Ahn, Jinkyoo Park, Nikolay Malkin, Yoshua Bengio |
Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines. Cathy Wu, Aravind Rajeswaran, Yan Duan, Vikash Kumar, Alexandre M. Bayen, Sham Kakade, Igor Mordatch, Pieter Abbeel |