2506.14460
Cited By
Zeroth-Order Optimization is Secretly Single-Step Policy Optimization
17 June 2025
Junbin Qiu
Zhengpeng Xie
Xiangda Yan
Yongjie Yang
Yao Shu
OffRL
Papers citing "Zeroth-Order Optimization is Secretly Single-Step Policy Optimization"
Title
No papers