1901.11503
Cited By
Contrasting Exploration in Parameter and Action Space: A Zeroth-Order Optimization Perspective
31 January 2019
Anirudh Vemula
Wen Sun
J. Andrew Bagnell
Papers citing
"Contrasting Exploration in Parameter and Action Space: A Zeroth-Order Optimization Perspective"
30 / 30 papers shown
Title
Fully Zeroth-Order Bilevel Programming via Gaussian Smoothing
Alireza Aghasi
Saeed Ghadimi
87
3
0
29 Mar 2024
Task2Morph: Differentiable Task-inspired Framework for Contact-Aware Robot Design
Yishuai Cai
Shaowu Yang
Minglong Li
Xinglin Chen
Yunxin Mao
Xiaodong Yi
Wenjing Yang
81
3
0
28 Mar 2024
Forward Learning for Gradient-based Black-box Saliency Map Generation
Zeliang Zhang
Mingqian Feng
Jinyang Jiang
Rongyi Zhu
Yijie Peng
Chenliang Xu
FAtt
103
2
0
22 Mar 2024
Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning
Yong Liu
Zirui Zhu
Chaoyu Gong
Minhao Cheng
Cho-Jui Hsieh
Yang You
MoE
81
23
0
24 Feb 2024
Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark
Yihua Zhang
Pingzhi Li
Junyuan Hong
Jiaxiang Li
Yimeng Zhang
...
Wotao Yin
Mingyi Hong
Zhangyang Wang
Sijia Liu
Tianlong Chen
132
60
0
18 Feb 2024
Rendering Wireless Environments Useful for Gradient Estimators: A Zero-Order Stochastic Federated Learning Method
Elissa Mhanna
Mohamad Assaad
146
1
0
30 Jan 2024
DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training
Aochuan Chen
Yimeng Zhang
Jinghan Jia
James Diffenderfer
Jiancheng Liu
Konstantinos Parasyris
Yihua Zhang
Zheng Zhang
B. Kailkhura
Sijia Liu
138
48
0
03 Oct 2023
Symmetry-Aware Robot Design with Structured Subgroups
Heng Dong
Junyu Zhang
Tonghan Wang
Chongjie Zhang
58
12
0
31 May 2023
Learning One Abstract Bit at a Time Through Self-Invented Experiments Encoded as Neural Networks
Vincent Herrmann
Louis Kirsch
Jürgen Schmidhuber
AI4CE
107
6
0
29 Dec 2022
Assessing Quality-Diversity Neuro-Evolution Algorithms Performance in Hard Exploration Problems
Félix Chalumeau
Thomas Pierrot
Valentin Macé
Arthur Flajolet
Karim Beguir
Antoine Cully
Nicolas Perrin-Gilbert
99
7
0
24 Nov 2022
Zero-Order One-Point Estimate with Distributed Stochastic Gradient-Tracking Technique
Elissa Mhanna
Mohamad Assaad
82
4
0
11 Oct 2022
Distributed Multi-Agent Reinforcement Learning Based on Graph-Induced Local Value Functions
Gangshan Jing
H. Bai
Jemin George
A. Chakrabortty
P. Sharma
83
5
0
26 Feb 2022
Black-Box Generalization: Stability of Zeroth-Order Learning
Konstantinos E. Nikolakakis
Farzin Haddadpour
Dionysios S. Kalogerias
Amin Karbasi
MLT
73
2
0
14 Feb 2022
Distributed Cooperative Multi-Agent Reinforcement Learning with Directed Coordination Graph
Gangshan Jing
H. Bai
Jemin George
A. Chakrabortty
P. Sharma
OffRL
69
6
0
10 Jan 2022
Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design
Ye Yuan
Yuda Song
Zhengyi Luo
Wen Sun
Kris Kitani
72
36
0
07 Oct 2021
Model-Free Learning of Optimal Deterministic Resource Allocations in Wireless Systems via Action-Space Exploration
Hassaan Hashmi
Dionysios S. Kalogerias
23
2
0
23 Aug 2021
Fast and Efficient Locomotion via Learned Gait Transitions
Yuxiang Yang
Tingnan Zhang
Erwin Coumans
Jie Tan
Byron Boots
92
92
0
09 Apr 2021
Learning Sampling Policy for Faster Derivative Free Optimization
Zhou Zhai
Bin Gu
Heng-Chiao Huang
45
1
0
09 Apr 2021
Derivative-Free Reinforcement Learning: A Review
Hong Qian
Yang Yu
OffRL
137
42
0
10 Feb 2021
Average-reward model-free reinforcement learning: a systematic review and literature mapping
Vektor Dewanto
George Dunn
A. Eshragh
M. Gallagher
Fred Roosta
81
30
0
18 Oct 2020
Learning Branching Heuristics for Propositional Model Counting
Pashootan Vaezipoor
Gil Lederman
Yuhuai Wu
Chris J. Maddison
Roger C. Grosse
Sanjit A. Seshia
F. Bacchus
LRM
82
13
0
07 Jul 2020
Diversity Policy Gradient for Sample Efficient Quality-Diversity Optimization
Thomas Pierrot
Valentin Macé
Félix Chalumeau
Arthur Flajolet
Geoffrey Cideron
Karim Beguir
Antoine Cully
Olivier Sigaud
Nicolas Perrin-Gilbert
110
62
0
15 Jun 2020
Zeroth-order Deterministic Policy Gradient
Harshat Kumar
Dionysios S. Kalogerias
George J. Pappas
Alejandro Ribeiro
OffRL
33
14
0
12 Jun 2020
A Primer on Zeroth-Order Optimization in Signal Processing and Machine Learning
Sijia Liu
Pin-Yu Chen
B. Kailkhura
Gaoyuan Zhang
A. Hero III
P. Varshney
84
235
0
11 Jun 2020
Explicit Gradient Learning
Mor Sinay
Elad Sarafian
Y. Louzoun
Noam Agmon
Sarit Kraus
OffRL
56
8
0
09 Jun 2020
MLE-guided parameter search for task loss minimization in neural sequence modeling
Sean Welleck
Kyunghyun Cho
65
10
0
04 Jun 2020
Learning to Guide Random Search
Ozan Sener
V. Koltun
ODL
63
21
0
25 Apr 2020
Trajectory-wise Control Variates for Variance Reduction in Policy Gradient Methods
Ching-An Cheng
Xinyan Yan
Byron Boots
71
22
0
08 Aug 2019
MULEX: Disentangling Exploitation from Exploration in Deep RL
Lucas Beyer
Damien Vincent
O. Teboul
Sylvain Gelly
Matthieu Geist
Olivier Pietquin
50
14
0
01 Jul 2019
Sample Complexity of Estimating the Policy Gradient for Nearly Deterministic Dynamical Systems
Osbert Bastani
61
4
0
24 Jan 2019