ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1901.11503
  4. Cited By
Contrasting Exploration in Parameter and Action Space: A Zeroth-Order
  Optimization Perspective

Contrasting Exploration in Parameter and Action Space: A Zeroth-Order Optimization Perspective

31 January 2019
Anirudh Vemula
Wen Sun
J. Andrew Bagnell
ArXiv (abs)PDFHTML

Papers citing "Contrasting Exploration in Parameter and Action Space: A Zeroth-Order Optimization Perspective"

30 / 30 papers shown
Title
Fully Zeroth-Order Bilevel Programming via Gaussian Smoothing
Fully Zeroth-Order Bilevel Programming via Gaussian Smoothing
Alireza Aghasi
Saeed Ghadimi
87
3
0
29 Mar 2024
Task2Morph: Differentiable Task-inspired Framework for Contact-Aware
  Robot Design
Task2Morph: Differentiable Task-inspired Framework for Contact-Aware Robot Design
Yishuai Cai
Shaowu Yang
Minglong Li
Xinglin Chen
Yunxin Mao
Xiaodong Yi
Wenjing Yang
81
3
0
28 Mar 2024
Forward Learning for Gradient-based Black-box Saliency Map Generation
Forward Learning for Gradient-based Black-box Saliency Map Generation
Zeliang Zhang
Mingqian Feng
Jinyang Jiang
Rongyi Zhu
Yijie Peng
Chenliang Xu
FAtt
103
2
0
22 Mar 2024
Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM
  Fine-Tuning
Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning
Yong Liu
Zirui Zhu
Chaoyu Gong
Minhao Cheng
Cho-Jui Hsieh
Yang You
MoE
81
23
0
24 Feb 2024
Revisiting Zeroth-Order Optimization for Memory-Efficient LLM
  Fine-Tuning: A Benchmark
Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark
Yihua Zhang
Pingzhi Li
Junyuan Hong
Jiaxiang Li
Yimeng Zhang
...
Wotao Yin
Mingyi Hong
Zhangyang Wang
Sijia Liu
Tianlong Chen
132
60
0
18 Feb 2024
Rendering Wireless Environments Useful for Gradient Estimators: A Zero-Order Stochastic Federated Learning Method
Rendering Wireless Environments Useful for Gradient Estimators: A Zero-Order Stochastic Federated Learning Method
Elissa Mhanna
Mohamad Assaad
146
1
0
30 Jan 2024
DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training
DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training
Aochuan Chen
Yimeng Zhang
Jinghan Jia
James Diffenderfer
Jiancheng Liu
Konstantinos Parasyris
Yihua Zhang
Zheng Zhang
B. Kailkhura
Sijia Liu
138
48
0
03 Oct 2023
Symmetry-Aware Robot Design with Structured Subgroups
Symmetry-Aware Robot Design with Structured Subgroups
Heng Dong
Junyu Zhang
Tonghan Wang
Chongjie Zhang
58
12
0
31 May 2023
Learning One Abstract Bit at a Time Through Self-Invented Experiments
  Encoded as Neural Networks
Learning One Abstract Bit at a Time Through Self-Invented Experiments Encoded as Neural Networks
Vincent Herrmann
Louis Kirsch
Jürgen Schmidhuber
AI4CE
107
6
0
29 Dec 2022
Assessing Quality-Diversity Neuro-Evolution Algorithms Performance in
  Hard Exploration Problems
Assessing Quality-Diversity Neuro-Evolution Algorithms Performance in Hard Exploration Problems
Félix Chalumeau
Thomas Pierrot
Valentin Macé
Arthur Flajolet
Karim Beguir
Antoine Cully
Nicolas Perrin-Gilbert
99
7
0
24 Nov 2022
Zero-Order One-Point Estimate with Distributed Stochastic
  Gradient-Tracking Technique
Zero-Order One-Point Estimate with Distributed Stochastic Gradient-Tracking Technique
Elissa Mhanna
Mohamad Assaad
82
4
0
11 Oct 2022
Distributed Multi-Agent Reinforcement Learning Based on Graph-Induced
  Local Value Functions
Distributed Multi-Agent Reinforcement Learning Based on Graph-Induced Local Value Functions
Gangshan Jing
H. Bai
Jemin George
A. Chakrabortty
P. Sharma
83
5
0
26 Feb 2022
Black-Box Generalization: Stability of Zeroth-Order Learning
Black-Box Generalization: Stability of Zeroth-Order Learning
Konstantinos E. Nikolakakis
Farzin Haddadpour
Dionysios S. Kalogerias
Amin Karbasi
MLT
73
2
0
14 Feb 2022
Distributed Cooperative Multi-Agent Reinforcement Learning with Directed
  Coordination Graph
Distributed Cooperative Multi-Agent Reinforcement Learning with Directed Coordination Graph
Gangshan Jing
H. Bai
Jemin George
A. Chakrabortty
P. Sharma
OffRL
69
6
0
10 Jan 2022
Transform2Act: Learning a Transform-and-Control Policy for Efficient
  Agent Design
Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design
Ye Yuan
Yuda Song
Zhengyi Luo
Wen Sun
Kris Kitani
72
36
0
07 Oct 2021
Model-Free Learning of Optimal Deterministic Resource Allocations in
  Wireless Systems via Action-Space Exploration
Model-Free Learning of Optimal Deterministic Resource Allocations in Wireless Systems via Action-Space Exploration
Hassaan Hashmi
Dionysios S. Kalogerias
23
2
0
23 Aug 2021
Fast and Efficient Locomotion via Learned Gait Transitions
Fast and Efficient Locomotion via Learned Gait Transitions
Yuxiang Yang
Tingnan Zhang
Erwin Coumans
Jie Tan
Byron Boots
92
92
0
09 Apr 2021
Learning Sampling Policy for Faster Derivative Free Optimization
Learning Sampling Policy for Faster Derivative Free Optimization
Zhou Zhai
Bin Gu
Heng-Chiao Huang
45
1
0
09 Apr 2021
Derivative-Free Reinforcement Learning: A Review
Derivative-Free Reinforcement Learning: A Review
Hong Qian
Yang Yu
OffRL
137
42
0
10 Feb 2021
Average-reward model-free reinforcement learning: a systematic review
  and literature mapping
Average-reward model-free reinforcement learning: a systematic review and literature mapping
Vektor Dewanto
George Dunn
A. Eshragh
M. Gallagher
Fred Roosta
81
30
0
18 Oct 2020
Learning Branching Heuristics for Propositional Model Counting
Learning Branching Heuristics for Propositional Model Counting
Pashootan Vaezipoor
Gil Lederman
Yuhuai Wu
Chris J. Maddison
Roger C. Grosse
Sanjit A. Seshia
F. Bacchus
LRM
82
13
0
07 Jul 2020
Diversity Policy Gradient for Sample Efficient Quality-Diversity
  Optimization
Diversity Policy Gradient for Sample Efficient Quality-Diversity Optimization
Thomas Pierrot
Valentin Macé
Félix Chalumeau
Arthur Flajolet
Geoffrey Cideron
Karim Beguir
Antoine Cully
Olivier Sigaud
Nicolas Perrin-Gilbert
110
62
0
15 Jun 2020
Zeroth-order Deterministic Policy Gradient
Zeroth-order Deterministic Policy Gradient
Harshat Kumar
Dionysios S. Kalogerias
George J. Pappas
Alejandro Ribeiro
OffRL
33
14
0
12 Jun 2020
A Primer on Zeroth-Order Optimization in Signal Processing and Machine
  Learning
A Primer on Zeroth-Order Optimization in Signal Processing and Machine Learning
Sijia Liu
Pin-Yu Chen
B. Kailkhura
Gaoyuan Zhang
A. Hero III
P. Varshney
84
235
0
11 Jun 2020
Explicit Gradient Learning
Explicit Gradient Learning
Mor Sinay
Elad Sarafian
Y. Louzoun
Noam Agmon
Sarit Kraus
OffRL
56
8
0
09 Jun 2020
MLE-guided parameter search for task loss minimization in neural
  sequence modeling
MLE-guided parameter search for task loss minimization in neural sequence modeling
Sean Welleck
Kyunghyun Cho
65
10
0
04 Jun 2020
Learning to Guide Random Search
Learning to Guide Random Search
Ozan Sener
V. Koltun
ODL
63
21
0
25 Apr 2020
Trajectory-wise Control Variates for Variance Reduction in Policy
  Gradient Methods
Trajectory-wise Control Variates for Variance Reduction in Policy Gradient Methods
Ching-An Cheng
Xinyan Yan
Byron Boots
71
22
0
08 Aug 2019
MULEX: Disentangling Exploitation from Exploration in Deep RL
MULEX: Disentangling Exploitation from Exploration in Deep RL
Lucas Beyer
Damien Vincent
O. Teboul
Sylvain Gelly
Matthieu Geist
Olivier Pietquin
50
14
0
01 Jul 2019
Sample Complexity of Estimating the Policy Gradient for Nearly
  Deterministic Dynamical Systems
Sample Complexity of Estimating the Policy Gradient for Nearly Deterministic Dynamical Systems
Osbert Bastani
61
4
0
24 Jan 2019
1