ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1901.11503
  4. Cited By
Contrasting Exploration in Parameter and Action Space: A Zeroth-Order
  Optimization Perspective

Contrasting Exploration in Parameter and Action Space: A Zeroth-Order Optimization Perspective

31 January 2019
Anirudh Vemula
Wen Sun
J. Andrew Bagnell
ArXivPDFHTML

Papers citing "Contrasting Exploration in Parameter and Action Space: A Zeroth-Order Optimization Perspective"

7 / 7 papers shown
Title
Rendering Wireless Environments Useful for Gradient Estimators: A Zero-Order Stochastic Federated Learning Method
Rendering Wireless Environments Useful for Gradient Estimators: A Zero-Order Stochastic Federated Learning Method
Elissa Mhanna
Mohamad Assaad
55
1
0
30 Jan 2024
Learning One Abstract Bit at a Time Through Self-Invented Experiments
  Encoded as Neural Networks
Learning One Abstract Bit at a Time Through Self-Invented Experiments Encoded as Neural Networks
Vincent Herrmann
Louis Kirsch
Jürgen Schmidhuber
AI4CE
48
5
0
29 Dec 2022
Zero-Order One-Point Estimate with Distributed Stochastic
  Gradient-Tracking Technique
Zero-Order One-Point Estimate with Distributed Stochastic Gradient-Tracking Technique
Elissa Mhanna
Mohamad Assaad
33
4
0
11 Oct 2022
Distributed Cooperative Multi-Agent Reinforcement Learning with Directed
  Coordination Graph
Distributed Cooperative Multi-Agent Reinforcement Learning with Directed Coordination Graph
Gangshan Jing
H. Bai
Jemin George
A. Chakrabortty
P. Sharma
OffRL
26
6
0
10 Jan 2022
Transform2Act: Learning a Transform-and-Control Policy for Efficient
  Agent Design
Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design
Ye Yuan
Yuda Song
Zhengyi Luo
Wen Sun
Kris Kitani
27
35
0
07 Oct 2021
A Primer on Zeroth-Order Optimization in Signal Processing and Machine
  Learning
A Primer on Zeroth-Order Optimization in Signal Processing and Machine Learning
Sijia Liu
Pin-Yu Chen
B. Kailkhura
Gaoyuan Zhang
A. Hero III
P. Varshney
26
224
0
11 Jun 2020
Trajectory-wise Control Variates for Variance Reduction in Policy
  Gradient Methods
Trajectory-wise Control Variates for Variance Reduction in Policy Gradient Methods
Ching-An Cheng
Xinyan Yan
Byron Boots
25
22
0
08 Aug 2019
1