1701.07274
v6 (latest)
Deep Reinforcement Learning: An Overview
25 January 2017
Yuxi Li
OffRL
VLM
Papers citing
"Deep Reinforcement Learning: An Overview"
50 / 417 papers shown
Title
Bailing-TTS: Chinese Dialectal Speech Synthesis Towards Human-like Spontaneous Representation
Xinhan Di
Jiahao Lu
Yunming Liang
Junjie Zheng
Yihua Wang
Chaofan Ding
ALM
91
1
0
01 Aug 2024
Ontology-driven Reinforcement Learning for Personalized Student Support
Ryan Hare
Ying Tang
43
1
0
14 Jul 2024
An Open-source Hardware/Software Architecture and Supporting Simulation Environment to Perform Human FPV Flight Demonstrations for Unmanned Aerial Vehicle Autonomy
Haosong Xiao
Prajit KrisshnaKumar
Jagadeswara P K V Pothuri
Puru Soni
Eric Butcher
Souma Chowdhury
89
0
0
08 Jul 2024
Pseudo-Labeling by Multi-Policy Viewfinder Network for Image Cropping
Zhiyu Pan
Kewei Wang
Yizheng Wu
Liwen Xiao
Jiahao Cui
Zhicheng Wang
Zhiguo Cao
51
0
0
02 Jul 2024
Diffusion Models for Offline Multi-agent Reinforcement Learning with Safety Constraints
Jianuo Huang
OffRL
61
0
0
30 Jun 2024
Fuzzy Logic Guided Reward Function Variation: An Oracle for Testing Reinforcement Learning Programs
Shiyu Zhang
Haoyang Song
Qixin Wang
Yu Pei
74
0
0
28 Jun 2024
LiCS: Navigation using Learned-imitation on Cluttered Space
J. J. Damanik
Jae-Won Jung
Chala Adane Deresa
Han-Lim Choi
92
4
0
21 Jun 2024
Do Not Wait: Learning Re-Ranking Model Without User Feedback At Serving Time in E-Commerce
Yuan Wang
Zhiyu Li
Changshuo Zhang
Sirui Chen
Xiao Zhang
Jun Xu
Quan Lin
62
1
0
20 Jun 2024
Towards Real-World Efficiency: Domain Randomization in Reinforcement Learning for Pre-Capture of Free-Floating Moving Targets by Autonomous Robots
Bahador Beigomi
Zheng H. Zhu
54
0
0
10 Jun 2024
Online Policy Distillation with Decision-Attention
Xinqiang Yu
Chuanguang Yang
Chengqing Yu
Libo Huang
Zhulin An
Yongjun Xu
OffRL
99
1
0
08 Jun 2024
Prototypical Reward Network for Data-Efficient RLHF
Jinghan Zhang
Xiting Wang
Yiqiao Jin
Changyu Chen
Xinhao Zhang
Kunpeng Liu
ALM
88
22
0
06 Jun 2024
Seed-TTS: A Family of High-Quality Versatile Speech Generation Models
Philip Anastassiou
Jiawei Chen
Jingshu Chen
Yuanzhe Chen
Zhuo Chen
...
Wenjie Zhang
Yanzhe Zhang
Zilin Zhao
Dejian Zhong
Xiaobin Zhuang
119
106
0
04 Jun 2024
Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach
Yuxuan Chen
Rongpeng Li
Xiaoxue Yu
Zhifeng Zhao
Honggang Zhang
88
10
0
03 Jun 2024
Diffusion Policies creating a Trust Region for Offline Reinforcement Learning
Tianyu Chen
Zhendong Wang
Mingyuan Zhou
OffRL
89
11
0
30 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
325
54
0
23 May 2024
Do No Harm: A Counterfactual Approach to Safe Reinforcement Learning
Sean Vaskov
Wilko Schwarting
Chris Baker
103
1
0
19 May 2024
Python-Based Reinforcement Learning on Simulink Models
Georg Schafer
Max Schirl
Jakob Rehrl
Stefan Huber
Simon Hirlaender
AI4CE
54
4
0
14 May 2024
PhilHumans: Benchmarking Machine Learning for Personal Health
Vadim Liventsev
Vivek Kumar
Allmin Pradhap Singh Susaiyah
Zixiu "Alex" Wu
Ivan Rodin
...
Milan Petkovic
Diego Reforgiato Recupero
Ehud Reiter
Daniele Riboni
Raymond Sterling
AI4MH
LM&MA
66
0
0
04 May 2024
Research and application of artificial intelligence based webshell detection model: A literature review
Mingrui Ma
Lansheng Han
Chunjie Zhou
129
3
0
28 Apr 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
101
9
0
22 Apr 2024
Physics-based reward driven image analysis in microscopy
Kamyar Barakati
Hui Yuan
Amit Goyal
Sergei V. Kalinin
59
2
0
22 Apr 2024
Cooperative Sentiment Agents for Multimodal Sentiment Analysis
Shan Wang
Hui Shuai
Qingshan Liu
Fei Wang
LLMAG
90
1
0
19 Apr 2024
Enhancing Autonomous Vehicle Training with Language Model Integration and Critical Scenario Generation
Hanlin Tian
Kethan Reddy
Yuxiang Feng
Mohammed Quddus
Y. Demiris
Panagiotis Angeloudis
96
14
0
12 Apr 2024
Generative Pre-Trained Transformer for Symbolic Regression Base In-Context Reinforcement Learning
Yanjie Li
Weijun Li
Lina Yu
Min Wu
Jingyi Liu
Wenqiang Li
Meilan Hao
Shu Wei
Yusong Deng
80
3
0
09 Apr 2024
Stochastic Online Optimization for Cyber-Physical and Robotic Systems
Hao Ma
Melanie Zeilinger
Michael Muehlebach
87
1
0
08 Apr 2024
From Two-Dimensional to Three-Dimensional Environment with Q-Learning: Modeling Autonomous Navigation with Reinforcement Learning and no Libraries
Ergon Cugler de Moraes Silva
OffRL
49
0
0
27 Mar 2024
Parametric PDE Control with Deep Reinforcement Learning and Differentiable L0-Sparse Polynomial Policies
N. Botteghi
Urban Fasel
AI4CE
103
6
0
22 Mar 2024
Levels of AI Agents: from Rules to Large Language Models
Yu Huang
AI4CE
ELM
LM&Ro
63
4
0
06 Mar 2024
A Survey on Applications of Reinforcement Learning in Spatial Resource Allocation
Di Zhang
Moyang Wang
Joseph D Mango
Xiang Li
Xianrui Xu
105
1
0
06 Mar 2024
Reinforcement Learning-Based Approaches for Enhancing Security and Resilience in Smart Control: A Survey on Attack and Defense Methods
Zheyu Zhang
AAML
47
0
0
23 Feb 2024
SInViG: A Self-Evolving Interactive Visual Agent for Human-Robot Interaction
Jie Xu
Hanbo Zhang
Xinghang Li
Huaping Liu
Xuguang Lan
Tao Kong
LM&Ro
95
3
0
19 Feb 2024
Optimal Parallelization Strategies for Active Flow Control in Deep Reinforcement Learning-Based Computational Fluid Dynamics
Wang Jia
Hang Xu
AI4CE
91
6
0
18 Feb 2024
SINR-Aware Deep Reinforcement Learning for Distributed Dynamic Channel Allocation in Cognitive Interference Networks
Yaniv Cohen
Tomer Gafni
Ronen Greenberg
Kobi Cohen
36
5
0
17 Feb 2024
Agents Need Not Know Their Purpose
Paulo Garcia
49
0
0
15 Feb 2024
Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning
Michael Lanier
Ying Xu
Nathan Jacobs
Chongjie Zhang
Yevgeniy Vorobeychik
54
2
0
14 Feb 2024
Steady-State Error Compensation for Reinforcement Learning with Quadratic Rewards
Liyao Wang
Zishun Zheng
Yuan Lin
26
0
0
14 Feb 2024
ACTER: Diverse and Actionable Counterfactual Sequences for Explaining and Diagnosing RL Policies
Jasmina Gajcin
Ivana Dusparic
CML
OffRL
70
2
0
09 Feb 2024
Circuit Partitioning for Multi-Core Quantum Architectures with Deep Reinforcement Learning
Arnau Pastor
Pau Escofet
Sahar Ben Rached
Eduard Alarcón
Pere Barlet-Ros
S. Abadal
GNN
137
6
0
31 Jan 2024
DittoGym: Learning to Control Soft Shape-Shifting Robots
Suning Huang
Boyuan Chen
Huazhe Xu
Vincent Sitzmann
110
3
0
24 Jan 2024
Machine Learning on Dynamic Graphs: A Survey on Applications
Sanaz Hasanzadeh Fard
AI4CE
84
5
0
16 Jan 2024
Learning Crowd Behaviors in Navigation with Attention-based Spatial-Temporal Graphs
Yanying Zhou
Jochen Garcke
GNN
104
4
0
11 Jan 2024
On Safety and Liveness Filtering Using Hamilton-Jacobi Reachability Analysis
Javier Borquez
Kaustav Chakraborty
Hao Wang
Somil Bansal
58
10
0
23 Dec 2023
Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training
Xinyan Chen
Jiaxin Ge
Tianjun Zhang
Jiaming Liu
Shanghang Zhang
VLM
EGVM
190
0
0
23 Dec 2023
Analyzing Generalization in Policy Networks: A Case Study with the Double-Integrator System
Ruining Zhang
H. Han
Maolong Lv
Qisong Yang
Jian Cheng
OffRL
66
2
0
16 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
158
5
0
13 Dec 2023
Evolving Reservoirs for Meta Reinforcement Learning
Corentin Léger
Gautier Hamon
Eleni Nisioti
X. Hinaut
Clément Moulin-Frier
88
1
0
09 Dec 2023
Learning for Semantic Knowledge Base-Guided Online Feature Transmission in Dynamic Channels
Xiangyu Gao
Yaping Sun
Dongyu Wei
Xiaodong Xu
Hao Chen
Hao Yin
Shuguang Cui
37
2
0
30 Nov 2023
Two-step dynamic obstacle avoidance
Fabian Hart
Martin Waltz
Ostap Okhrin
87
3
0
28 Nov 2023
Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization
Zhiyuan Zhao
Bin Wang
Linke Ouyang
Xiao-wen Dong
Jiaqi Wang
Conghui He
MLLM
VLM
141
135
0
28 Nov 2023
Adinkra Symbol Recognition using Classical Machine Learning and Deep Learning
Michael Adjeisah
K. Asamoah
Martha Asamoah Yeboah
Raji Rafiu King
Godwin Ferguson Achaab
Kingsley Adjei
65
0
0
27 Nov 2023