Full metadata
Title
Learning Policies for Model-Based Reinforcement Learning Using Distributed Reward Formulation
Description
This work explores combining state-of-the-art \gls{mbrl} algorithms focused on learning complex policies with large state-spaces and augmenting them with distributional reward perspective on \gls{rl} algorithms. Distributional \gls{rl} provides a probabilistic reward formulation as opposed to the classic \gls{rl} formulation which models the estimation of this distributional return. These probabilistic reward formulations help the agent choose highly risk-averse actions, which in turn makes the learning more stable. To evaluate this idea, I experiment in simulation on complex high-dimensional environments when subject under different noisy conditions.
Date Created
2021
Contributors
- Agarwal, Nikhil (Author)
- Ben Amor, Heni (Thesis advisor)
- Phielipp, Mariano (Committee member)
- DV, Hemanth (Committee member)
- Arizona State University (Publisher)
Topical Subject
Resource Type
Extent
38 pages
Language
eng
Copyright Statement
In Copyright
Primary Member of
Peer-reviewed
No
Open Access
No
Handle
https://hdl.handle.net/2286/R.2.N.161694
Level of coding
minimal
Cataloging Standards
Note
Partial requirement for: M.S., Arizona State University, 2021
Field of study: Computer Science
System Created
- 2021-11-16 03:15:05
System Modified
- 2021-11-30 12:51:28
- 2 years 11 months ago
Additional Formats