This project investigates modeling uncertainty for reinforcement learning of agent policies in Markov Decision Processes. The idea is to approximate the posterior over beliefs of the Q values with experience. The beliefs are combined in real-time using Assumed Density Filtering (ADF) with Temporal Difference Learning. Applications of this reinforcement learning algorithms include learning autonomous behaviors and modeling driver intentions.
Name | Affiliation | Role | Position | |
---|---|---|---|---|
heejinj@seas.upenn.edu | Jeong, Heejin | University of Pennsylvania | Other | Student - PhD |
ddlee@seas.upenn.edu | Lee, Daniel | University of Pennsylvania | PI | Faculty - Tenured |
Type | Name | Uploaded |
---|---|---|
Publication | adfq_aaai_v1.pdf | Sept. 30, 2018, 12:09 p.m. |
Presentation | Bayesian Q-learning with Assumed Density Filtering | March 30, 2018, 6:57 a.m. |
Progress Report | 93_Progress_Report_2018-03-30 | March 30, 2018, 6:58 a.m. |
Progress Report | 93_Progress_Report_2018-09-30 | Sept. 30, 2018, 12:09 p.m. |
No match sources!
No partners!