Master Thesis MSTR-2018-128

BibliographyDittrich, Florian: Deep reinforcement learning for high-level behavior decision making.
University of Stuttgart, Faculty of Computer Science, Electrical Engineering, and Information Technology, Master Thesis No. 128 (2018).
77 pages, english.
Abstract

As the vision of fully autonomous vehicles potentially introduces significant benefits for our society, this work investigates approaches for sequential decision making for high-level actions in highway scenarios. These scenarios are modeled using an markov decision process (MDP) and consider deep reinforcement learning to solve it. Our approach, based on deep Q-networks (DQNs), is able to fully avoid collisions and learns a policy that results in comfortable trajectories compared to baseline policies we developed. One of the main challenges for reinforcement learning are sparse rewards, which we aim to overcome employing reward shaping. Additionally, the necessity of multiple layers of non-liniearities in the DQN algorithm is empirically evaluated using our scenarios. The results support the usage of multiple levels of non-linearities, as a linear variant of the DQN is not capable of learning effective policies in our experiments. Due to a weight initialization with behavioral cloning, an acceleration of the learning procedure is achieved.

Full text and
other links
Volltext
Department(s)University of Stuttgart, Institute of Parallel and Distributed Systems, Machine Learning und Robotics
Superviser(s)Toussaint, Prof. Marc; Schmitt, Dr. Felix
Entry dateApril 6, 2022
New Report   New Article   New Monograph   Computer Science