Abstract:
Successful reinforcement learning requires large amounts of data, compute, and some luck. We explore the ability of abstractions to reduce these dependencies. Abstractions for reinforcement learning share the goals of this abstract: to capture the essential details while leaving out the unimportant. Throwing away inessential details leaves less to compute, less to explore, and less variance in observations. But does this always aid reinforcement learning? More specifically, we start by looking for abstractions that are easily solvable. This leads us to a type of linear abstraction. We show that, while it does allow efficient solutions, it also gives erroneous solutions in the general case.
We then attempt to improve the sample efficiency of a reinforcement learner. We do so by constructing a measure of symmetry and using it as an inductive bias. We design and run experiments to test the advantage provided by this inductive bias, but must leave conclusions to future work.