dc.contributor.advisor |
Browne, Will |
|
dc.contributor.advisor |
McCane, Brendan |
|
dc.contributor.author |
Telfar, Alexander |
|
dc.date.accessioned |
2020-07-14T02:02:28Z |
|
dc.date.available |
2020-07-14T02:02:28Z |
|
dc.date.copyright |
2020 |
|
dc.date.issued |
2020 |
|
dc.identifier.uri |
http://researcharchive.vuw.ac.nz/handle/10063/8990 |
|
dc.description.abstract |
Successful reinforcement learning requires large amounts of data, compute, and some luck. We explore the ability of abstraction(s) to reduce these dependencies. Abstractions for reinforcement learning share the goals of this abstract: to capture essential details, while leaving out the unimportant. By throwing away inessential details, there will be less to compute, less to explore, and less variance in observations. But, does this always aid reinforcement learning? More specifically, we start by looking for abstractions that are easily solvable. This leads us to a type of linear abstraction. We show that, while it does allow efficient solutions, it also gives erroneous solutions, in the general case.
We then attempt to improve the sample efficiency of a reinforcment learner. We do so by constructing a measure of symmetry and using it as an inductive bias. We design and run experiments to test the advantage provided by this inductive bias, but must leave conclusions to future work. |
en_NZ |
dc.language.iso |
en_NZ |
|
dc.publisher |
Victoria University of Wellington |
en_NZ |
dc.rights.uri |
http://creativecommons.org/licenses/by-sa/3.0/nz/ |
|
dc.subject |
reinforcement learning |
en_NZ |
dc.subject |
abstraction |
en_NZ |
dc.subject |
computational complexity |
en_NZ |
dc.title |
Absraction for Efficient Reinforcement Learning |
en_NZ |
dc.type |
Text |
en_NZ |
vuwschema.contributor.unit |
School of Engineering and Computer Science |
en_NZ |
vuwschema.type.vuw |
Awarded Research Masters Thesis |
en_NZ |
thesis.degree.discipline |
Computer Science |
en_NZ |
thesis.degree.grantor |
Victoria University of Wellington |
en_NZ |
thesis.degree.level |
Masters |
en_NZ |
thesis.degree.name |
Master of Computer Science |
en_NZ |
dc.rights.license |
Creative Commons GNU GPL |
en_NZ |
dc.rights.license |
Allow modifications, as long as others share alike |
en_NZ |
dc.rights.license |
Allow commercial use |
en_NZ |
dc.date.updated |
2020-07-01T23:06:53Z |
|
vuwschema.subject.anzsrcfor |
080199 Artificial Intelligence and Image Processing not elsewhere classified |
en_NZ |
vuwschema.subject.anzsrcfor |
080101 Adaptive Agents and Intelligent Robotics |
en_NZ |
vuwschema.subject.anzsrcseo |
970109 Expanding Knowledge in Engineering |
en_NZ |
vuwschema.subject.anzsrcseo |
970110 Expanding Knowledge in Technology |
en_NZ |
vuwschema.subject.anzsrctoa |
2 STRATEGIC BASIC RESEARCH |
en_NZ |