Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp013197xq26c
Full metadata record
dc.contributor.advisor: Arora, Sanjeev
dc.contributor.author: Luo, Yuping
dc.contributor.other: Computer Science Department
dc.date.accessioned: 2022-10-10T19:50:38Z
dc.date.available: 2022-10-10T19:50:38Z
dc.date.created: 2022-01-01
dc.date.issued: 2022
dc.identifier.uri: http://arks.princeton.edu/ark:/88435/dsp013197xq26c
dc.description.abstract: Recent advances in deep reinforcement learning have demonstrated its great potential for real-world problems. However, two concerns prevent reinforcement learning from being widely applied: efficiency and efficacy. This dissertation studies how to improve the efficiency and efficacy of reinforcement learning by designing deep model-based algorithms. Access to a dynamics model empowers an algorithm to plan, which is key to sequential decision making. This dissertation covers four topics: online reinforcement learning, the expressivity of neural networks in deep reinforcement learning, offline reinforcement learning, and safe reinforcement learning. For online reinforcement learning, we present an algorithmic framework with theoretical guarantees that utilizes a lower bound on the real-environment performance of a policy trained in the learned environment, and we empirically verify the efficiency of the proposed method. For the expressivity of neural networks in deep reinforcement learning, we prove that in some scenarios model-based approaches can require much less representation power than model-free approaches to approximate a near-optimal policy, and we empirically show that this can be an issue in simulated robotics environments and that a model-based planner can help. For offline reinforcement learning, we devise an algorithm that keeps the policy close to the provided expert demonstration set to reduce distribution shift, and we conduct experiments demonstrating that our method improves the success rate of robotic arm manipulation tasks in simulated environments. For safe reinforcement learning, we propose a method that uses the learned dynamics model to certify safe states; our experiments show that it can learn a decent policy without a single safety violation during training on a set of simple but challenging tasks, while baseline algorithms incur hundreds of safety violations.
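The online-RL framework described in the abstract rests on a conservative estimate: the return a policy achieves in the learned model, minus a penalty that grows with model error, lower-bounds its real-environment return. The toy below is a minimal sketch of that idea only, not the dissertation's actual algorithm; the 1-D dynamics, the policy, and the penalty coefficient `lam` are all hypothetical choices for illustration.

```python
import numpy as np

def rollout_return(step_fn, policy, s0=0.0, horizon=20, gamma=0.99):
    """Discounted return of `policy` under dynamics `step_fn`."""
    s, ret, disc = s0, 0.0, 1.0
    for _ in range(horizon):
        a = policy(s)
        s = step_fn(s, a)
        ret += disc * -(s - 1.0) ** 2   # reward: drive the state toward 1
        disc *= gamma
    return ret

true_step    = lambda s, a: s + 0.10 * a   # real environment (unknown to the agent)
learned_step = lambda s, a: s + 0.09 * a   # slightly wrong learned model
policy       = lambda s: float(np.clip(1.0 - s, -1.0, 1.0))

# One-step model error over a grid of states, standing in for a
# discrepancy measure over the visited state distribution.
states = np.linspace(-1.0, 2.0, 31)
eps = max(abs(true_step(s, policy(s)) - learned_step(s, policy(s))) for s in states)

lam = 50.0                                 # penalty coefficient (hypothetical)
model_ret   = rollout_return(learned_step, policy)
lower_bound = model_ret - lam * eps        # conservative performance estimate
real_ret    = rollout_return(true_step, policy)

print(f"model return {model_ret:.3f}, lower bound {lower_bound:.3f}, "
      f"real return {real_ret:.3f}")
```

Optimizing the policy against such a penalized objective (rather than the raw model return) is what makes improvement in the learned environment translate into guaranteed improvement in the real one.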
dc.format.mimetype: application/pdf
dc.language.iso: en
dc.publisher: Princeton, NJ : Princeton University
dc.relation.isformatof: The Mudd Manuscript Library retains one bound copy of each dissertation. Search for these copies in the library's main catalog: <a href=http://catalog.princeton.edu>catalog.princeton.edu</a>
dc.subject: Deep Learning
dc.subject: Machine Learning
dc.subject: Reinforcement Learning
dc.subject.classification: Computer science
dc.subject.classification: Artificial intelligence
dc.title: Towards Efficient and Effective Deep Model-based Reinforcement Learning
dc.type: Academic dissertations (Ph.D.)
pu.date.classyear: 2022
pu.department: Computer Science
Appears in Collections: Computer Science

Files in This Item:
Luo_princeton_0181D_14201.pdf (4.11 MB, Adobe PDF)


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.