Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp013r074z138
Title: Model-Misspecified Offline Reinforcement Learning
Authors: Ye, Naimeng
Advisors: Wang, Mengdi
Department: Mathematics
Certificate Program: Center for Statistics and Machine Learning
Class Year: 2022
Abstract: We study offline reinforcement learning (RL) with linear function approximation, a common technique often employed to lift the dependence on the size of the state space. A great volume of existing literature has studied this problem under the realizability assumption, i.e., the underlying transition/value function is indeed a linear function of given features. However, such an assumption rarely holds in practice. In sharp contrast, classic learning theory in supervised learning provides agnostic generalization bound that holds regardless of whether the given function approximation is realizable or not. In an attempt to theoretically understand the learnability question when the model is outside of the linear realm, we propose a variant of the standard Least Square Value Iteration (LSVI) algorithm and theoretically prove its efficiency. Specifically, we examine the case where the transition model of the MDP is close to having a low rank decomposition but not exactly linear. In contrast to existing works that measure model misspecification by the point-wise error, our result scales only with the expected error under the offline data distribution, a significantly weaker notion that can be much smaller than the point-wise error. Provided with a bound on this population error, we establish a data-dependent upper bound on the suboptimality of Constrained-LSVI for approximately linear MDPs. The upper bound is further accompanied by a lower bound discussion which provides some insight on the information-theoretic limits of the learning process.
URI: http://arks.princeton.edu/ark:/88435/dsp013r074z138
Type of Material: Princeton University Senior Theses
Language: en
Appears in Collections:Mathematics, 1934-2023

Files in This Item:
File Description SizeFormat 
YE-NAIMENG-THESIS.pdf283.93 kBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.