Learning Algorithms for Intelligent Agents and Mechanisms

Rahme, Jad

Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01j67316994

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Adams, Ryan P
dc.contributor.author	Rahme, Jad
dc.contributor.other	Applied and Computational Mathematics Department
dc.date.accessioned	2022-10-10T19:50:09Z	-
dc.date.available	2022-10-10T19:50:09Z	-
dc.date.created	2022-01-01
dc.date.issued	2022
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/dsp01j67316994	-
dc.description.abstract	The ability to learn from past experiences and adapt one’s behavior accordingly within an environment or context to achieve a certain goal is a characteristic of a truly intelligent entity. Developing efficient, robust, and reliable learning algorithms towards that end is an active area of research and a major step towards achieving artificial general intelligence. In this thesis, we research learning algorithms for optimal decision making in two different contexts, Reinforcement Learning in Part I and Auction Design in Part II. Reinforcement learning (RL) is an area of machine learning that is concerned with how an agent should act in an environment in order to maximize its cumulative reward over time. In Chapter 2, inspired by statistical physics, we develop a novel approach to RL that not only learns optimal policies with enhanced desirable properties but also sheds new light on maximum entropy RL. In Chapter 3, we tackle the generalization problem in RL using a Bayesian perspective. We show that imperfect knowledge of the environment’s dynamics effectively turn a fully-observed Markov Decision Process (MDP) into a Partially Observed MDP (POMDP) that we call the Epistemic POMDP. Informed by this observation, we develop a new policy learning algorithm LEEP which has improved generalization properties. An auction is the process of organizing the buying and selling of products and services that is of great practical importance. Designing an incentive compatible, individually rational auction that maximizes revenue is a challenging and intractable problem. Recently, a deep learning based approach was proposed to learn optimal auctions from data. While successful, this approach suffers from a few limitations, including sample inefficiency, lack of generalization to new auctions, and training difficulties. In Chapter 4, we construct a symmetry preserving neural network architecture, EquivariantNet, suitable for anonymous auctions. EquivariantNet is not only more sample efficient but is also able to learn auction rules that generalize well to other settings. In Chapter 5, we propose a novel formulation of the auction learning problem as a two player game. The resulting learning algorithm, ALGNet, is easier to train, more reliable and better suited for non stationary settings.
dc.format.mimetype	application/pdf
dc.language.iso	en
dc.publisher	Princeton, NJ : Princeton University
dc.relation.isformatof	The Mudd Manuscript Library retains one bound copy of each dissertation. Search for these copies in the library's main catalog: <a href=http://catalog.princeton.edu>catalog.princeton.edu</a>
dc.subject	Bayesian Statistics
dc.subject	Deep Learning
dc.subject	Game Theory
dc.subject	Mechanism Design
dc.subject	Reinforcement Learning
dc.subject	Statistical Physics
dc.subject.classification	Artificial intelligence
dc.subject.classification	Applied mathematics
dc.title	Learning Algorithms for Intelligent Agents and Mechanisms
dc.type	Academic dissertations (Ph.D.)
pu.date.classyear	2022
pu.department	Applied and Computational Mathematics
Appears in Collections:	Applied and Computational Mathematics

Files in This Item:

File	Description	Size	Format
Rahme_princeton_0181D_14181.pdf		5.12 MB	Adobe PDF	View/Download

Show simple item record

Search

Browse