Extracting Cognition out of Images for the Purpose of Autonomous Driving

Chen, Chenyi

Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp012j62s727w

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Kornhauser, Alain Lucien	-
dc.contributor.author	Chen, Chenyi	-
dc.contributor.other	Operations Research and Financial Engineering Department	-
dc.date.accessioned	2016-06-08T18:40:53Z	-
dc.date.available	2016-06-08T18:40:53Z	-
dc.date.issued	2016	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/dsp012j62s727w	-
dc.description.abstract	Autonomous driving is a broadly recognized solution to serious traffic problems such as accidents and congestions. It is a very broad topic that extends across cognition, artificial intelligence, and control. While this thesis primarily focuses on the cognition aspect, others are also considered. Here, the thesis proposes several computer vision algorithms for autonomous driving, encompassing three major parts: In part one, experiments on motion-based object recognition are presented. The proposed method differentiates objects according to their speed. In part two, the artificial intelligence aspect of autonomous driving is considered. Research on training an autonomous driving AI agent through reinforcement learning is introduced. In part three, the key part of the thesis, a direct perception approach is proposed to drive a car in a highway environment. In this approach, an input image is mapped to a small number of key perception indicators that directly relate to the affordance of a road/traffic state for driving. This representation provides a set of compact yet complete descriptions of the scene to enable a simple controller to drive autonomously on highways. Using synthetic images from a virtual environment, a deep convolutional neural network (ConvNet) is trained for direct perception. Experiments show that the model can effectively drive a car in a very diverse set of virtual environments, and it provides good estimation of affordance indicators from real driving images. To further improve the performance of the direct perception-based system, the issue of temporal information is considered by studying the Long Short Term Memory (LSTM) unit and its influence on the affordance indicator estimation. Quantitative results show that adding the LSTM unit does help to improve the system's performance. Finally, as object detection is closely related to autonomous driving, in Appendix A a deep learning-based small object detection approach is proposed. The applicability of the state-of-the-art object detection algorithms to the small object detection task is studied.	-
dc.language.iso	en	-
dc.publisher	Princeton, NJ : Princeton University	-
dc.relation.isformatof	The Mudd Manuscript Library retains one bound copy of each dissertation. Search for these copies in the library's main catalog: http://catalog.princeton.edu/	-
dc.subject	Artificial Intelligence	-
dc.subject	Autonomous Driving	-
dc.subject	Computer Vision	-
dc.subject	Deep Learning	-
dc.subject	Machine Learning	-
dc.subject.classification	Artificial intelligence	-
dc.subject.classification	Computer science	-
dc.title	Extracting Cognition out of Images for the Purpose of Autonomous Driving	-
dc.type	Academic dissertations (Ph.D.)	-
pu.projectgrantnumber	690-2143	-
Appears in Collections:	Operations Research and Financial Engineering

Files in This Item:

File	Description	Size	Format
Chen_princeton_0181D_11760.pdf		34.44 MB	Adobe PDF	View/Download

Show simple item record

Search

Browse