Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01s4655k92r
Full metadata record
dc.contributor.advisor: Jha, Niraj K.
dc.contributor.author: Tuli, Shikhar
dc.contributor.other: Electrical and Computer Engineering Department
dc.date.accessioned: 2024-02-21T17:21:08Z
dc.date.available: 2024-02-21T17:21:08Z
dc.date.created: 2023-01-01
dc.date.issued: 2024
dc.identifier.uri: http://arks.princeton.edu/ark:/88435/dsp01s4655k92r
dc.description.abstract: Over the past decade, artificial intelligence (AI) has gained significant interest in industry and academia. Deep neural network (DNN) models have exploded in size over the years. Wider access to graphics processing units (GPUs) and machine learning (ML) accelerators, along with increasing dataset sizes, has fueled this growth. However, efficient evaluation of such models on hardware, in terms of accuracy, peak power draw, energy consumption, and integrated-circuit chip area, remains challenging, requiring long design cycles and domain expertise. Increasing model sizes exacerbate this problem. In this thesis, we propose a set of frameworks that target this challenge from various perspectives. We propose FlexiBERT, the first wide-scale design space for heterogeneous and flexible transformer architectures. We then propose AccelTran, a state-of-the-art transformer accelerator. Motivated by AccelTran, we propose ELECTOR, a design space of transformer accelerators, and implement transformer-accelerator co-design leveraging our co-design technique, BOSHCODE. We also propose EdgeTran, a co-search technique that finds the best-performing pair of transformer model and edge-AI device. We apply this framework to convolutional neural networks (CNNs) as well, in CODEBench. Finally, we discuss two extensions of BOSHCODE: DINI for data imputation and BREATHE for generic multi-objective optimization in vector and graphical search spaces. These works expand the scope of the proposed approach to a much more diverse set of applications.
dc.format.mimetype: application/pdf
dc.language.iso: en
dc.publisher: Princeton, NJ : Princeton University
dc.subject: Bayesian Optimization
dc.subject: Electronic Design Automation
dc.subject: Machine Learning
dc.subject.classification: Computer engineering
dc.subject.classification: Electrical engineering
dc.title: Enhancing Deep Neural Networks in Diverse Resource-Constrained Hardware Settings
dc.type: Academic dissertations (Ph.D.)
pu.date.classyear: 2024
pu.department: Electrical and Computer Engineering
Appears in Collections: Electrical Engineering

Files in This Item:
File: Tuli_princeton_0181D_14849.pdf
Size: 13.05 MB
Format: Adobe PDF
Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.