Please use this identifier to cite or link to this item:
http://arks.princeton.edu/ark:/88435/dsp01s4655k92r
Title: | Enhancing Deep Neural Networks in Diverse Resource-Constrained Hardware Settings |
Authors: | Tuli, Shikhar |
Advisors: | Jha, Niraj K. |
Contributors: | Electrical and Computer Engineering Department |
Keywords: | Bayesian Optimization Electronic Design Automation Machine Learning |
Subjects: | Computer engineering Electrical engineering |
Issue Date: | 2024 |
Publisher: | Princeton, NJ : Princeton University |
Abstract: | Over the past decade, artificial intelligence (AI) has gained significant interest in industry and academia. Deep neural network (DNN) models have exploded in size over the years. Wider access to graphical processing units (GPUs) and machine learning (ML) accelerators, along with increasing dataset sizes, have fueled this growth. However, efficient evaluation of such models on hardware in terms of accuracy, peak power draw, energy consumption, and chip area of the integrated circuit remains challenging, requiring long design cycles and domain expertise. Increasing model sizes exacerbate this problem. In this thesis, we propose a set of frameworks targeting this challenge from various perspectives. We propose FlexiBERT, the first wide-scale design space for heterogeneous and flexible transformer architectures. We then propose AccelTran, a state-of-the-art transformer accelerator. Taking motivation from AccelTran, we propose ELECTOR, a design space of transformer accelerators, and implement transformer-accelerator co-design leveraging our co-design technique, namely BOSHCODE. We also propose EdgeTran, a co-search technique to find out the best-performing pair, i.e., the transformer model and the edge-AI device. We apply this framework to convolutional neural networks (CNNs) as well (CODEBench). Finally, we discuss two extensions of BOSHCODE: DINI for data imputation and BREATHE for generic multi-objective optimization in vector and graphical search spaces. These works expand the scope of the proposed approach to a much more diverse set of applications. |
URI: | http://arks.princeton.edu/ark:/88435/dsp01s4655k92r |
Type of Material: | Academic dissertations (Ph.D.) |
Language: | en |
Appears in Collections: | Electrical Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Tuli_princeton_0181D_14849.pdf | 13.05 MB | Adobe PDF | View/Download |
Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.