Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01vt150n551
Title: Using an instruction interface for compiling deep learning applications to the NVIDIA Deep Learning Accelerator (NVDLA)
Authors: Dhome-Casanova, Thomas
Advisors: Malik, Sharad
Department: Electrical and Computer Engineering
Certificate Program: Center for Statistics and Machine Learning
Class Year: 2023
Abstract: Advances in neural networks are increasingly enhancing or replicating human abilities. Currently, consumers train neural networks and run inference on CPUs and GPUs. Some commercial uses of deep learning accelerators exist and offload supported operations to faster hardware. However, due to specialized drivers and compilers, these are not portable or applicable for widespread use. This thesis extends the D2A methodology beyond the current set of supported accelerators, FlexASR, HLSCNN and VTA, to include the more complex NVIDIA Deep Learning Accelerator (NVDLA). This enables a range of applications written in domain specific languages, such as TensorFlow, to be automatically compiled for the NVDLA and thus makes the NVDLA more widely usable. Using only direct neural network to NVDLA operation matches, 65 of the 69 neural network operations in the popular ResNet-18 image classification model could be offloaded. One additional operator was matched when the semantic rewrite rules of D2A were also included. The accuracy of offloaded individual neural network layers as well as the full direct-matched model was successfully validated against the CPU by using existing NVDLA simulators. Overall, the NVDLA’s large set of supported operators allows for a significant number of accurate offloads in convolutional models. Further, D2A’s combined semantic and accelerator specific rewrites result in more offloaded operators than simple direct matching. This advantage would be larger for less optimally expressed models.
URI: http://arks.princeton.edu/ark:/88435/dsp01vt150n551
Type of Material: Princeton University Senior Theses
Language: en
Appears in Collections:Electrical and Computer Engineering, 1932-2023

Files in This Item:
File Description SizeFormat 
DHOME-CASANOVA-THOMAS-THESIS.pdf2.07 MBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.