Please use this identifier to cite or link to this item:
http://arks.princeton.edu/ark:/88435/dsp01xs55mg43d
Title: | The Development of a Drum Identifier and Transcription Tool |
Authors: | Larsen, Louis |
Advisors: | Finkelstein, Adam |
Department: | Computer Science |
Class Year: | 2024 |
Abstract: | Automatic Drum Transcription (ADT) is the process of turning audio recordings of percussion instruments into another representation, typically sheet music or a MIDI file. We propose a novel approach to ADT using a CNN and a generic 2-layer Hierarchical Encoder-Decoder Transformer, which analyzes the drum audio in both the frequency and time axis. While other sub-areas of Automatic Music Transcription (AMT) use Transformers like these, ADT still relies heavily on complex, meticulously tuned CNNs. We evaluated the model with the Expanded Midi Groove Dataset (E-GMD) and IMST dataset and achieved state-of-the-art results with regard to Drum classification, Onset, Offset, and Velocity Estimations. |
URI: | http://arks.princeton.edu/ark:/88435/dsp01xs55mg43d |
Type of Material: | Princeton University Senior Theses |
Language: | en |
Appears in Collections: | Computer Science, 1987-2024 |
Files in This Item:
File | Size | Format | |
---|---|---|---|
LARSEN-LOUIS-THESIS.pdf | 762.72 kB | Adobe PDF | Request a copy |
Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.