Deconstructing Mozart: A GAN-Style Approach to Raw Audio Processing and Generation

Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01vh53wz49h

Title:	Deconstructing Mozart: A GAN-Style Approach to Raw Audio Processing and Generation
Authors:	Kim, DG
Advisors:	Jha, Niraj
Department:	Electrical Engineering
Certificate Program:	Applications of Computing Program
Class Year:	2018
Abstract:	We propose a new system for raw audio processing and generation which combines the success of Generative Adversarial Networks (GANs) and WaveNet, a generative model created by DeepMind. This system will generate piano music using only sample piano audio files that are given to it. The use of GANs together with raw audio, rather than derived features, makes this project unique in the realm of machine learning. Most previous works that compose music through artificial means utilize representations of music, rather than raw audio, due to computational complexity, and previous works that use GANs often solve the problem of image synthesis. Current progress of the fully implemented system with WaveNet and GANs are able to nearly mimic the frequency domain characteristics of the inputted data.
URI:	http://arks.princeton.edu/ark:/88435/dsp01vh53wz49h
Type of Material:	Princeton University Senior Theses
Language:	en
Appears in Collections:	Electrical and Computer Engineering, 1932-2023

Files in This Item:

File	Description	Size	Format
KIM-DG-THESIS.pdf		2.53 MB	Adobe PDF	Request a copy

Search

Browse