Replicating Audio Features: An Application of Machine Learning Concepts

Phillips, Aqeel

Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01rx913s519

Title:	Replicating Audio Features: An Application of Machine Learning Concepts
Authors:	Phillips, Aqeel
Advisors:	Engelhardt, Barbara
Department:	Computer Science
Class Year:	2017
Abstract:	Researchers have found several applications for machine learning systems to extract, analyze, and replicate musical features. However, most approaches hinge on the reduction of a musical piece to its compositional aspects, rather than the raw audio of a musical recording (or any arbitrary audio recording). We sought to explore what applications machine learning concepts might have in music and audio analysis beyond compositional aspects. To exemplify this, a system was designed and implemented to leverage machine learning concepts and both extract and replicate empirical data about audio recordings’ timbres. To accomplish this, the “timbre” of an audio recording was defined by its frequency spectrum, i.e. the relative amplitudes of certain frequencies in the recording. The system is able to extract timbral data from a large number of sources, reduce the data to a useful core, and modify arbitrary audio recordings to replicate the timbre. From a qualitative standpoint, modified recordings generated by the system were convincing and encouraging, but from an empirical standpoint, there is significant room for improvement. However, this project proves that machine learning concepts can have usefulness beyond strictly musical analysis and in the realm of general audio analysis and modification.
URI:	http://arks.princeton.edu/ark:/88435/dsp01rx913s519
Type of Material:	Princeton University Senior Theses
Language:	en_US
Appears in Collections:	Computer Science, 1987-2023

Files in This Item:

File	Size	Format
written_final_report.pdf	1.19 MB	Adobe PDF	Request a copy

Show full item record

Search

Browse