Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01p5547v675
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorHeide, Felix-
dc.contributor.authorMcManamon, Brendan-
dc.date.accessioned2023-08-08T12:22:00Z-
dc.date.available2023-08-08T12:22:00Z-
dc.date.created2023-04-12-
dc.date.issued2023-08-08-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/dsp01p5547v675-
dc.description.abstractObject detection based on RGB images alone often suffers due to a lack of illumination or other environmental conditions, while thermal infrared cameras generally succeed in those exact challenging scenarios. In the realm of RGB-thermal sensor fusion, previous research has been conducted using a variety of network architectures and fusion techniques. Using a YOLOv7 architecture, this work uses pixel-level early fusion and ensemble late fusion to compare against single-sensor models, trained and evaluated on the Teledyne FLIR dataset. Results indicate that pixel-level fusion significantly outperforms RGB and thermal models while maintaining latency in the 8 ms range on a Tesla V100 with an overall mAP of 92.8%, while the ensemble approach performed below baseline and more than doubled latency.en_US
dc.format.mimetypeapplication/pdf
dc.language.isoenen_US
dc.titleRGB-Thermal Fusion for Improved Object Detectionen_US
dc.typePrinceton University Senior Theses
pu.date.classyear2023en_US
pu.departmentElectrical and Computer Engineeringen_US
pu.pdf.coverpageSeniorThesisCoverPage
pu.contributor.authorid920228126
pu.mudd.walkinNoen_US
Appears in Collections:Electrical and Computer Engineering, 1932-2023

Files in This Item:
File Description SizeFormat 
MCMANAMON-BRENDAN-THESIS.pdf5.6 MBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.