Please use this identifier to cite or link to this item:
http://arks.princeton.edu/ark:/88435/dsp01p5547v675
Title: | RGB-Thermal Fusion for Improved Object Detection |
Authors: | McManamon, Brendan |
Advisors: | Heide, Felix |
Department: | Electrical and Computer Engineering |
Class Year: | 2023 |
Abstract: | Object detection based on RGB images alone often suffers due to a lack of illumination or other environmental conditions, while thermal infrared cameras generally succeed in those exact challenging scenarios. In the realm of RGB-thermal sensor fusion, previous research has been conducted using a variety of network architectures and fusion techniques. Using a YOLOv7 architecture, this work uses pixel-level early fusion and ensemble late fusion to compare against single-sensor models, trained and evaluated on the Teledyne FLIR dataset. Results indicate that pixel-level fusion significantly outperforms RGB and thermal models while maintaining latency in the 8 ms range on a Tesla V100 with an overall mAP of 92.8%, while the ensemble approach performed below baseline and more than doubled latency. |
URI: | http://arks.princeton.edu/ark:/88435/dsp01p5547v675 |
Type of Material: | Princeton University Senior Theses |
Language: | en |
Appears in Collections: | Electrical and Computer Engineering, 1932-2023 |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
MCMANAMON-BRENDAN-THESIS.pdf | 5.6 MB | Adobe PDF | Request a copy |
Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.