An FPGA Smart Camera Implementation of Segmentation Models for Drone Wildfire Imagery

Garduño, Eduardo; Ciprián-Sánchez, Jorge; Vázquez-García, Valente; González-Mendoza, Miguel; Rodríguez-Hernández, Gerardo; Palacios-Rosas, Adriana; Rossi-Tisson, Lucile; Ochoa-Ruiz, Gilberto

doi:10.13053/cys-27-4-4773

Services on Demand

Journal

Article

Indicators

Cited by SciELO
Access statistics

Computación y Sistemas

On-line version ISSN 2007-9737Print version ISSN 1405-5546

Abstract

GARDUNO, Eduardo et al. An FPGA Smart Camera Implementation of Segmentation Models for Drone Wildfire Imagery. Comp. y Sist. [online]. 2023, vol.27, n.4, pp.965-977. Epub May 17, 2024. ISSN 2007-9737. https://doi.org/10.13053/cys-27-4-4773.

Wildfires represent one of the most relevant natural disasters worldwide, due to their impact on various societal and environmental levels. Thus, a significant amount of research has been carried out to investigate and apply computer vision techniques to address this problem. One of the most promising approaches for wildfire fighting is the use of drones equipped with visible and infrared cameras for the detection, monitoring, and fire spread assessment in a remote manner but in close proximity to the affected areas. However, implementing effective computer vision algorithms on board is often prohibitive since deploying full-precision deep learning models running on GPU is not a viable option, due to their high power consumption and the limited payload a drone can handle. Thus, in this work, we posit that smart cameras, based on low-power consumption field-programmable gate arrays (FPGAs), in tandem with binarized neural networks (BNNs), represent a cost-effective alternative for implementing onboard computing on the edge. Herein we present the implementation of a segmentation model applied to the Corsican Fire Database. We optimized an existing U-Net model for such a task and ported the model to an edge device (a Xilinx Ultra96-v2 FPGA). By pruning and quantizing the original model, we reduce the number of parameters by 90%. Furthermore, additional optimizations enabled us to increase the throughput of the original model from 8 frames per second (FPS) to 33.63 FPS without loss in the segmentation performance: our model obtained 0.912 in Matthews correlation coefficient (MCC), 0.915 in F1 score and 0.870 in Hafiane quality index (HAF), and comparable qualitative segmentation results when contrasted to the original full-precision model. The final model was integrated into a low-cost FPGA, which was used to implement a neural network accelerator.

Keywords : SoC FPGA; computer vision; segmentation; binarized neural networks; artificial intelligence; infrared imaging; pruning.

· text in English · English (

pdf )