An FPGA Smart Camera Implementation of Segmentation Models for Drone Wildfire Imagery

Garduño, Eduardo; Ciprián-Sánchez, Jorge; Vázquez-García, Valente; González-Mendoza, Miguel; Rodríguez-Hernández, Gerardo; Palacios-Rosas, Adriana; Rossi-Tisson, Lucile; Ochoa-Ruiz, Gilberto

doi:10.13053/cys-27-4-4773

Servicios Personalizados

Revista

Articulo

Indicadores

Citado por SciELO
Accesos

Links relacionados

Similares en SciELO

Permalink

Computación y Sistemas

versión On-line ISSN 2007-9737versión impresa ISSN 1405-5546

Resumen

GARDUNO, Eduardo et al. An FPGA Smart Camera Implementation of Segmentation Models for Drone Wildfire Imagery. Comp. y Sist. [online]. 2023, vol.27, n.4, pp.965-977. Epub 17-Mayo-2024. ISSN 2007-9737. https://doi.org/10.13053/cys-27-4-4773.

Wildfires represent one of the most relevant natural disasters worldwide, due to their impact on various societal and environmental levels. Thus, a significant amount of research has been carried out to investigate and apply computer vision techniques to address this problem. One of the most promising approaches for wildfire fighting is the use of drones equipped with visible and infrared cameras for the detection, monitoring, and fire spread assessment in a remote manner but in close proximity to the affected areas. However, implementing effective computer vision algorithms on board is often prohibitive since deploying full-precision deep learning models running on GPU is not a viable option, due to their high power consumption and the limited payload a drone can handle. Thus, in this work, we posit that smart cameras, based on low-power consumption field-programmable gate arrays (FPGAs), in tandem with binarized neural networks (BNNs), represent a cost-effective alternative for implementing onboard computing on the edge. Herein we present the implementation of a segmentation model applied to the Corsican Fire Database. We optimized an existing U-Net model for such a task and ported the model to an edge device (a Xilinx Ultra96-v2 FPGA). By pruning and quantizing the original model, we reduce the number of parameters by 90%. Furthermore, additional optimizations enabled us to increase the throughput of the original model from 8 frames per second (FPS) to 33.63 FPS without loss in the segmentation performance: our model obtained 0.912 in Matthews correlation coefficient (MCC), 0.915 in F1 score and 0.870 in Hafiane quality index (HAF), and comparable qualitative segmentation results when contrasted to the original full-precision model. The final model was integrated into a low-cost FPGA, which was used to implement a neural network accelerator.

Palabras llave : SoC FPGA; computer vision; segmentation; binarized neural networks; artificial intelligence; infrared imaging; pruning.

· texto en Inglés