Accurate, Low-latency, Efficient SAR Automatic Target Recognition on FPGA

Zhang, Bingyi; Kannan, Rajgopal; Prasanna, Viktor; Busart, Carl

doi:10.1109/FPL57034.2022.00013

Full-text links:

Download:

Current browse context:

cs.AR

< prev | next >

new | recent | 2301

Computer Science > Hardware Architecture

Title: Accurate, Low-latency, Efficient SAR Automatic Target Recognition on FPGA

Authors: Bingyi Zhang, Rajgopal Kannan, Viktor Prasanna, Carl Busart

(Submitted on 4 Jan 2023)

Abstract: Synthetic aperture radar (SAR) automatic target recognition (ATR) is the key technique for remote-sensing image recognition. The state-of-the-art convolutional neural networks (CNNs) for SAR ATR suffer from \emph{high computation cost} and \emph{large memory footprint}, making them unsuitable to be deployed on resource-limited platforms, such as small/micro satellites. In this paper, we propose a comprehensive GNN-based model-architecture {co-design} on FPGA to address the above issues. \emph{Model design}: we design a novel graph neural network (GNN) for SAR ATR. The proposed GNN model incorporates GraphSAGE layer operators and attention mechanism, achieving comparable accuracy as the state-of-the-art work with near $1/100$ computation cost. Then, we propose a pruning approach including weight pruning and input pruning. While weight pruning through lasso regression reduces most parameters without accuracy drop, input pruning eliminates most input pixels with negligible accuracy drop. \emph{Architecture design}: to fully unleash the computation parallelism within the proposed model, we develop a novel unified hardware architecture that can execute various computation kernels (feature aggregation, feature transformation, graph pooling). The proposed hardware design adopts the Scatter-Gather paradigm to efficiently handle the irregular computation {patterns} of various computation kernels. We deploy the proposed design on an embedded FPGA (AMD Xilinx ZCU104) and evaluate the performance using MSTAR dataset. Compared with the state-of-the-art CNNs, the proposed GNN achieves comparable accuracy with $1/3258$ computation cost and $1/83$ model size. Compared with the state-of-the-art CPU/GPU, our FPGA accelerator achieves $14.8\times$/$2.5\times$ speedup (latency) and is $62\times$/$39\times$ more energy efficient.

Subjects:	Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
DOI:	10.1109/FPL57034.2022.00013
Cite as:	arXiv:2301.01454 [cs.AR]
	(or arXiv:2301.01454v1 [cs.AR] for this version)

Submission history

From: Bingyi Zhang [view email]
[v1] Wed, 4 Jan 2023 05:35:30 GMT (2279kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2301.01454

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Hardware Architecture

Title: Accurate, Low-latency, Efficient SAR Automatic Target Recognition on FPGA

Submission history