多路径融合与多重交叉自注意力的毫米波雷达水上人体行为识别

赵楠; 常俊; 颜凌岚; 刘夕龙; 李栋; 王燕

doi:10.7540/j.ynu.20250061

多路径融合与多重交叉自注意力的毫米波雷达水上人体行为识别

Multi-path fusion and multi-cross self-attention for aquatic human activity recognition using millimeter-wave radar

摘要

摘要: 针对现有毫米波雷达水上人体行为识别方法在复杂环境中表现不佳的问题，提出一种基于交叉自注意力机制的网络模型. 首先，设计多路径特征融合模块，通过并行提取不同尺度的行为特征，实现对细粒度和全局特征的互补表达；然后，设计多重自注意力与多重交叉注意力模块，通过域内与跨域特征的深层次交互，以识别高相似度行为；最后，采用Transformer模块捕捉空间特征的全局关联性，提升模型的特征表达能力. 实验结果表明，在AHAR公开数据集上，所提模型的识别结果在准确率、召回率和F-score指标上分别达到了0.926 1，0.928 0和0.927 5，比效果最优的对比模型分别提高了0.102 4，0.102 8和0.101 7，泛化性能较其他对比模型更优.

Abstract: To address the challenges faced by current millimeter-wave radar-based aquatic human activity recognition methods in complex environments, this paper proposes a network model based on the cross-self-attention mechanism. First, a multipath feature fusion module is designed to extract behavior features at different scales in parallel, achieving complementary representation of fine-grained and global features. Next, a multi self-attention and multi cross-attention module is introduced to enable deep interactions both within and across feature domains, effectively distinguishing behaviors with high similarity. Finally, a Transformer module is employed to capture the global correlation of spatial features, further improving the model's representational capability. Experimental results on the publicly available AHAR (Aquatic Human Activity Recognition) dataset demonstrate that the proposed model achieves classification performance with an accuracy, recall, and F-score of 0.926 1, 0.928 0, and 0.927 5, respectively, surpassing the best comparative model by 0.102 4, 0.102 8, and 0.101 7. Additionally, the proposed model exhibits superior generalization performance compared to other baseline models.

HTML全文

参考文献(34)

施引文献

资源附件(0)