欢迎访问《兵工学报》官方网站,今天是 分享到:

兵工学报 ›› 2023, Vol. 44 ›› Issue (7): 2197-2206.doi: 10.12382/bgxb.2022.0367

• • 上一篇    

面向战场环境下的语种识别

华英杰, 刘晶, 邵玉斌*(), 朵琳   

  1. 昆明理工大学 信息工程与自动化学院, 云南 昆明 650500
  • 收稿日期:2022-05-11 上线日期:2023-07-30
  • 通讯作者:
  • 基金资助:
    国家自然科学基金项目(61962032); 云南省科技厅优秀青年项目(202001AW07000)

Language Identification in Battlefield Environments

HUA Yingjie, LIU Jing, SHAO Yubin*(), DUO Lin   

  1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, Yunnan, China
  • Received:2022-05-11 Online:2023-07-30

摘要:

为实现语种识别在战场环境下保持较高的识别性能,提出一种基于语谱图灰度变换的语种识别方法。根据语音信息和战场环境下的噪声信息在语谱图上的分布特性,引入带通滤波;根据人耳听觉特性提取对数灰度语谱图;采用自动色阶算法抑制语谱图上的噪声信息,增强语种信息,并采用残差神经网络模型进行训练识别。实验结果表明:在-10dB掠夺者战斗机驾驶舱噪声环境下,相对于线性灰度语谱图特征,识别正确率提升了46%;在其他噪声环境下,识别性能也大幅度提升。

关键词: 语种识别, 对数灰度语谱图, 自动色阶算法, 残差神经网络

Abstract:

To achieve accurate language identification in battlefield environments, a language identification method based on spectrogram gray transformation is proposed. Bandpass filtering is introduced based on the distribution characteristics of speech information and noise information in the spectrogram under battlefield noise conditions. Logarithmic gray spectrogram is extracted in line with human auditory characteristics. An automatic color adjustment algorithm is used to suppress noise information and enhance language information on the spectrogram, and a residual neural network model is used for training and identification. Experimental results show that compared with linear gray spectrogram features, the recognition accuracy is improved by 46% in the -10dB Predator fighter cockpit noise environment. In other noise environments, the recognition performance is also greatly improved.

Key words: language identification, logarithmic grayscale spectrogram, automatic tone scale algorithm, residual neural network