欢迎访问《兵工学报》官方网站,今天是 分享到:

兵工学报 ›› 2022, Vol. 43 ›› Issue (11): 2827-2835.doi: 10.12382/bgxb.2021.0549

• 论文 • 上一篇    下一篇

面向战场环境下的语音传输与重构

邵玉斌, 刘晶, 龙华, 李一民   

  1. (昆明理工大学 信息工程与自动化学院, 云南 昆明 650500)
  • 上线日期:2022-06-30
  • 作者简介:邵玉斌(1970—),男,教授,硕士生导师。E-mail: shaoyubin@kust.edu.cn
  • 基金资助:
    国家自然科学基金项目(61761025)

Voice Transmission and Reconstruction on the Battlefield

SHAO Yubin, LIU Jing, LONG Hua, LI Yimin   

  1. (Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, Yunnan, China)
  • Online:2022-06-30

摘要: 针对语音在高压缩比及低信噪比下传输与重构质量不佳的问题,提出一种基于语谱图的语音压缩传输重构方法。在发送端将语音信号转为语谱图进行传输,再在接收端对语谱图作图像去噪处理,根据去噪后的图像恢复出语音信号的幅度谱;建立发声重构模型,用幅度谱对语音信号进行重构,实现语音恢复。实验结果表明:无噪声环境下,压缩比为10和40的条件下,重构语音质量客观平均得分达到3分以上;低信噪比条件下,压缩比为10时,重构语音质量客观平均得分也能达到2分以上。相比于传统的压缩感知语音重构算法,在高压缩比下,新方法对重构语音质量有明显改善。

关键词: 语音传输与重构, 图像增强, 发声重构模型, 压缩比及低信噪比

Abstract: A spectrogram-based reconstruction method is proposed to address the problem of poor voice transmission and reconstruction quality under conditions of high compression ratios and low signal-to-noise ratios. Speech signals are converted into spectrograms at the transmitter, which are later transmitted and denoised at the receiver. Then, the amplitude spectrum is restored from the denoised spectrogram image and the voice is reconstructed through the amplitude spectrum by the voice model. Experiments show that the perceptual evaluation of speech quality (PESQ) of the reconstructed speech exceeds 3 under noise-free environment with compression ratios of 10 and 40 respectively. The PESQ can also exceed 2 under the low signal-to-noise ratio with compression ratio of 10. The proposed method shows significant improvement in reconstructed speech quality at high compression ratios compared with the traditional algorithm.

Key words: speechtransmissionandreconstruction, imageenhancement, vocalreconstructionmodel, compressionratioandlowsignal-to-noiseratio

中图分类号: