Acta Armamentarii, 2023, Vol. 44, Issue 7: 2197-2206. doi: 10.12382/bgxb.2022.0367
HUA Yingjie, LIU Jing, SHAO Yubin*, DUO Lin
Received: 2022-05-11
Online: 2023-07-30
Contact: SHAO Yubin
HUA Yingjie, LIU Jing, SHAO Yubin, DUO Lin. Language Identification in Battlefield Environments[J]. Acta Armamentarii, 2023, 44(7): 2197-2206.
α | β = 0.30 | β = 0.35 | β = 0.40
---|---|---|---
0.50 | 73.74 | 74.68 | 76.18
0.45 | 72.72 | 79.62 | 78.70
0.40 | 72.92 | 73.12 | 73.12
Table 1 Average recognition rate (%) for different values of α and β
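As a reading aid for Table 1, the following minimal sketch picks the (α, β) pair with the highest average recognition rate. The numbers are copied from the table; the names `rates` and `best_setting` are illustrative and do not come from the paper, and this is not the authors' selection procedure.

```python
# Average recognition rates (%) copied from Table 1, keyed by (alpha, beta).
rates = {
    (0.50, 0.30): 73.74, (0.50, 0.35): 74.68, (0.50, 0.40): 76.18,
    (0.45, 0.30): 72.72, (0.45, 0.35): 79.62, (0.45, 0.40): 78.70,
    (0.40, 0.30): 72.92, (0.40, 0.35): 73.12, (0.40, 0.40): 73.12,
}


def best_setting(table):
    """Return the ((alpha, beta), rate) entry with the highest rate."""
    return max(table.items(), key=lambda item: item[1])


(best_alpha, best_beta), best_rate = best_setting(rates)
print(f"alpha={best_alpha}, beta={best_beta}, rate={best_rate}%")
# prints: alpha=0.45, beta=0.35, rate=79.62%
```

Among the nine tabulated combinations, α = 0.45 with β = 0.35 gives the highest value (79.62%).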
Noise source | Identification method | SNR = -10 dB | SNR = -5 dB | SNR = 0 dB | SNR = 5 dB | SNR = 10 dB
---|---|---|---|---|---|---
WN | Fbank | 20.0 | 26.3 | 32.5 | 63.6 | 71.3
WN | LGSS | 20.0 | 22.8 | 55.5 | 68.3 | 73.7
WN | DBF | 20.7 | 22.5 | 68.4 | 79.7 | 83.7
WN | FRSCIRF | 37.7 | 64.0 | 76.7 | 81.7 | 85.7
WN | TGSS | 22.5 | 40.7 | 64.5 | 80.0 | 83.5
WN | FTGSS | 26.6 | 42.3 | 69.7 | 80.1 | 85.1
WN | FTGSSE | 61.2 | 79.8 | 83.2 | 86.0 | 87.9
DORBN | LGSS | 20.0 | 20.1 | 24.7 | 44.9 | 65.7
DORBN | TGSS | 20.4 | 27.8 | 47.7 | 73.2 | 80.4
DORBN | FTGSS | 20.7 | 44.2 | 72.9 | 79.8 | 82.6
DORBN | FTGSSE | 42.6 | 76.0 | 81.6 | 84.4 | 86.9
MVN | LGSS | 19.5 | 23.5 | 36.7 | 62.3 | 70.7
MVN | TGSS | 22.7 | 30.1 | 41.3 | 67.5 | 74.1
MVN | FTGSS | 27.0 | 38.0 | 45.1 | 70.9 | 80.5
MVN | FTGSSE | 37.8 | 51.2 | 77.7 | 84.9 | 87.2
HFCN | LGSS | 18.0 | 27.9 | 50.2 | 64.9 | 73.9
HFCN | TGSS | 20.1 | 34.9 | 59.2 | 72.7 | 78.3
HFCN | FTGSS | 21.9 | 40.2 | 65.1 | 79.4 | 84.5
HFCN | FTGSSE | 62.2 | 79.0 | 81.4 | 85.1 | 87.0
PN | LGSS | 19.8 | 31.3 | 41.5 | 68.8 | 73.2
PN | TGSS | 21.6 | 35.2 | 44.7 | 70.9 | 78.4
PN | FTGSS | 22.8 | 39.6 | 47.6 | 72.1 | 80.3
PN | FTGSSE | 37.3 | 47.0 | 77.0 | 85.4 | 86.9
VN | LGSS | 53.4 | 62.1 | 68.6 | 69.5 | 70.9
VN | TGSS | 58.2 | 65.5 | 71.9 | 74.3 | 75.8
VN | FTGSS | 62.0 | 69.2 | 75.4 | 78.7 | 79.7
VN | FTGSSE | 48.3 | 67.8 | 83.5 | 86.9 | 88.6
F16CN | LGSS | 19.7 | 22.1 | 21.4 | 36.9 | 66.6
F16CN | TGSS | 22.6 | 28.1 | 34.6 | 54.7 | 70.3
F16CN | FTGSS | 23.4 | 31.9 | 45.6 | 65.6 | 78.2
F16CN | FTGSSE | 35.3 | 43.8 | 65.1 | 83.7 | 86.4
BFCN | LGSS | 14.1 | 32.1 | 41.9 | 64.2 | 79.7
BFCN | TGSS | 21.5 | 36.4 | 43.6 | 68.1 | 80.1
BFCN | FTGSS | 24.9 | 40.7 | 47.3 | 78.9 | 80.3
BFCN | FTGSSE | 60.1 | 78.1 | 80.4 | 85.7 | 86.7
MGN | LGSS | 52.5 | 62.0 | 65.9 | 69.2 | 71.2
MGN | TGSS | 53.6 | 63.4 | 66.9 | 69.7 | 72.3
MGN | FTGSS | 54.2 | 63.7 | 66.3 | 71.3 | 75.9
MGN | FTGSSE | 49.3 | 59.5 | 67.6 | 75.3 | 83.5
Table 2 Language recognition accuracy (%) under different noise sources and signal-to-noise ratios
Noise source | Identification method | SNR = -10 dB | SNR = -5 dB | SNR = 0 dB | SNR = 5 dB | SNR = 10 dB
---|---|---|---|---|---|---
WN | LGSS | 0.14 | 0.18 | 0.52 | 0.58 | 0.63
WN | FTGSSE | 0.55 | 0.79 | 0.83 | 0.86 | 0.88
DORBN | LGSS | 0.13 | 0.16 | 0.21 | 0.39 | 0.61
DORBN | FTGSSE | 0.39 | 0.76 | 0.81 | 0.84 | 0.86
MVN | LGSS | 0.15 | 0.21 | 0.32 | 0.58 | 0.65
MVN | FTGSSE | 0.37 | 0.49 | 0.77 | 0.85 | 0.87
HFCN | LGSS | 0.13 | 0.24 | 0.47 | 0.62 | 0.71
HFCN | FTGSSE | 0.59 | 0.78 | 0.80 | 0.85 | 0.87
PN | LGSS | 0.17 | 0.28 | 0.36 | 0.64 | 0.67
PN | FTGSSE | 0.36 | 0.44 | 0.77 | 0.85 | 0.87
VN | LGSS | 0.44 | 0.58 | 0.64 | 0.64 | 0.67
VN | FTGSSE | 0.47 | 0.67 | 0.83 | 0.87 | 0.88
F16CN | LGSS | 0.16 | 0.20 | 0.23 | 0.35 | 0.62
F16CN | FTGSSE | 0.33 | 0.45 | 0.67 | 0.84 | 0.86
BFCN | LGSS | 0.11 | 0.31 | 0.39 | 0.61 | 0.78
BFCN | FTGSSE | 0.54 | 0.80 | 0.78 | 0.85 | 0.86
MGN | LGSS | 0.50 | 0.60 | 0.63 | 0.65 | 0.67
MGN | FTGSSE | 0.45 | 0.59 | 0.68 | 0.76 | 0.84
Table 3 Language recognition F1 scores under different noise sources and signal-to-noise ratios
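For readers unfamiliar with the metric in Table 3, the sketch below shows one common way to compute an F1 score for a multi-class task such as language identification. It assumes macro-averaging over classes, which this page does not confirm as the authors' choice. The function name `macro_f1` and the small confusion matrix are hypothetical and are not data from the paper.

```python
def macro_f1(confusion):
    """Macro-averaged F1 from a square confusion matrix.

    confusion[i][j] is the number of utterances whose true class is i
    and whose predicted class is j.
    """
    n = len(confusion)
    scores = []
    for k in range(n):
        tp = confusion[k][k]
        fp = sum(confusion[i][k] for i in range(n)) - tp  # predicted k, true class differs
        fn = sum(confusion[k]) - tp                       # true k, predicted class differs
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        scores.append(2 * precision * recall / denom if denom else 0.0)
    return sum(scores) / n


# Hypothetical two-language example (not taken from the paper).
example = [[1, 1],
           [0, 2]]
print(round(macro_f1(example), 4))
```

Unlike plain accuracy, this score penalizes a classifier that performs well only on some classes, which may explain why the F1 values in Table 3 sit below the corresponding accuracies in Table 2 at low SNR.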