
浏览全部资源
扫码关注微信
1. 北京理工大学 光电学院, 北京 100081
2. 中国空间技术研究院 通信与导航卫星总体部, 北京 100094
Received:29 November 2022,
Published Online:25 September 2023,
Published:20 September 2023
移动端阅览
Peiran PENG, Shubo REN, Jianan LI, et al. Illumination-aware Multispectral Fusion Network for Pedestrian Detection[J]. Acta Armamentarii, 2023, 44(9): 2622-2630.
Peiran PENG, Shubo REN, Jianan LI, et al. Illumination-aware Multispectral Fusion Network for Pedestrian Detection[J]. Acta Armamentarii, 2023, 44(9): 2622-2630. DOI: 10.12382/bgxb.2022.1114.
多光谱行人检测在智能安防、自动驾驶等领域得到广泛应用。在光照较弱或存在遮挡的情况下
行人检测的准确性和鲁棒性仍然面临挑战。为解决这个问题
提出一种新的光照感知跨光谱融合行人检测网络。该网络利用交叉注意力和光照感知机制来充分利用多光谱特异性特征
以提高行人检测的鲁棒性和准确性。为增强两光谱之间特征表达
引入交叉注意力模块。此外提出一个光照感知子网络
它能够根据可见光和红外光谱的光照强度变化自适应地选择有效的光谱特征信息
从而提高检测系统的鲁棒性。在两个主流的多光谱行人数据集上进行了实验。实验结果显示
新方法在检测精度和检测速度方面都优于现有方法
所得成果对于提高行人检测模型的鲁棒性和泛用性具有重要意义
并在实际应用中具有广泛的潜力。
Multispectral pedestrian detection has been widely applied in scenarios such as intelligent security and autonomous driving. However
the accuracy and robustness of pedestrian detection still face challenges
especiallyin low-light conditions or in scenarios with occlusions. To address this issue
a novel pedestrian detection network is proposed
which is namedillumination-aware cross-spectral fusion network. Thenetwork leverages cross-attention and illumination-aware mechanisms to fully exploitmulti-spectral specific features
thereby improving the robustness and accuracy of pedestrian detection. To enhance feature representation between the two spectra
a cross-attention module is introduced. Additionally
an illumination-aware sub-network is proposed
which adaptively selects effective spectral feature information based on the illumination intensity variations of visible and infrared spectra
thusimproving the robustness of the detection system. Experiments areconducted on two multi-spectral pedestrian detection datasets
the KAIST dataset and the CVC-14 dataset. The experimental results demonstratethat theproposed method outperforms existing methods in terms of detection accuracy and speed. This achievementis of significant importance for enhancing the robustness and versatility of pedestrian detection models
with broad potential for practical applications.
李博 , 王博 , 韩京冶 , 等 . 基于车载计算机的红外图像移动目标检测 [J ] . 兵工学报 , 2022 , 43 ( 增刊1 ): 66 - 73 .
LI B , WANG B , HAN J Y , et al . Infrared image moving object detection technology based on onboard computer [J ] . Acta Armamentarii , 2022 , 43 ( S1 ): 66 - 73 . (in Chinese) DOI: 10.12382/bgxb.2022.A012 http://doi.org/10.12382/bgxb.2022.A012 As an important part of driverless vehicle platform,the environmental perception system is the basis and premise to realize as the path planning and decision control function. Based on the onboard computer,an infrared image moving object detection system in complex scenes is designed.The browser/server architecture is used in the detection system. After the client browser requests for services,the server applies the improved YOLOv2 algorithm to detect the infrared images transmitted by the onboard RapidIO high-speed bus in real time.the missed detection and error detection rates are effectively reduced by dimensional re-clustering and improved activation function.In order to make up for disadvantages of low contrast and weak texture features of infrared images,bilateral filtering image enhancement algorithm based on histogram equalization is proposed,which can effectively maintain the image contour details.The results show that the real-time detection accuracy of the improved YOLOv2 is increased by 4.1% compared with the original algrithm and the image preprocessing significantly improves the detection accuracy.
BILAL M , KHAN A , KHAN M U K , et al . A low-complexity pedestrian detection framework for smart video surveillance systems [J ] . IEEE Transactions on Circuits and Systems for Video Technology , 2016 , 27 ( 10 ): 2260 - 2273 . DOI: 10.1109/TCSVT.2016.2581660 http://doi.org/10.1109/TCSVT.2016.2581660 http://ieeexplore.ieee.org/document/7492605/ http://ieeexplore.ieee.org/document/7492605/
PAIS G D , DIAS T J , NASCIMENTO J C , et al . OmniDRL: robust pedestrian detection using deep reinforcement learning on omnidirectional cameras [C ] //Proceedings of 2019 International Conference on Robotics and Automation.Washington,D.C.,US:IEEE , 2019 : 4782 - 4789 .
PAREKH D , PODDAR N , RAJPURKAR A , et al . A review on autonomous vehicles: progress, methods and challenges [J ] . Electronics , 2022 , 11 ( 14 ): 2162 . DOI: 10.3390/electronics11142162 http://doi.org/10.3390/electronics11142162 https://www.mdpi.com/2079-9292/11/14/2162 https://www.mdpi.com/2079-9292/11/14/2162 Vehicular technology has recently gained increasing popularity, and autonomous driving is a hot topic. To achieve safe and reliable intelligent transportation systems, accurate positioning technologies need to be built to factor in the different types of uncertainties such as pedestrian behavior, random objects, and types of roads and their settings. In this work, we look into the other domains and technologies required to build an autonomous vehicle and conduct a relevant literature analysis. In this work, we look into the current state of research and development in environment detection, pedestrian detection, path planning, motion control, and vehicle cybersecurity for autonomous vehicles. We aim to study the different proposed technologies and compare their approaches. For a car to become fully autonomous, these technologies need to be accurate enough to gain public trust and show immense accuracy in their approach to solving these problems. Public trust and perception of auto vehicles are also explored in this paper. By discussing the opportunities as well as the obstacles of autonomous driving technology, we aim to shed light on future possibilities.
QIU R , XU M , YAN Y Y , et al . A methodology review on multi-view pedestrian detection [J ] . Recent Advancements in Multi-View Data Analytics , 2022 : 317 - 339 .
HWANG S , PARK J , KIM N , et al . Multispectral pedestrian detection: benchmark dataset and baseline [C ] //Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Washington,D.C.,US:IEEE , 2015 : 1037 - 1045 .
KONIG D , ADAM M , JARVERS C , et al . Fully convolutional region proposal networks for multispectral person detection [C ] //Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops.Washington,D.C.,US:IEEE , 2017 : 49 - 56 .
ZHANG H , FROMONT E , LEFEVRE S , et al . Multispectral fusion for object detection with cyclic fuse-and-refine blocks [C ] //Proceedings of 2020 IEEE International Conference on Image Processing. Washington,D.C.,US:IEEE , 2020 : 276 - 280 .
LI C Y , SONG D , TONG R F , et al . Illumination-aware faster R-CNN for robust multispectral pedestrian detection [J ] . Pattern Recognition , 2019 , 85 : 161 - 171 . DOI: 10.1016/j.patcog.2018.08.005 http://doi.org/10.1016/j.patcog.2018.08.005 https://linkinghub.elsevier.com/retrieve/pii/S0031320318303030 https://linkinghub.elsevier.com/retrieve/pii/S0031320318303030
ZHANG L , ZHU X Y , CHEN X Y , et al . Weakly aligned cross-modal learning for multispectral pedestrian detection [C ] //Proceedings of the IEEE/CVF International Conference on Computer Vision.Seoul, Korea:IEEE , 2019 : 5127 - 5137 .
KIEU M , BAGDANOV A D , BERTINI M , et al . Task-conditioned domain adaptation for pedestrian detection in thermal imagery [C ] //Proceedings of European Conference on Computer Vision. Cham, Germany:Springer , 2020 : 546 - 562 .
LIU J J , ZHANG S T , WANG S , et al . Multispectral deep neural networks for pedestrian detection:arXiv:1611.02644 [R ] . Ithaca,NY , US : Cornell University , 2016 :1611.02644.
WAGNER J , FISCHER V , HERMAN M , et al. Multispectral pedestrian detection using deep fusion convolutional neural networks [C ] //Proceedings of the 24th European Symposium on Artificaial Neural Networks, Computational Intelligence and Machine Learning.Bruges, Belgium:ESANN , 2016 , 587 : 509 - 514 .
ZHOU K L , CHEN L S , CAO X . Improving multispectral pedestrian detection by addressing modality imbalance problems [C ] //Proceedings of European Conference on Computer Vision.Cham , Germany : Springer , 2020 : 787 - 803 .
DOLLAR P , APPEL R , BELONGIE S , et al . Fast feature pyramids for object detection [J ] . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2014 , 36 ( 8 ): 1532 - 1545 . DOI: 10.1109/TPAMI.2014.2300479 http://doi.org/10.1109/TPAMI.2014.2300479 Multi-resolution image features may be approximated via extrapolation from nearby scales, rather than being computed explicitly. This fundamental insight allows us to design object detection algorithms that are as accurate, and considerably faster, than the state-of-the-art. The computational bottleneck of many modern detectors is the computation of features at every scale of a finely-sampled image pyramid. Our key insight is that one may compute finely sampled feature pyramids at a fraction of the cost, without sacrificing performance: for a broad family of features we find that features computed at octave-spaced scale intervals are sufficient to approximate features on a finely-sampled pyramid. Extrapolation is inexpensive as compared to direct feature computation. As a result, our approximation yields considerable speedups with negligible loss in detection accuracy. We modify three diverse visual recognition systems to use fast feature pyramids and show results on both pedestrian detection (measured on the Caltech, INRIA, TUD-Brussels and ETH data sets) and general object detection (measured on the PASCAL VOC). The approach is general and is widely applicable to vision algorithms requiring fine-grained multi-scale analysis. Our approximation is valid for images with broad spectra (most natural images) and fails for images with narrow band-pass spectra (e.g., periodic textures).
ZHANG L , LIU Z Y , ZHANG S F , et al . Cross-modality interactive attention network for multispectral pedestrian detection [J ] . Information Fusion , 2019 , 50 : 20 - 29 . DOI: 10.1016/j.inffus.2018.09.015 http://doi.org/10.1016/j.inffus.2018.09.015 https://linkinghub.elsevier.com/retrieve/pii/S1566253518304111 https://linkinghub.elsevier.com/retrieve/pii/S1566253518304111
VASWANI A , SHAZEER N , PARMAR N , et al . Attention is all you need [J ] . Advances in Neural Information Processing Systems , 2017 , 30 : 13 - 22 .
HU J , SHEN L , SUN G . Squeeze-and-excitation networks [C ] //Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City,UT,US:IEEE , 2018 : 7132 - 7141 .
FU J , LIU J , TIAN H J , et al . Dual attention network for scene segmentation [C ] //Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Long Beach, CA, US:IEEE , 2019 : 3146 - 3154 .
HUANG L , WANG W M , CHEN J , et al . Attention on attention for image captioning [C ] //Proceedings of the IEEE/CVF International Conference on Computer Vision.Seoul, Korea:IEEE , 2019 : 4634 - 4643 .
LI X , WANG W H , HU X L , et al . Selective kernel networks [C ] //Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Washington,D.C.,US:IEEE , 2019 : 510 - 519 .
WOO S , PARK J , LEE J Y , et al . Cbam: Convolutional block attention module [C ] //Proceedings of the European Conference on Computer Vision . 2018 : 3 - 19 .DOI: 10.1007978-3-030-01234-2_1 https://dx.doi.org/10.1007978-3-030-01234-2_1 .
LIN T Y , GOYAL P , GIRSHICK R , et al . Focal loss for dense object detection [C ] //Proceedings of the IEEE International Conference on Computer Vision.Washington,D.C.,US:IEEE , 2017 : 2980 - 2988 .
LI C Y , SONG D , TONG R F , et al . Multispectral pedestrian detection via simultaneous detection and segmentation:arXiv : 1808 .04818[R ] . Ithaca,NY , US : Cornell University , 2018 :1808.04818.
GONZALEZ A , FANG Z , SOCARRAS Y , et al . Pedestrian detection at day/night time with visible and FIR cameras: a comparison [J ] . Sensors , 2016 , 16 ( 6 ): 820 . DOI: 10.3390/s16060820 http://doi.org/10.3390/s16060820 http://www.mdpi.com/1424-8220/16/6/820 http://www.mdpi.com/1424-8220/16/6/820
PARK K , KIM S , SOHN K . Unified multi-spectral pedestrian detection based on probabilistic fusion networks [J ] . Pattern Recognition , 2018 , 80 : 143 - 155 . DOI: 10.1016/j.patcog.2018.03.007 http://doi.org/10.1016/j.patcog.2018.03.007 https://linkinghub.elsevier.com/retrieve/pii/S0031320318300906 https://linkinghub.elsevier.com/retrieve/pii/S0031320318300906
0
Views
254
下载量
0
CNKI被引量
Publicity Resources
Related Articles
Related Author
Related Institution
京公网安备11010802024360号