STEMM Institute Press
Science, Technology, Engineering, Management and Medicine
Lane Image Semantic Segmentation Technology Based on BiSeNetV2 Network
DOI: https://doi.org/10.62517/jike.202404110
Author(s)
Xiao Hu1,*, Mingju Chen1,2
Affiliation(s)
1School of Automation and Information Engineering, Sichuan University of Science & Engineering, Sichuan, Yibin, Sichuan, China 2Artificial Intelligence Key Laboratory of Sichuan Province, Sichuan University of Science & Engineering, Sichuan, Yibin, Sichuan,China *Corresponding Author.
Abstract
With the rapid development of automatic driving technology, lane image semantic segmentation plays an increasingly important role in intelligent transportation systems. In this paper, a lane image semantic segmentation technology based on the BiSeNetV2 network is proposed. First, we describe the dual-branch structure and feature fusion module in the BiSeNetV2 network, and then elaborate on our improvements in the lane image semantic segmentation task. We incorporated the attention mechanism to help the model grasp the overall structure of the image more effectively and enhance the segmentation accuracy. Simultaneously, we introduce depth separable convolution to decrease computational redundancy and simplify the model's complexity. Ultimately, we performed experiments on the Cityscapes dataset, and the results revealed that the proposed algorithm comprises 1.21× parameters, with an average intersection ratio of 71.4%. At the same time, the network model and algorithm proposed are contrasted with other equally sophisticated techniques. The comparison findings demonstrate that our approach successfully enhances the accuracy and real-time performance of lane image segmentation in comparison to alternative methods.
Keywords
Image Semantic Segmentation; BiSeNetV2 network; Dual-branch Structure; Feature Fusion; Attention Mechanism
References
[1] Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation Proceedings of the IEEE conference on computer vision and pattern recognition. 2015: 3431-3440. [2] Yu C, Gao C, Wang J, et al. Bisenet v2: Bilateral network with guided aggregation for real-time semantic segmentation. International Journal of Computer Vision, 2021, 129: 3051-3068. [3] Badrinarayanan V, Kendall A, Cipolla R. Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE transactions on pattern analysis and machine intelligence, 2017, 39 (12): 2481-2495. [4] Chen L C, Papandreou G, Kokkinos I, et al. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE transactions on pattern analysis and machine intelligence, 2017, 40 (4): 834-848. [5] Lin G, Milan A, Shen C, et al. Refinenet: Multi-path refinement networks for high-resolution semantic segmentation Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 1925-1934. [6] WANG Longfei, YAN Chunman. Overview of semantic segmentation of road scenes. Advances in Lasers and Optoelectronics, 2021, 58(12): 44-66. [7] Mingju Chen, Hongyang Li, Hongming Peng, Xingzhong Xiong, Ning Long. HPCDNet: Hybrid position coding and dual-frquency domain transform network for low-light image enhancement. Mathematical Biosciences and Engineering, 2024, 21 (2): 1917-1937. [8] Yu C, Wang J, Peng C, et al. Bisenet: Bilateral segmentation network for real-time semantic segmentation Proceedings of the European conference on computer vision (ECCV). 2018: 325-341. [9] Yu C, Gao C, Wang J, et al. Bisenet v2: Bilateral network with guided aggregation for real-time semantic segmentation. International Journal of Computer Vision, 2021, 129: 3051-3068. [10] IOFFE S, SZEGEDY C. Batch normalization: accelerating deep network training by reducing internal covariate shift Proceedings of the 32nd International Conference on Machine Learning. Lille: JMLR. org, 2015: 448 − 456. [11] Chen M, Yi S, Lan Z, et al. An Efficient Image Deblurring Network with a Hybrid Architecture. Sensors, 2023. [12] Xia T H, Tan M, Li J H, et al. Establish a normal fetal lung gestational age grading model and explore the potential value of deep learning algorithms in fetal lung maturity evaluation. Chinese Medical Journal, 2021, 134. [13] Wang F, Luo X, Wang Q, et al. Aerial-BiSeNet: A real-time semantic segmentation network for high resolution aerial imagery. Chinese Journal of Aeronautics, 2021. [14] BADRINARAYANAN V, KENDALL A, CIPOLLA R. SegNet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39 (12): 2481 – 2495.
Copyright @ 2020-2035 STEMM Institute Press All Rights Reserved