Research on a Lightweight Small Object Detection Method Based on Lite-RFB Modules

Fei Wang; Liping Lu

doi:10.2478/ijanmc-2025-0039

Abstract

—Small object detection remains a formidable challenge in computer vision, primarily because conventional models like SSD suffer from two critical limitations: weak semantic information in shallow feature maps and a mismatch between the receptive field and the actual size of small targets. To address these deficiencies, this paper introduces Lite-RFB SSD, an innovative architecture that strategically integrates a lightweight Receptive Field Block (RFB) module into the SSD framework. This module is meticulously reconstructed using depthwise separable convolutions and channel pruning techniques, resulting in a remarkable 62% reduction in parameters. By embedding this optimized module into the shallow conv4_3 layer, the model preserves high-resolution features crucial for small object detection while significantly enhancing computational efficiency. Experimental validation on the PASCAL VOC dataset demonstrates that Lite-RFB SSD achieves an average precision for small objects (APs) of 22.9%, a substantial 4.2% improvement over the original SSD. Furthermore, it operates at an impressive 28 FPS on edge devices, establishing a superior balance between accuracy and efficiency that outperforms competing methods such as standard RFB and MobileNet-SSD.

References

Tan, M., & Le, Q. V. (2021). EfficientNetV2: Smaller Models and Faster Training. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 10096-10105).
Search in Google Scholar Back to article
Howard, A., Sandler, M., Chu, G., Chen, L. C., Chen, B., Tan, M, & Adam, H. (2019). Searching for MobileNetV3. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 1314-1324).
Search in Google Scholar Back to article
Lin, T. Y., Goyal, P., Girshick, R., He, K., & Dollár, P. (2020). Focal Loss for Dense Object Detection. International Journal of Computer Vision, 128(3), 640-657.
Search in Google Scholar Back to article
Redmon, J., & Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767.
Search in Google Scholar Back to article
Bochkovskiy, A., Wang, C. Y., & Liao, H. Y. M. (2020). Yolov4: Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934.
Search in Google Scholar Back to article
Liu, S., Qi, L., Qin, H., Shi, J., & Jia, J. (2020). Path aggregation network for instance segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 9206-9215).
Search in Google Scholar Back to article
Wang, C. Y., Mark Liao, H. Y., & Wu, Y. H. (2023). YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 7264-7275).
Search in Google Scholar Back to article
Jocher, G. (2020). Yolov5: YOLOv5 by Ultralytics. GitHub Repository. https://github.com/ultralytics/yolov5.
Search in Google Scholar Back to article
Rezatofighi, H., Tsoi, N., Gwak, J., Sadeghian, A., Reid, I., & Savarese, S. (2019). Generalized intersection over union: A metric and a loss for bounding box regression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 658-666).
Search in Google Scholar Back to article
Zhu, X., Su, W., Lu, L., Li, B., Wang, X., & Dai, J. (2021). Deformable DETR: Deformable transformers for end-to-end object detection. International Journal of Computer Vision, 129(6), 1553-1569.
Search in Google Scholar Back to article
Wang, D., Li, S., & Guo, Y. (2022). A Lightweight Small Object Detection Algorithm Based on Improved YOLOv5. Acta Automatica Sinica, 48(5), 1201-1210.
Search in Google Scholar Back to article
Zhang, T., Liu, J., & Guo, Y. (2021). A Survey of Lightweight Object Detection Algorithms for Complex Scenes. Chinese Journal of Computers, 44(8), 1623-1645.
Search in Google Scholar Back to article
Chen, X., Li, X., & Jiao, L. (2020). Research on Small Object Detection Algorithm Fusing Multi-Scale Features. Journal of Image and Graphics, 25(7), 1345-1356.
Search in Google Scholar Back to article
Zhao, Y., Wang, N., & Ding, X. (2023). Small Object Detection in Remote Sensing Images Based on Attention Mechanism and Feature Fusion. Journal of Electronics & Information Technology, 45(2), 456-464.
Search in Google Scholar Back to article
Sun, H., Wang, L., & Tan, T. (2022). Optimization and Implementation of Real-Time Object Detection Algorithms for Edge Computing. Journal of Computer Research and Development, 59(9), 1987-2001.
Search in Google Scholar Back to article

Research on a Lightweight Small Object Detection Method Based on Lite-RFB Modules

Abstract

Paradigm

My account