Articles | Open Access |

PREPARING A DATASET FOR FACE DETECTION IN COMPUTER VISION

Rustamov Ilhom A‘zam ugli,Khojamurotov Azizbek Shukhrat ugli , Diplomat University intern teacher,Tashkent University of Information Technologies named after Muhammad Al-Khwarizmi

Abstract

 In recent years, the rapid development of artificial intelligence and computer vision technologies has significantly increased interest in face detection tasks. The effectiveness of face detection systems largely depends on the quality of the datasets used for training. Therefore, preparing a high-quality and diverse dataset has become a crucial scientific and practical challenge. This article provides a detailed overview of the dataset preparation process for face detection. It examines essential stages such as data collection and cleaning, face annotation, image normalization, and augmentation. These steps ensure dataset diversity and balance, which in turn contribute to the accuracy and robustness of face detection models.

The findings highlight that a well-prepared dataset not only improves the overall performance of face detection but also ensures that models remain reliable under various conditions, including changes in lighting, pose, facial expressions, and background environments.

Keywords

dataset, computer vision, face detection, annotation, normalization, augmentation, artificial intelligence.

References

Zhang, K., Zhang, Z., Li, Z., & Qiao, Y. (2016). Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks. IEEE Signal Processing Letters, 23(10), 1499–1503.

Viola, P., & Jones, M. (2001). Rapid object detection using a boosted cascade of simple features. Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1, 511–518.

Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., & Fei-Fei, L. (2009). ImageNet: A large-scale hierarchical image database. 2009 IEEE Conference on Computer Vision and Pattern Recognition, 248–255.

Everingham, M., Van Gool, L., Williams, C. K. I., Winn, J., & Zisserman, A. (2010). The PASCAL Visual Object Classes (VOC) Challenge. International Journal of Computer Vision, 88(2), 303–338.

Behnke, S. (2003). Hierarchical Neural Networks for Image Interpretation. Springer.

Chollet, F. (2015). Keras: Deep Learning library for Theano and TensorFlow. GitHub Repository. https://keras.io

Bradski, G. (2000). The OpenCV Library. Dr. Dobb’s Journal of Software Tools.

Russell, B. C., Torralba, A., Murphy, K. P., & Freeman, W. T. (2008). LabelMe: A database and web-based tool for image annotation. International Journal of Computer Vision, 77(1–3), 157–173.

Dutta, A., & Zisserman, A. (2019). The VIA Annotation Software for Images, Audio and Video. Proceedings of the 27th ACM International Conference on Multimedia, 2276–2279.

Van der Walt, S., Schönberger, J. L., Nunez-Iglesias, J., Boulogne, F., Warner, J. D., Yager, N., Gouillart, E., & Yu, T. (2014). scikit-image: image processing in Python. PeerJ, 2, e453.

Article Statistics

Downloads

Download data is not yet available.

Copyright License

Download Citations

How to Cite

PREPARING A DATASET FOR FACE DETECTION IN COMPUTER VISION. (2025). International Journal of Artificial Intelligence, 5(09), 680-690. https://www.academicpublishers.org/journals/index.php/ijai/article/view/6454