ЗАСТОСУВАННЯ НЕЙРОННИХ МЕРЕЖ ДЛЯ АВТОМАТИЗОВАНОГО РОЗПІЗНАВАННЯ ТЕКСТУ ТА СИМВОЛІВ У ЗАДАЧАХ КІБЕРБЕЗПЕКИ

Nataliia Cherniiashchuk

doi:10.28925/2663-4023.2025.31.1032

Authors

Nataliia Cherniiashchuk Lesya Ukrainka Volyn National University https://orcid.org/0000-0002-3178-8377

DOI:

https://doi.org/10.28925/2663-4023.2025.31.1032

Keywords:

text recognition, OCR, CNN, RNN, CTC, machine learning, distorted images, Tesseract OCR.

Abstract

This work presents the development and investigation of an Optical Character Recognition (OCR) system for low-quality text using machine learning methods. To address the task, two grayscale image datasets were created: the first consisting of isolated English alphabet characters and digits (4,960 images of 250×50 pixels), and the second containing fragments of meaningful text from the book The Hunger Games (4,010 images of 680×50 pixels). To enhance model robustness, the images were distorted using blur and digital noise functions. The OCR model was built based on a combination of Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN) with a Connectionist Temporal Classification (CTC) layer for sequence correction. Training was conducted over 70–80 epochs with a 9:1 split for training and validation datasets. A comparative analysis was carried out between the developed system and Tesseract OCR. Experimental results demonstrated that the proposed model achieves superior recognition performance on low-quality images, particularly those affected by digital noise, whereas Tesseract OCR significantly loses accuracy under such conditions. The results confirm the effectiveness of hybrid neural network architectures for recognizing distorted text. Future work will focus on recognizing multi-line meaningful text and improving the model's resilience to various types of visual distortions.

Downloads

Download data is not yet available.

References

Kumar, M., Singh, S., Dureja, A., Narula, R., & Shyla, S. (2025). OCR-CRNN (WBS): An optical character recognition system based on convolutional recurrent neural network embedded with word beam search decoder for extraction of text. International Journal of Information Technology, 17(7), 849–860. https://doi.org/10.1007/s41870-025-02540-x

Bernasconi, E. (2025). Enhancing symbol recognition in library science via convolutional neural networks. Journal of Computer Science, 16(2), 119–130. https://doi.org/10.3390/jcs16020119

Liu, Y. (2023). A convolutional recurrent neural-network-based model for scene text recognition. Symmetry, 15(4), 849–860. https://doi.org/10.3390/sym15040849

Drobac, S. (2020). Optical character recognition with neural networks and post-correction techniques. Pattern Recognition, 100, 107–118. https://doi.org/10.1007/s10032-020-00359-9

Sinthuja, M. (2024). Extraction of text from images using deep learning. Procedia Computer Science, 187, 751–758. https://doi.org/10.1016/j.procs.2024.04.099

Yousef, M., Hussain, K. F., & Mohammed, U. S. (2018). Accurate, data-efficient, unconstrained text recognition with convolutional neural networks. arXiv preprint arXiv:1812.11894. https://arxiv.org/abs/1812.11894

Chen, L. (2020). Attacking optical character recognition (OCR) systems with adversarial examples. arXiv preprint arXiv:2002.03095. https://arxiv.org/abs/2002.03095

Li, B., Tang, X., Qi, X., Chen, Y., & Xiao, R. (2020). Hamming OCR: A locality sensitive hashing neural network for scene text recognition. arXiv preprint arXiv:2009.10874. https://arxiv.org/abs/2009.10874

Du, Y., et al. (2020). PP-OCR: A practical ultra lightweight OCR system. arXiv preprint arXiv:2009.09941. https://arxiv.org/abs/2009.09941

Liao, M., et al. (2019). Mask TextSpotter: An end-to-end trainable neural network for spotting text with arbitrary shapes. arXiv preprint arXiv:1908.08207. https://arxiv.org/abs/1908.08207

Shi, B., Bai, X., & Yao, C. (2017). An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(11), 2298–2304. https://doi.org/10.1109/TPAMI.2016.2645393

Liu, Y. (2024, October 14). Convolutional recurrent neural network for text recognition. XenonStack Insights. https://www.xenonstack.com/insights/crnn-for-text-recognition

APPLICATION OF NEURAL NETWORKS FOR AUTOMATED TEXT AND SYMBOL RECOGNITION IN CYBERSECURITY TASKS

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

How to Cite

Issue

Section

License

index

Language

Make a Submission

counter

Information

Developed By

Current Issue