APPLICATION OF NEURAL NETWORKS FOR AUTOMATED TEXT AND SYMBOL RECOGNITION IN CYBERSECURITY TASKS
DOI:
https://doi.org/10.28925/2663-4023.2025.31.1032Keywords:
text recognition, OCR, CNN, RNN, CTC, machine learning, distorted images, Tesseract OCR.Abstract
This work presents the development and investigation of an Optical Character Recognition (OCR) system for low-quality text using machine learning methods. To address the task, two grayscale image datasets were created: the first consisting of isolated English alphabet characters and digits (4,960 images of 250×50 pixels), and the second containing fragments of meaningful text from the book The Hunger Games (4,010 images of 680×50 pixels). To enhance model robustness, the images were distorted using blur and digital noise functions. The OCR model was built based on a combination of Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN) with a Connectionist Temporal Classification (CTC) layer for sequence correction. Training was conducted over 70–80 epochs with a 9:1 split for training and validation datasets. A comparative analysis was carried out between the developed system and Tesseract OCR. Experimental results demonstrated that the proposed model achieves superior recognition performance on low-quality images, particularly those affected by digital noise, whereas Tesseract OCR significantly loses accuracy under such conditions. The results confirm the effectiveness of hybrid neural network architectures for recognizing distorted text. Future work will focus on recognizing multi-line meaningful text and improving the model's resilience to various types of visual distortions.
Downloads
References
Kumar, M., Singh, S., Dureja, A., Narula, R., & Shyla, S. (2025). OCR-CRNN (WBS): An optical character recognition system based on convolutional recurrent neural network embedded with word beam search decoder for extraction of text. International Journal of Information Technology, 17(7), 849–860. https://doi.org/10.1007/s41870-025-02540-x
Bernasconi, E. (2025). Enhancing symbol recognition in library science via convolutional neural networks. Journal of Computer Science, 16(2), 119–130. https://doi.org/10.3390/jcs16020119
Liu, Y. (2023). A convolutional recurrent neural-network-based model for scene text recognition. Symmetry, 15(4), 849–860. https://doi.org/10.3390/sym15040849
Drobac, S. (2020). Optical character recognition with neural networks and post-correction techniques. Pattern Recognition, 100, 107–118. https://doi.org/10.1007/s10032-020-00359-9
Sinthuja, M. (2024). Extraction of text from images using deep learning. Procedia Computer Science, 187, 751–758. https://doi.org/10.1016/j.procs.2024.04.099
Yousef, M., Hussain, K. F., & Mohammed, U. S. (2018). Accurate, data-efficient, unconstrained text recognition with convolutional neural networks. arXiv preprint arXiv:1812.11894. https://arxiv.org/abs/1812.11894
Chen, L. (2020). Attacking optical character recognition (OCR) systems with adversarial examples. arXiv preprint arXiv:2002.03095. https://arxiv.org/abs/2002.03095
Li, B., Tang, X., Qi, X., Chen, Y., & Xiao, R. (2020). Hamming OCR: A locality sensitive hashing neural network for scene text recognition. arXiv preprint arXiv:2009.10874. https://arxiv.org/abs/2009.10874
Du, Y., et al. (2020). PP-OCR: A practical ultra lightweight OCR system. arXiv preprint arXiv:2009.09941. https://arxiv.org/abs/2009.09941
Liao, M., et al. (2019). Mask TextSpotter: An end-to-end trainable neural network for spotting text with arbitrary shapes. arXiv preprint arXiv:1908.08207. https://arxiv.org/abs/1908.08207
Shi, B., Bai, X., & Yao, C. (2017). An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(11), 2298–2304. https://doi.org/10.1109/TPAMI.2016.2645393
Liu, Y. (2024, October 14). Convolutional recurrent neural network for text recognition. XenonStack Insights. https://www.xenonstack.com/insights/crnn-for-text-recognition
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Наталія Чернящук

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.