TINYBERT WITH CROSS-ATTENTION + LORA + BI-GRU: A COMPACT NEURAL NETWORK ARCHITECTURE FOR FAKE NEWS DETECTION
DOI:
https://doi.org/10.28925/2663-4023.2025.29.908Keywords:
fake news detection, TinyBERT, cross-attention, LoRA, Bi-GRU, natural language processing, transformer models, AI fact-checkingAbstract
This paper proposes a compact neural network architecture based on TinyBERT with a Cross-Attention mechanism, Low-Rank Adaptation (LoRA), and a bidirectional recurrent Bi-GRU layer for automatic fake news detection. The rapid development of the digital information space has led to a sharp increase in the amount of disinformation, while traditional fact-checking methods can no longer effectively counter this trend. Existing neural network approaches based on large transformer models are often unsuitable for practical use due to their high computational cost and inability to explicitly analyze semantic inconsistencies between a news headline and its body text. The proposed model addresses these limitations by employing a distilled TinyBERT transformer, which significantly reduces computational requirements, and a specialized Cross-Attention module that provides sensitivity to headline–body discrepancies. The use of LoRA adapters minimizes the number of trainable parameters, speeding up the fine-tuning process, while the integration of Bi-GRU enables the preservation of contextual information in sequences. Experimental evaluation on the Fake News Classification dataset demonstrated that the model outperforms classical machine learning algorithms, achieving an accuracy of 99%, an F1-score of 0.985, and a ROC AUC of 0.998. Due to its low resource consumption, rapid adaptability to new topics, and high interpretability of decisions, the model shows strong potential for integration into real-time fact-checking services.
Downloads
References
Yoon, S., et al. (2021). Learning to detect incongruence in news headline and body text via a graph neural network. IEEE Access, 9, 36195–36206. https://doi.org/10.1109/access.2021.3062029
Jiao, X., et al. (2020). TinyBERT: Distilling BERT for natural language understanding. Findings of the Association for Computational Linguistics: EMNLP 2020. https://doi.org/10.18653/v1/2020.findings-emnlp.372
Ye, H., et al. (2025). LoRA-Adv: Boosting text classification in large language models through adversarial low-rank adaptations. IEEE Access, 1. https://doi.org/10.1109/access.2025.3579539
Taher, Y., Moussaoui, A., & Moussaoui, F. (2022). Automatic fake news detection based on deep learning, FastText and news title. International Journal of Advanced Computer Science and Applications, 13(1). https://doi.org/10.14569/ijacsa.2022.0130118
Mishra, R., & Setty, V. (2019). SADHAN: Hierarchical attention networks to learn latent aspect embeddings for fake news detection. In Proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval (ICTIR ’19) (pp. 197–204). ACM. https://doi.org/10.1145/3341981.3344229
Shu, K., et al. (2020). Hierarchical propagation networks for fake news detection: Investigation and exploitation. Proceedings of the International AAAI Conference on Web and Social Media, 14, 626–637. https://doi.org/10.1609/icwsm.v14i1.7329
Chang, Q., Li, X., & Duan, Z. (2024). Graph global attention network with memory: A deep learning approach for fake news detection. Neural Networks, 106115. https://doi.org/10.1016/j.neunet.2024.106115
Qawasmeh, E., Tawalbeh, M., & Abdullah, M. (2019). Automatic identification of fake news using deep learning. In 2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS) (pp. 1–7). IEEE. https://doi.org/10.1109/snams.2019.8931873
Dun, Y., et al. (2021). KAN: Knowledge-aware attention network for fake news detection. Proceedings of the AAAI Conference on Artificial Intelligence, 35(1), 81–89. https://doi.org/10.1609/aaai.v35i1.16080
Alghamdi, J., Lin, Y., & Luo, S. (2024). Enhancing hierarchical attention networks with CNN and stylistic features for fake news detection. Expert Systems with Applications, 125024. https://doi.org/10.1016/j.eswa.2024.125024
Zhang, W., et al. (2024). Cross-attention multi-perspective fusion network-based fake news censorship. Neurocomputing, 128695. https://doi.org/10.1016/j.neucom.2024.128695
Maham, S., et al. (2024). ANN: Adversarial news net for robust fake news classification. Scientific Reports, 14(1). https://doi.org/10.1038/s41598-024-56567-4
Zhou, D., et al. (2024). GS2F: Multimodal fake news detection utilizing graph structure and guided semantic fusion. ACM Transactions on Asian and Low-Resource Language Information Processing. https://doi.org/10.1145/3708536
Kuntur, S., et al. (2024). Under the influence: A survey of large language models in fake news detection. IEEE Transactions on Artificial Intelligence, 1–21. https://doi.org/10.1109/tai.2024.3471735
Kabir, M. M., et al. (2025). CUET-NLP_MP@DravidianLangTech 2025: A transformer and LLM-based ensemble approach for fake news detection in Dravidian. In Proceedings of the Fifth Workshop on Speech, Vision, and Language Technologies for Dravidian Languages (pp. 420–426). ACL. https://doi.org/10.18653/v1/2025.dravidianlangtech-1.75
Kaggle. (2023, October 8). Fake news classification.
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Іван Пелещак, Василь Литвин, Вікторія Висоцька, Олексій Хобор

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.