RESEARCH INTO THE APPLICATION OF VECTOR DATABASES IN GENERATIVE ARTIFICIAL INTELLIGENCE

Oleksii Smirnov; Roman Tkachuk; Nataliia Kozirova; Liliia Konstantynova; Oksana Konoplitska-Slobodeniuk; Nataliia Yakymenko; Serhii Smirnov

doi:10.28925/2663-4023.2026.33.1248

Authors

Oleksii Smirnov, Central Ukrainian National Technical University, https://orcid.org/0000-0001-9543-874X
Roman Tkachuk, Central Ukrainian National Technical University, https://orcid.org/0000-0002-1984-0419
Nataliia Kozirova, Central Ukrainian National Technical University, https://orcid.org/0009-0005-8753-5132
Liliia Konstantynova, Central Ukrainian National Technical University, https://orcid.org/0000-0002-3305-2427
Oksana Konoplitska-Slobodeniuk, Central Ukrainian National Technical University, https://orcid.org/0000-0001-9981-5194
Nataliia Yakymenko, Central Ukrainian National Technical University, https://orcid.org/0000-0002-4498-0093
Serhii Smirnov, Central Ukrainian National Technical University, https://orcid.org/0000-0002-7649-7442

Keywords:

databases, artificial intelligence, vector databases (VDB), generative artificial intelligence (GenAI), large language models (LLM), Retrieval-Augmented Generation (RAG), semantic search, vector embeddings, HNSW, RAFT, scalability, cognitive infrastructure, ANN, quantization

Abstract

This paper examines the use of vector databases in generative artificial intelligence (GenAI). Its purpose is to assess vector databases as the foundation of modern AI infrastructure and their role in expanding the cognitive capabilities of GenAI. The object of the study is the process of using databases in artificial intelligence; the subject is the use of vector databases in generative artificial intelligence. The study addressed three tasks: it investigated the role of vector stores in expanding the cognitive capabilities of generative AI; it compared traditional relational systems with vector databases; and it analyzed the mechanics and verification potential of the RAG and RAFT architectural paradigms. Four directions for further research are identified: developing effective mechanisms for monitoring and automatically correcting semantic drift of embeddings as corporate knowledge is continuously updated; investigating adaptive memory management strategies that eliminate threshold performance effects when scaling vector indexes; investigating hybrid search architectures that combine semantic and full-text indexing in a single technological environment; and standardizing metrics for assessing vector search quality against the specific requirements of industrial RAG systems, which would support a more informed choice of infrastructure solutions for specific application tasks.
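
The semantic search that the abstract refers to can be illustrated with a minimal sketch, which is not taken from the paper. It assumes that documents are already represented as embedding vectors and ranks them by cosine similarity to a query vector using an exhaustive scan. The names `cosine_similarity`, `top_k`, and `store`, as well as the three-dimensional toy vectors, are hypothetical; production vector databases typically replace the exhaustive scan with an approximate nearest neighbor (ANN) index such as HNSW.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def top_k(query, store, k=2):
    """Return the ids of the k stored vectors most similar to the query.

    This is an exhaustive (brute-force) scan over every stored vector;
    it is only an illustration of the ranking step, not an ANN index.
    """
    scored = [(cosine_similarity(query, vec), doc_id) for doc_id, vec in store.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [doc_id for _, doc_id in scored[:k]]


# Hypothetical toy embeddings; real embeddings have hundreds of dimensions.
store = {
    "doc_a": [1.0, 0.0, 0.0],
    "doc_b": [0.9, 0.1, 0.0],
    "doc_c": [0.0, 1.0, 0.0],
}
query = [1.0, 0.05, 0.0]
nearest = top_k(query, store, k=2)
print(nearest)
```

In a RAG pipeline, the documents returned by such a search would be passed to the language model as context for generation.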
