KNOWLEDGE GRAPH-BASED MODELING OF WEB APPLICATIONS

Authors

DOI:

https://doi.org/10.28925/2663-4023.2025.31.1005

Keywords:

web applications; knowledge graphs; web application parsing; data extraction; web application modeling.

Abstract

Traditional web crawling and parsing techniques, which rely on extracting and recursively following hyperlinks to map a web application, are increasingly inadequate for modern, dynamic web applications. This conventional method essentially creates a graph where nodes are merely chunks of unstructured data (the web page content) and edges represent simple page transitions. It fundamentally fails to capture the rich interactive behaviors and state changes that define contemporary applications. In response, a novel approach is proposed that models the web application not as a collection of links, but as a system of structured states. Under this paradigm, each node represents a precise, organized description of the application's current condition, while the edges strictly denote user-initiated actions or purposeful transitions. This fundamental shift from unstructured content to structured representation yields a far more complete and functional understanding of the application's true behavior. This enhanced model promises significant utility for advanced downstream tasks, particularly in automated testing and sophisticated behavior analysis.

Downloads

Download data is not yet available.

References

Mesbah, A., van Deursen, A., & Lenselink, S. (2012). Crawling AJAX-based Web Applications through Dynamic Analysis of User Interface State Changes. ACM Transactions on the Web, 6(1), 1–30.

Pellegrino, G., Tschürtz, C., Bodden, E., & Rossow, C. (2015). jÄk: Using Dynamic Analysis to Crawl and Test Modern Web Applications. У Proceedings of the 18th International Symposium on Research in Attacks, Intrusions and Defenses (RAID’15) (pp. 295–316). Kyoto, Japan.

van Deursen, A., & Mesbah, A. (2015). Crawl-based analysis of web applications: Prospects and challenges. Information & Software Technology, 57, 123–136.

Najork, M. (2009). Web Crawler Architecture (Microsoft Research Technical Report).

Heydon, A., & Najork, M. (1999). Mercator: A Scalable, Extensible Web Crawler. У Proceedings of the 8th International World Wide Web Conference (WWW’99). Toronto, Canada.

Abu Kausar, M., Dhaka, V. S., & Singh, S. K. (2013). Web Crawler: A Review. International Journal of Computer Applications, 63(2), 31–34.

Olston, C., & Najork, M. (2010). Web Crawling Contents: A Survey (Stanford University Technical Report).

Furche, T., Gottlob, G., Grasso, G., Guo, X., Orsi, G., & Schallhart, C. (2012). The Ontological Key: Automatically Understanding and Integrating Forms to Access the Deep Web. У Proceedings of the VLDB Workshop on Deep Web.

Kosala, R., & Blockeel, H. (2000). Web Mining Research: A Survey. ACM SIGKDD Explorations, 2(1), 1–15.

Mukhopadhyay, D., Mukherjee, S., Ghosh, S., Kar, S., & Kim, Y.-C. (2011). Architecture of a Scalable Dynamic Parallel WebCrawler with High-Speed Downloadable Capability for a Web Search Engine (arXiv preprint).

Hassanshahi, B., Lee, H., Krishnan, P., & Güß, J. (2020). Gelato: Feedback-driven and Guided Security Analysis of Client-side Web Applications (arXiv preprint).

Cazzaro, L. (2025). Less is More: Boosting Coverage of Web Crawling through Adaptive Strategies. У Proceedings of the International Conference on Dependable Systems and Networks (DSN’25).

Stafeev, A. (2024). Evaluating the Effectiveness of Crawling Algorithms for Web Applications (USENIX Summer School Report).

Mirtaheri, S. M., Dinçtürk, M. E., Hooshmand, S., & Bochmann, G. V. (2013). A Brief History of Web Crawlers. У Proceedings of the 22nd International World Wide Web Conference (WWW’13) (pp. 1–10).

Kolias, V., Anagnostopoulos, I., & Kayafas, E. (2014). Exploratory Analysis of a Terabyte-Scale Web Corpus (arXiv preprint).

Olston, C., & Najork, M. (2010). Web Crawling Contents: Techniques and Challenges (Stanford University Technical Report).

Kausar, M. A., Dhaka, V. S., & Singh, S. K. (2013). Web Crawler: Survey and Comparison. International Journal of Computer Applications, 63(2), 35–40.

Downloads


Abstract views: 0

Published

2025-12-16

How to Cite

Romanenko, S., Redziuk, Y., & Holub, O. (2025). KNOWLEDGE GRAPH-BASED MODELING OF WEB APPLICATIONS. Electronic Professional Scientific Journal «Cybersecurity: Education, Science, Technique», 3(31), 100–112. https://doi.org/10.28925/2663-4023.2025.31.1005