Assessing Software Architecture Resilience Using Quantitative Metrics in Cloud Native Application Development Environments
Keywords:
system resilience, quantitative metrics, recovery time, fault tolerance, cloud native architecturesAbstract
Cloud native architectures are essential for modern software systems due to their ability to handle dynamic environments, scalability, and high availability. However, ensuring resilience in these systems remains a significant challenge, particularly under varying operational conditions such as high-load periods and failure scenarios. This study aims to assess the resilience of cloud native architectures using quantitative metrics that objectively evaluate key attributes such as availability, fault tolerance, recovery time, and scalability. Through the application of these metrics, the study identifies the strengths and weaknesses of the architecture, providing insights into how the system performs under stress and recovers from failures. The results show that while the architecture demonstrates strong availability and scalability under typical conditions, recovery time and scalability under extreme load conditions reveal areas for improvement. Specifically, issues with resource allocation and self-healing capabilities were identified as key weaknesses affecting the overall resilience of the system. These findings highlight the importance of using data-driven metrics to gain detailed insights into system resilience and to guide architectural improvements. The study also emphasizes the need for continuous monitoring and adaptation of the architecture to optimize fault tolerance and recovery processes. The implications of this research extend to cloud application developers and architects, offering actionable recommendations for improving system resilience. Future research could focus on integrating real-time monitoring systems, developing more advanced resilience metrics, and incorporating AI-driven scaling techniques to further enhance the adaptability and robustness of cloud native systems. By addressing these challenges, cloud native architectures can be better equipped to maintain high performance and reliability in dynamic, real-world environments.
References
[1] B. M. Harve et al., “The Cloud-Native Revolution: Microservices in a Cloud-Driven World,” in 2024 International Conference on Intelligent Cybernetics Technology and Applications, ICICyTA 2024, 2024, pp. 1043 – 1048. doi: 10.1109/ICICYTA64807.2024.10913359.
[2] B. Nascimento, R. Santos, J. Henriques, M. V Bernardo, and F. Caldeira, “Availability, Scalability, and Security in the Migration from Container-Based to Cloud-Native Applications,” Computers, vol. 13, no. 8, 2024, doi: 10.3390/computers13080192.
[3] A. P. Perumal, Building resilient systems: The role of containers and kubernetes. 2025. doi: 10.4018/979-8-3373-0365-9.ch013.
[4] A. Angelis and G. Kousiouris, “An Overview on the Landscape of Self-Adaptive Cloud Design and Operation Patterns: Goals, Strategies, Tooling, Evaluation, and Dataset Perspectives,” Futur. Internet, vol. 17, no. 10, 2025, doi: 10.3390/fi17100434.
[5] L. Mwafaq, N. K. Meftin, E. E. Asanovich, M. K. H. Al-Dulaimi, and T. K. Hasan, “Cloud-Native Architectures: Transforming Enterprise IT Operations,” Iran. J. Inf. Process. Manag., vol. 40, pp. 259 – 286, 2025, doi: 10.22034/jipm.2025.728114.
[6] H. Cho, J.-H. Sung, H.-J. Kang, J. Jang, and D. Shin, “Quantifying Cyber Resilience: A Framework Based on Availability Metrics and AUC-Based Normalization,” Electron., vol. 14, no. 12, 2025, doi: 10.3390/electronics14122465.
[7] H. Wei, N. Madhavji, and J. Steinbacher, “A Map of Cloud-Native Practices and Tools to Achieve Desirable System Qualities,” in Proceedings - 2025 IEEE 22nd International Conference on Software Architecture, ICSA 2025, 2025, pp. 221 – 231. doi: 10.1109/ICSA65012.2025.00030.
[8] H. Wang et al., “Cloud-native systems resilience assessments based on kubernetes architecture graph,” Serv. Oriented Comput. Appl., 2024, doi: 10.1007/s11761-024-00406-x.
[9] P. Souza, W. Marques, R. Reis, and T. Ferreto, “Iagree: Infrastructure-agnostic Resilience Benchmark Tool for Cloud Native Platforms,” in CLOSER 2019 - Proceedings of the 9th International Conference on Cloud Computing and Services Science, 2019, pp. 396 – 403. doi: 10.5220/0007728503960403.
[10] S. Henning and W. Hasselbring, “A configurable method for benchmarking scalability of cloud-native applications,” Empir. Softw. Eng., vol. 27, no. 6, 2022, doi: 10.1007/s10664-022-10162-1.
[11] M. Whitaker, B. Volckaert, and M. Al-Naday, “Idempotency in Service Mesh: For Resiliency of Fog-Native Applications in Multi-Domain Edge-to-Cloud Ecosystems,” in International Conference on Cloud Computing and Services Science, CLOSER - Proceedings, 2025, pp. 182 – 189. doi: 10.5220/0013293900003950.
[12] D. Gannon, R. Barga, and N. Sundaresan, “Cloud-Native Applications,” IEEE Cloud Comput., vol. 4, no. 5, pp. 16 – 21, 2017, doi: 10.1109/MCC.2017.4250939.
[13] O. Pozdniakova, D. Mažeika, and A. Cholomskis, “Adaptive Resource Provisioning and Auto-scaling for Cloud Native Software,” Commun. Comput. Inf. Sci., vol. 920, pp. 113 – 129, 2018, doi: 10.1007/978-3-319-99972-2_9.
[14] Q. Jiao, B. Xu, and Y. Fan, “Design of Cloud Native Application Architecture Based on Kubernetes,” in Proceedings - 2021 IEEE International Conference on Dependable, Autonomic and Secure Computing, International Conference on Pervasive Intelligence and Computing, International Conference on Cloud and Big Data Computing and International Conference on Cyber Science and Technology Congress, DASC/PiCom/CBDCom/CyberSciTech 2021, 2021, pp. 494 – 499. doi: 10.1109/DASC-PICom-CBDCom-CyberSciTech52372.2021.00088.
[15] D. Bharadwaj and B. S. Premananda, “Transition of Cloud Computing from Traditional Applications to the Cloud Native Approach,” in 2022 IEEE North Karnataka Subsection Flagship International Conference, NKCon 2022, 2022. doi: 10.1109/NKCon56289.2022.10126871.
[16] Y. Kim, C. Park, and Y. Shin, “Security Consideration of Each Layers in a Cloud-Native Environment,” Commun. Comput. Inf. Sci., vol. 1644 CCIS, pp. 231 – 242, 2023, doi: 10.1007/978-981-99-4430-9_17.
[17] D. Li, Z. Liu, J. Pan, and B. Li, “Evaluating the Resilience of Software Architecture Based on Minimal Path,” in Proceedings - 2023 10th International Conference on Dependable Systems and Their Applications, DSA 2023, 2023, pp. 42 – 53. doi: 10.1109/DSA59317.2023.00016.
[18] S. Singla, M. Rani, S. Rani, and U. M. Modibbo, “EVALUATING THE IMPACT OF PARAMETER SENSITIVITY IN META-HEURISTICS ON RELIABILITY PERFORMANCE,” Reliab. Theory Appl., vol. 20, no. 2, pp. 214 – 226, 2025, doi: 10.24412/1932-2321-2025-284-214-226.
[19] D. E. Adjepon-Yamoah, “Cloud-ATAM: Method for analysing resilient attributes of cloud-based architectures,” Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), vol. 9823 LNCS, pp. 105 – 114, 2016, doi: 10.1007/978-3-319-45892-2_8.
[20] G. Paparistodimou, P. Knight, M. Robb, and G. Hughes, “Dynamic Dual-Layer Network Resilience Assessment as a System Architecting Tool,” J. Mech. Des., vol. 147, no. 11, 2025, doi: 10.1115/1.4068458.
[21] A. Wachtel et al., “pyRoCS: A Python package to evaluate the resilience of complex systems,” SoftwareX, vol. 29, 2025, doi: 10.1016/j.softx.2024.101977.
[22] Upendra, R. Tripathi, T. P. Sahu, and G. Verma, Scalability and reliability issues for edge intelligence: Challenges and future directions. 2025. doi: 10.1201/9781003512066-14.
[23] S. Kadri, S. Aouag, and D. Hedjazi, “MS-QuAAF: A generic evaluation framework for monitoring software architecture quality,” Inf. Softw. Technol., vol. 140, 2021, doi: 10.1016/j.infsof.2021.106713.


