Software Evolution Prediction Using Machine Learning Algorithms
Keywords:
Software Engineering, Software Evolution, Artificial Intelligence (AI), Machine Learning (ML), RegressionAbstract
Software evolution represents the most time-consuming phase in the software development life cycle (SDLC) after a software release. Further, it covers an increasingly significant role in modern software development practices (e.g., Agile, DevOps, CI/CD) and web development (e.g., React, Next.js, etc.), where teams contribute more effort in improving and maintaining existing systems than building new ones. These evolutionary actions such as modifying and adding features, incorporating contributions, and fixing issues generate large software project data that demonstrate the dynamics of software development. Software evolution in machine learning (ML) techniques comprises the unending adaptation of models, data, and pipelines to continue performance under changing domains. In this paper, we are predicting the software evolution by analyzing features like Repository name, Repository link, Commits, Issues, Pull Requests, Stars, Forks, Total Contributors, Top Contributors, Code lines cover with Simple Linear Regression, Multiple Linear Regression, Polynomial Regression, and Support Vector Regression (SVR), are methods of ML. Thus, it is based on the results obtained a comparative study is conducted to evaluate and analyze model accuracy.
Downloads
References
Maryam Navaei, Nasseh Tabrizi, Machine Learning in Software Development Life Cycle: A Comprehensive Review, 17th International Conference on Evaluation of Novel Approaches to Software Engineering (ENASE 2022), pages 344-354, Science and Technology Publications, 2022. DOI: DOI: 10.5220/0011040600003176.
Vamsi Krishna Thatikonda, Machine Learning in Software Engineering: Practical Applications and Impact on The Modern Software Industry, International Journal of Artificial Intelligence & Machine Learning, Volume 2, Issue 1, January- December 2023, Pages 35-43, Page Count – 9. DOI: https://doi.org/10.17605/OSF.IO/6KX3.
Saad Shafiq, Atif Mashkoor, Christoph Mayr-Dorn, Alexander Egyed, A Literature Review of Using Machine Learning in Software Development Life Cycle Stages, IEEE Access (Volume: 9), 13 October 2021, IEEE. DOI: 10.1109/ACCESS.2021.3119746.
G. R. Gupta, N. k. Nagwani, Data Analytics and Visualization, PHI Learning Pvt. Ltd., 2026.
Swapnil Saurav, Data Science Machine Learning, Eka Publishers, 2020.
Raed Shatnawi, predicting software change-proneness from software evolution using machine learning methods. Interdisciplinary Journal of Information, Knowledge, and Management, 18, 769-790, 2023. DOI: https://doi.org/10.28945/5193.
Dr. Md Jaffar Sadiq, Shaik Ruma Sufian, K. Sowmya Reddy, Madapati Vinisha, Software Defect Estimation Using Machine Learning, International Journal for Research in Applied Science & Engineering Technology (IJRASET), 2020. DOI: https://doi.org/10.22214/IJRASET.2022.44551.
Marcos P.S. Gôlo, Adailton F. Araújo, Rafael G. Rossi, Ricardo M. Marcacini, Detecting relevant app reviews for software evolution and maintenance through multimodal one-class learning, Information and Software Technology, Volume 151, November 2022. DOI: https://doi.org/10.1016/j.infsof.2022.106998.
SHAHO HASSEN, Prof. Dr. Ali YAZICI, Prof. Dr. Alok MISHRA, A New Method for Software Defect Prediction Based on Optimized Machine Learning Techniques, Eurasian Journal of Engineering and Technology (EJET), 2023.
Sayed Moshin Reza, Md. Mahfujur Rahman, Hasnat Parvez, Omar Badreddin, Shamim Al Mamun, Performance Analysis of Machine Learning Approaches in Software Complexity Prediction, Proceedings of International Conference on Trends in Computational and Cognitive Engineering, Advances in Intelligent Systems and Computing 1309, Springer Nature Singapore Pte Ltd. 2021. DOI: https://doi.org/10.1007/978-981-33-4673-4_3.
Somya Goyal, Pradeep Kumar Bhatia, Software Quality Prediction Using Machine Learning Techniques, Innovations in Computational Intelligence and Computer Vision, Advances in Intelligent Systems and Computing 1189, Springer Nature Singapore Pte Ltd. 2021. DOI: https://doi.org/10.1007/978-981-15-6067-5_62.
Hongbing Wang, Haoran Ji, Evolution Model of Open-Source Software Projects in GitHub, 2022 IEEE 2nd International Conference on Software Engineering and Artificial Intelligence (SEAI), IEEE, 2022. DOI:10.1109/SEAI55746.2022.9832099.
Miryung Kim, Na Meng, Tianyi Zhang, Handbook of Software Engineering, Springer Nature Switzerland AG, 2019, DOI: https://doi.org/10.1007/978-3-030-00262-6_6.
IHAB F. ILYAS, THEODOROS REKATSINAS, Machine Learning and Data Cleaning: Which Serves the Other? Journal of Data and Information Quality, Vol. 14, No. 3, July 2022.DOI: https://doi.org/10.1145/3506712ACM.
Downloads
Published
How to Cite
Issue
Section
ARK
License
Copyright (c) 2026 Rajeeb Bal, Jibendu Mantri

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Copyright on any article published in the International Journal of Computer Engineering and Data Science (IJCEDS) is retained by the author(s). All articles are published under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0), which permits any non-commercial use, distribution, and reproduction in any medium, provided that the original work is properly cited.
License Agreement
By submitting and publishing their work in IJCEDS, the authors:
-
Grant IJCEDS the non-exclusive right to publish the article and to identify IJCEDS as the original publisher.
-
Authorize any third party to use, share, and reproduce the article for non-commercial purposes, provided that appropriate credit is given to the original authors and source, and a link to the license is included.
