An empirically based object-oriented testing using Machine learning




Software defect, Machine learning, Early defect prediction, Software Quality assurance, stacking classifier


INTRODUCTION: The rapid growth of machine learning has the potential to revolutionize various industries and applications by automating complex tasks and enhancing efficiency. Effective software testing is crucial for ensuring software quality and minimizing resource expenses in software engineering. Machine learning techniques play a vital role in software testing by aiding in test case prioritization, predicting software defects, and analyzing test results.

OBJECTIVES: The primary objective of this study is to explore the use of machine learning algorithms for software defect prediction.

METHODS: Machine Learning models including Random Forest Classifier, Logistic Regression, K Nearest Neighbors, Gradient Boosting Classifiers, Catboost Classifier, and Convolutional Neural Networks have been employed for the study. The dataset includes a wide range of features relevant to software defect prediction and evaluates the performance of different prediction models. The study also focussed on developing hybrid models using stacking classifiers, which combine multiple individual models to improve accuracy.

RESULTS: The experimental results show that the hybrid models combining CatBoost and Convolutional Neural Network have outperformed individual models, achieving the highest accuracy of 89.5%, highlighting the effectiveness of combining machine learning algorithms for software defect prediction.

CONCLUSION: In conclusion, this study sheds light on the pivotal role of machine learning in enhancing software defect prediction.


Download data is not yet available.
<br data-mce-bogus="1"> <br data-mce-bogus="1">


M. Assim, Q. Obeidat and M. Hammad, "Software Defects Prediction using Machine Learning Algorithms," 2020 International Conference on Data Analytics for Business and Industry: Way Towards a Sustainable Economy (ICDABI), Sakheer, Bahrain, 2020, pp. 1-6, doi: 10.1109/ICDABI51230.2020.9325677. DOI:

P. Tadapaneni, N. C. Nadella, M. Divyanjali and Y. Sangeetha, "Software Defect Prediction based on Machine Learning and Deep Learning," 2022 International Conference on Inventive Computation Technologies (ICICT), Nepal, 2022, pp. 116-122, doi: 10.1109/ ICICT54344.2022.9850643. DOI:

Z. Tian, J. Xiang, S. Zhenxiao, Z. Yi and Y. Yunqiang, "Software Defect Prediction based on Machine Learning Algorithms," 2019 IEEE 5th International Conference on Computer and Communications (ICCC), Chengdu, China, 2019, pp. 520-525, doi: 10.1109/ICCC47050.2019.9064412. DOI:

M. Cetiner and O. K. Sahingoz, "A Comparative Analysis for Machine Learning based Software Defect Prediction Systems," 2020 11th International Conference on Computing, Communication and Networking Technologies (ICCCNT), Kharagpur, India, 2020, pp. 1-7, doi: 10.1109/ICCCNT49239.2020.9225352. DOI:

S. A. K, V. Gururaj, K. R. Umadi, M. Kumar, S. P. Shankar and D. Varadam, "Comprehensive Survey of different Machine Learning Algorithms used for Software Defect Prediction," 2022 International Conference on Decision Aid Sciences and Applications (DASA), Chiangrai, Thailand, 2022, pp. 425-430, doi: 10.1109/DASA54658.2022.9764982. DOI:

Razauddin, S. Madhuri G, A. Oberoi, A. Vats, A. Sivasangari and K. Siwach, "Research on Efficient Software Defect Prediction Using Deep Learning Approaches," 2022 2nd International Conference on Technological Advancements in Computational Sciences (ICTACS), Tashkent, Uzbekistan, 2022, pp. 549-554, doi: 10.1109/ICTACS56270.2022.9988292. DOI:

J. Xu, L. Yan, F. Wang and J. Ai, "A GitHub-Based Data Collection Method for Software Defect Prediction," 2019 6th International Conference on Dependable Systems and Their Applications (DSA), Harbin, China, 2020, pp. 100-108, doi: 10.1109/DSA.2019.00020. DOI:

S. Krishnan, J. Wang, E. Wu, M. J. Franklin, and K. Goldberg. Activeclean: Interactive data cleaning while learning convex loss models. In Arxiv: http:// pdf/ 1601.03797.pdf , 2015. DOI:

Youchen Miao, Zeyu Jin, Yumeng Zhang, Yuchen Chen, and Junren Lai. 2022. Compare Machine Learning Models in Text Classification Using Steam User Reviews. In 2021 3rd International Conference on Software Engineering and Development (ICSED) (ICSED 2021). Association for Computing Machinery, New York, NY, USA, 40–45. DOI:

Lawrence Mitchell, Terence M. Sloan, Muriel Mewissen, Peter Ghazal, Thorsten Forster, Michal Piotrowski, and Arthur S. Trew. 2011. A parallel random forest classifier for R. In Proceedings of the second international workshop on Emerging computational methods for the life sciences (ECMLS '11). Association for Computing Machinery, New York, NY, USA, 1–6. DOI:

Hans Jørgen Andersen, Ann Morrison, and Lars Knudsen. 2012. Modeling vibrotactile detection by logistic regression. In Proceedings of the 7th Nordic Conference on Human-Computer Interaction: Making Sense Through Design (NordiCHI '12). Association forComputing Machinery, New York, NY, USA, 500–503. DOI:

Bromley and Säckinger. Neural-network and K-nearestneighbor classifiers. Technical Report 11359-910819-16TM AT&T 1991

Manuel Fernández-Delgado, Eva Cernadas, Senén Barro, and Dinani Amorim. Do we need hundreds of classifiers to solve real world classification problems? J. Mach. Learn. Res, 15(1):3133–3181, 2014.

Yunsang Joo, Seungwon Lee, Hyoungju Kim, Pankoo Kim, Seongoun Hwang, and Chang Choi. 2021. Efficient healthcare service based on Stacking Ensemble. In Proceedings of the 2020 ACM International Conference on Intelligent Computing and its Emerging Applications (ACM ICEA '20). Association for Computing Machinery, New York, NY, USA, Article 28, 1–5. DOI:

Lee Soo-eun, Kim Han-joon. (2020). A new ensemble learning technique with multiple stacking. Journal of the Korea Electronic Trade Association, 25(3) and 1-13.

Christian Federmann. 2012. Can machine learning algorithms improve phrase selection in hybrid machine translation? In Proceedings of the Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra) (EACL 2012). Association for Computational Linguistics, USA, 113–118.

Ghosh, H., Tusher, M.A., Rahat, I.S., Khasim, S., Mohanty, S.N. (2023). Water Quality Assessment Through Predictive Machine Learning. In: Intelligent Computing and Networking. IC-ICN 2023. Lecture Notes in Networks and Systems, vol 699. Springer, Singapore. DOI:

Rahat IS, Ghosh H, Shaik K, Khasim S, Rajaram G. Unraveling the Heterogeneity of Lower-Grade Gliomas: Deep Learning-Assisted Flair Segmentation and Genomic Analysis of Brain MR Images. EAI Endorsed Trans Perv Health Tech [Internet]. 2023 Sep. 29 [cited 2023 Oct. 2];9. DOI:

Ghosh H, Rahat IS, Shaik K, Khasim S, Yesubabu M. Potato Leaf Disease Recognition and Prediction using Convolutional Neural Networks. EAI Endorsed Scal Inf Syst [Internet]. 2023 Sep. 21 DOI:

Mandava, S. R. Vinta, H. Ghosh, and I. S. Rahat, “An All-Inclusive Machine Learning and Deep Learning Method for Forecasting Cardiovascular Disease in Bangladeshi Population”, EAI Endorsed Trans Perv Health Tech, vol. 9, Oct. 2023. DOI:

Mandava, M.; Vinta, S. R.; Ghosh, H.; Rahat, I. S. Identification and Categorization of Yellow Rust Infection in Wheat through Deep Learning Techniques. EAI Endorsed Trans IoT 2023, 10. DOI:

Khasim, I. S. Rahat, H. Ghosh, K. Shaik, and S. K. Panda, “Using Deep Learning and Machine Learning: Real-Time Discernment and Diagnostics of Rice-Leaf Diseases in Bangladesh”, EAI Endorsed Trans IoT, vol. 10, Dec. 2023 DOI:

Khasim, H. Ghosh, I. S. Rahat, K. Shaik, and M. Yesubabu, “Deciphering Microorganisms through Intelligent Image Recognition: Machine Learning and Deep Learning Approaches, Challenges, and Advancements”, EAI Endorsed Trans IoT, vol. 10, Nov. 2023. DOI:

Mohanty, S.N.; Ghosh, H.; Rahat, I.S.; Reddy, C.V.R. Advanced Deep Learning Models for Corn Leaf Disease Classification: A Field Study in Bangladesh. Eng. Proc. 2023, 59, 69. DOI:

Alenezi, F.; Armghan, A.; Mohanty, S.N.; Jhaveri, R.H.; Tiwari, P. Block-Greedy and CNN Based Underwater Image Dehazing for Novel Depth Estimation and Optimal Ambient Light. Water 2021, 13, 3470. DOI:




How to Cite

P. Sindhu, G. S. Peruri, and M. Yalavarthi, “An empirically based object-oriented testing using Machine learning”, EAI Endorsed Trans IoT, vol. 10, Mar. 2024.