A Comparison of Software Defect Prediction Metrics Using Data Mining Algorithms

dc.contributor.authorAydın, Zeynep Behrin Güven
dc.contributor.authorŞamlı, Rüya
dc.date.accessioned2026-02-08T15:03:29Z
dc.date.available2026-02-08T15:03:29Z
dc.date.issued2020
dc.departmentBursa Teknik Üniversitesi
dc.description118E682.
dc.description.abstractData mining is an interdisciplinary field that draws on methods such as machine learning, artificial intelligence, statistics, and deep learning. Classification is an important data mining technique and is widely used by researchers. In the software defect prediction literature, statistical methods and machine learning algorithms such as Decision Trees, Fuzzy Logic, Genetic Programming, Random Forest, Artificial Neural Networks, and Logistic Regression have generally been used. Performance measures such as Accuracy, Precision, Mean Absolute Error (MAE), and Root Mean Squared Error (RMSE) are used to evaluate these classifiers. In this paper, four data sets from the PROMISE repository (JM1, KC1, CM1, and PC1), created within the scope of NASA's publicly available Metrics Data Program, are examined, as in other software defect prediction studies in the literature. These data sets include Halstead, McCabe method-level, and some other class-level metrics. The data sets are analyzed with the Waikato Environment for Knowledge Analysis (WEKA) data mining tool. With this tool, classification algorithms such as Naive Bayes, SMO, K*, AdaBoostM1, J48, and Random Forest were applied to the NASA defect data sets in the PROMISE repository and their accuracy rates were compared. The highest accuracy, 94.13%, was obtained by the Bagging algorithm on the PC1 data set.
Keywords: Software Defect Prediction, McCabe, Halstead, Data Mining, Accuracy, Random Forest
Cite this paper as: GÜVEN AYDIN, Z.B., SAMLI, R. (2020). A Comparison of Software Defect Prediction Metrics Using Data Mining Algorithms. Journal of Innovative Science and Engineering. 4(1): 11-21
*Corresponding author: Zeynep Behrin GÜVEN AYDIN
E-mail: zeynepguven@maltepe.edu.tr
Received Date: 24/02/2020
Accepted Date: 05/05/2020
© Copyright 2020 by Bursa Technical University. Available online at http://jise.btu.edu.tr/
The works published in Journal of Innovative Science and Engineering (JISE) are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
dc.description.sponsorshipTÜBİTAK
dc.identifier.doi10.38088/jise.693098
dc.identifier.endpage21
dc.identifier.issn2602-4217
dc.identifier.issue1
dc.identifier.startpage11
dc.identifier.urihttps://doi.org/10.38088/jise.693098
dc.identifier.urihttps://hdl.handle.net/20.500.12885/4146
dc.identifier.volume4
dc.language.isoen
dc.publisherBursa Teknik Üniversitesi
dc.relation.ispartofJournal of Innovative Science and Engineering
dc.relation.publicationcategoryArticle - National Refereed Journal - Institutional Faculty Member
dc.rightsinfo:eu-repo/semantics/openAccess
dc.snmzKA_DergiPark_20260207
dc.subjectEngineering
dc.subjectMühendislik (Engineering)
dc.titleA Comparison of Software Defect Prediction Metrics Using Data Mining Algorithms
dc.typeArticle
