Vol 18 no.2 2018

Yakub Kayode SAHEED1, Abdulsabur Oluseye AKANNI1, Maruf O ALIMI1, F.E. HAMZA-USMAN2

1Department of Computer Science, Al-Hikmah University, Ilorin, Nigeria; 2Department of Computer Science, University of Ilorin, Nigeria


Breast cancer (BC) is one of the leading cancers for women when compared to all other cancers. It is a killer disease prominent and most frequent type of cancer affecting women worldwide and is increasing particularly in Africa. The aim of this paper is to investigate the influence of data preprocessing based on dicretization in the classification of BC. Two different classification algorithms Support vector machine-Radial basis function (SVM-RBF) and Adaboost algorithm were employed. We analyzed the BC data available from the Wisconsin dataset from UCI machine learning repository. The experiment was performed in Waikato Environment For knowledge analysis (Weka) software. The experimental results showed that discretized SVM-RBF and discretized Adaboost algorithms outperforms the non-discretized SVM-RBF and non-discretized Adaboost algorithms in terms of accuracy, precision, recall, f-measure and time taken to build the model

Full Text: