FINDING FEATURES FOR THE PREDICTION OF BREAST CANCER USING THE K NEAREST NEIGHBORHOOD AND SUPPORT MACHINES METHODS


Abstract views: 145 / PDF downloads: 88

Authors

  • Serdar GÜNDOĞDU Öğr. Gör. Dr., Dokuz Eylül Üniversitesi, Bergama MYO, Bilgisayar Teknolojileri Bölümü

DOI:

https://doi.org/10.38065/euroasiaorg.151

Keywords:

Breast cancer diagnosis, Cross-validation, Feature selection, Classification, KNN, SVM

Abstract

Breast cancer, the most common type of cancer among women, affects millions of women every year. The aim of this study is to find properties for estimating breast cancer using two basic classifier methods and to compare the classifier performances. In the study, a ready data set including age, body mass index (BMI), glucose, resistin, insulin, homeostasis model assessment for insulin resistance (HOMA-IR), monocyte chemoattractant protein 1 (MCP1), leptin hormone and adiponectin hormone were used. 10 fold cross verification (10 fold) method was used while selecting the training and test data. Classification performance of SVM from classifiers was better than KNN. It was seen that the data set with 5 clinical features showed the best classification performance with 85.3% classification accuracy and 89.1% specificity (the rate of detecting those with cancer) values. It has been determined that the most suitable features for breast cancer prediction are age, BMI, resistin, glucose and adiponectin parameters.

Published

2020-08-11

How to Cite

GÜNDOĞDU, S. (2020). FINDING FEATURES FOR THE PREDICTION OF BREAST CANCER USING THE K NEAREST NEIGHBORHOOD AND SUPPORT MACHINES METHODS. Euroasia Journal of Mathematics, Engineering, Natural & Medical Sciences, 7(10), 149–153. https://doi.org/10.38065/euroasiaorg.151

Issue

Section

Articles