Data Mining for Healthcare Data: A Comparison of Neural Networks Algorithms

Debby E. Sondakh

Abstract


Classification has been considered as an important tool utilized for the extraction of useful information from healthcare dataset. It may be applied for recognition of disease over symptoms. This paper aims to compare and evaluate different approaches of neural networks classification algorithms for healthcare datasets. The algorithms considered here are Multilayer Perceptron, Radial Basis Function, and Voted Perceptron which are tested based on resulted classifiers accuracy, precision, mean absolute error and root mean squared error rates, and classifier training time. All the algorithms are applied for five multivariate healthcare datasets, Echocardiogram, SPECT Heart, Chronic Kidney Disease, Mammographic Mass, and EEG Eye State datasets. Among the three algorithms, this study concludes the best algorithm for the chosen datasets is Multilayer Perceptron. It achieves the highest for all performance parameters tested. It can produce high accuracy classifier model with low error rate, but suffer in training time especially of large dataset. Voted Perceptron performance is the lowest in all parameters tested. For further research, an investigation may be conducted to analyze whether the number of hidden layer in Multilayer Perceptron’s architecture has a significant impact on the training time.

Full Text:

PDF

References


Han J, Kamber M. Data Mining Concepts and Techniques, Academic Press: USA, 2001.

Witten I H, Frank E. Data Mining Practical Machine Learning Tools and Techniques. 2nd edn. Morgan Kaufmann, 2005.

WEKA. http://www.cs.waikato.ac.nz/~ml/weka. Date Accessed: 14/02/2015.

UCI. https://archive.ics.uci.edu/ml/datasets.html. Date Accessed: 16/02/2015.

Venkatesann E, Velmurugan T. Performance Analysis of Decisin Tree Algorithms for Breast Cancer Classification. Indian Journal of Science and Technology. 2015 Nov; 8 (29).

Rahman R.M, Afroz F. Comparison of Various Classification Techniques Using Different Data Mining Tools for Diabetes Diagnosis. Journal of Software Engineering and Applications. 2013; 6: 85-97.

Akinola SO, Oyabugbe OJ. Accuracies and Training Time of Data Mining Clasification Algorithms: an Empirical Comparative Study. Journal of Software Engineering and Applications. 2015 Sept; 8: 470-477.

Danjuma K, Osofisan A. Evaluation of Predictive Data Mining Algorithms in Erythemato-Squamous Disease Diagnosis. International Journal of Computer Science Issues. 2014; 11(6): 85-94

Alkrimi, et.al. Comparative Study Using Weka for Red Blood Cells Classification. International Journal of Medical, Health, Pharmaceutical and Biomedical Engineering. 2015; 9(1): 19-22.

Amin MN, Habib MA. Comparison of Different Classificaiton Techniques Using WEKA for Hematological Data. American Journal of Engineering Research. 2015; 4 (3): 55-61.

Durairaj, M, Deepika, R. Comparative Analysis of Classificatin Algorithms for the Prediction of Leukimia Cancer. International Journal of Advanced Research in Computer Science and Software Engineering. 2015 Aug; 5 (8): 787-791.

Barnaghi PM, Sahzabi VA, Bakar AA. A Comparative Study for Various Methods of Classification. Proc. of Int. Conf. on Informatin and Computer Networks, Singapore, 2012.

Gupta N, Rawal A, Narasimhan VL, Shiwani S. Accuracy, Sensitivity and Specifity Measurement of Various Classificatin Techniques on Healthcare Data. IOSR Journal of Computer Engineering. 2013 May-June; 11 (5): 70-73.

Kumar Y, Sahoo G. Analysis of Bayes, Neural Network and Tree Classifier of Classification Technique in Data Mining using WEKA. Computer Science and Information Technology. 2012; 2 (2): 359-369.

Zhang, G.P. Neural Networks for Data Mining. In: Data Mining and Knowledge Discovery Handbook, 2nd edn., Springer, 2010; 419-444.

Nookala, G. K. M, Pottumuthu, B. K, Orsu, N, Mudunuri, S. B. Performance Analysis and Evaluation of Different Data Mining Algorithms used for Cancer Classification. International Journal of Advanced Research in Artificial Intelligence. 2013; 2(5): 49-55.

Mala, V, Lobiyal, D. K. Evaluation and Performance of Classification Methods for Medical Data Sets. International Journal of Advanced Research in Computer Science and Software Engineering. 2015 Nov; 5 (11): 336-340.

Roy, S, Mohapatra, A. Performance Analysis of Machine Learning Techniques in Micro Array Data Classification. International Journal of Software and Web Sciences. MarchMay 2013; 4 (1): 20-25.


Refbacks

  • There are currently no refbacks.



CogITo Smart Journal
A publication of Fakultas Ilmu Komputer, Universitas Klabat
In partnership with Coris and IndoCEISS
Telpon: +62 (431) 891035
email: editorial.cogito@unklab.ac.id | web: http://cogito.unklab.ac.id/index.php/cogito

Flag Counter

CogITo Smart Journal is indexed by:
DOAJ    SINTA Logo Ristek DIKTI     Indonesia OneSearch by Perpusnas    Google Scholar


CogITo Smart Journal is licensed under a Creative Commons Attribution 4.0 International License
Lisensi Creative Commons