Data Mining for Healthcare Data: A Comparison of Neural Networks Algorithms
DOI:
https://doi.org/10.31154/cogito.v3i1.40.10-19Abstract
Classification has been considered as an important tool utilized for the extraction of useful information from healthcare dataset. It may be applied for recognition of disease over symptoms. This paper aims to compare and evaluate different approaches of neural networks classification algorithms for healthcare datasets. The algorithms considered here are Multilayer Perceptron, Radial Basis Function, and Voted Perceptron which are tested based on resulted classifiers accuracy, precision, mean absolute error and root mean squared error rates, and classifier training time. All the algorithms are applied for five multivariate healthcare datasets, Echocardiogram, SPECT Heart, Chronic Kidney Disease, Mammographic Mass, and EEG Eye State datasets. Among the three algorithms, this study concludes the best algorithm for the chosen datasets is Multilayer Perceptron. It achieves the highest for all performance parameters tested. It can produce high accuracy classifier model with low error rate, but suffer in training time especially of large dataset. Voted Perceptron performance is the lowest in all parameters tested. For further research, an investigation may be conducted to analyze whether the number of hidden layer in Multilayer Perceptron’s architecture has a significant impact on the training time.References
Han J, Kamber M. Data Mining Concepts and Techniques, Academic Press: USA, 2001.
Witten I H, Frank E. Data Mining Practical Machine Learning Tools and Techniques. 2nd edn. Morgan Kaufmann, 2005.
WEKA. http://www.cs.waikato.ac.nz/~ml/weka. Date Accessed: 14/02/2015.
UCI. https://archive.ics.uci.edu/ml/datasets.html. Date Accessed: 16/02/2015.
Venkatesann E, Velmurugan T. Performance Analysis of Decisin Tree Algorithms for Breast Cancer Classification. Indian Journal of Science and Technology. 2015 Nov; 8 (29).
Rahman R.M, Afroz F. Comparison of Various Classification Techniques Using Different Data Mining Tools for Diabetes Diagnosis. Journal of Software Engineering and Applications. 2013; 6: 85-97.
Akinola SO, Oyabugbe OJ. Accuracies and Training Time of Data Mining Clasification Algorithms: an Empirical Comparative Study. Journal of Software Engineering and Applications. 2015 Sept; 8: 470-477.
Danjuma K, Osofisan A. Evaluation of Predictive Data Mining Algorithms in Erythemato-Squamous Disease Diagnosis. International Journal of Computer Science Issues. 2014; 11(6): 85-94
Alkrimi, et.al. Comparative Study Using Weka for Red Blood Cells Classification. International Journal of Medical, Health, Pharmaceutical and Biomedical Engineering. 2015; 9(1): 19-22.
Amin MN, Habib MA. Comparison of Different Classificaiton Techniques Using WEKA for Hematological Data. American Journal of Engineering Research. 2015; 4 (3): 55-61.
Durairaj, M, Deepika, R. Comparative Analysis of Classificatin Algorithms for the Prediction of Leukimia Cancer. International Journal of Advanced Research in Computer Science and Software Engineering. 2015 Aug; 5 (8): 787-791.
Barnaghi PM, Sahzabi VA, Bakar AA. A Comparative Study for Various Methods of Classification. Proc. of Int. Conf. on Informatin and Computer Networks, Singapore, 2012.
Gupta N, Rawal A, Narasimhan VL, Shiwani S. Accuracy, Sensitivity and Specifity Measurement of Various Classificatin Techniques on Healthcare Data. IOSR Journal of Computer Engineering. 2013 May-June; 11 (5): 70-73.
Kumar Y, Sahoo G. Analysis of Bayes, Neural Network and Tree Classifier of Classification Technique in Data Mining using WEKA. Computer Science and Information Technology. 2012; 2 (2): 359-369.
Zhang, G.P. Neural Networks for Data Mining. In: Data Mining and Knowledge Discovery Handbook, 2nd edn., Springer, 2010; 419-444.
Nookala, G. K. M, Pottumuthu, B. K, Orsu, N, Mudunuri, S. B. Performance Analysis and Evaluation of Different Data Mining Algorithms used for Cancer Classification. International Journal of Advanced Research in Artificial Intelligence. 2013; 2(5): 49-55.
Mala, V, Lobiyal, D. K. Evaluation and Performance of Classification Methods for Medical Data Sets. International Journal of Advanced Research in Computer Science and Software Engineering. 2015 Nov; 5 (11): 336-340.
Roy, S, Mohapatra, A. Performance Analysis of Machine Learning Techniques in Micro Array Data Classification. International Journal of Software and Web Sciences. MarchMay 2013; 4 (1): 20-25.
Downloads
Published
How to Cite
Issue
Section
License
Authors who publish with this journal agree to the following terms:- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).