A Survey on Data Mining Algorithms and Techniques in Medicine

Kasra Madadipouya

doi:10.30630/joiv.1.3.25

Kasra Madadipouya - Asia Pacific University of Technology & Innovation, Kuala Lumpur, Malaysia

Abstract

The medical decision support systems (MDSS) industry collects a huge amount of data that is neither properly mined nor put to optimal use. This data may contain valuable knowledge, encapsulated in hidden patterns and regularities, that awaits extraction. Such knowledge may prove priceless in future medical decision making. Available medical decision support systems are based on static data, which may be out of date. A medical decision support system that can learn the relationships between patient histories, diseases in the population, symptoms, the pathology of a disease, family history, and test results would therefore be useful to physicians and hospitals. This paper provides an in-depth review of available data mining algorithms and techniques. It also discusses data mining applications in medicine, techniques for evaluating them, and existing applications of performance metrics.

Keywords

Data Mining; Classification; Decision Tree; Neural Network; Bayesian Network Classifier; Evaluation Metrics
