The Comparisons of Four Splitting Rules for Fitting a Classification Tree with Simulation and an Application Related to Albuminuria Data in Type 2 Diabetes Mellitus

Authors

  • Imran Kurt Omurlu Adnan Menderes University
  • Mevlut Ture Adnan Menderes University
  • Mustafa Unubol Adnan Menderes University
  • Merve Katranci Adnan Menderes University
  • Engin Guney Adnan Menderes University

Keywords:

albuminuria, classification trees, simulation, Gini, splitting rule

Abstract

The objective of this study was to compare the performances of splitting rules for predicting an ordinal response with simulation and a real data set. In the case of simulations, we compared across the methods using different sample sizes and the number of independent variables by employing the Monte Carlo simulation method. In the real data application, an analysis was performed with 265 cases. The results showed that the performances of the generalized Gini with the linear and quadratic costs of misclassification were better suited for analysis based on the gamma ordinal association measure and misclassification error rate than the other approaches. According to the gamma ordinal association measure, the generalized Gini (linear and quadratic) to the major risk factors determined for albuminuria in type 2 diabetes mellitus patients showed a slightly better performance than the other approaches. The predictive capability of splitting rules based on generalized Gini for predicting an ordinal response can be used for different sample sizes, number of independent variables and potential future suitable classification data problems. Consequently, our study will move towards choosing the generalized Gini (linear or quadratic) as the splitting rule and evaluate the data by using the Classification Trees (CT) in future studies, focusing on predicting an ordinal response.

Author Biographies

Imran Kurt Omurlu, Adnan Menderes University

Medical Faculty,

Mevlut Ture, Adnan Menderes University

Medical Faculty, Department of Biostatistics

Mustafa Unubol, Adnan Menderes University

Medical Faculty, Division of Endocrinology

Merve Katranci, Adnan Menderes University

Medical Faculty, Department of Biostatistics

Engin Guney, Adnan Menderes University

Medical Faculty, Division of Endocrinology

References

H. Zhang.

L. Breiman, J.H. Friedman, R.A. Olshen, C.J. Stone. Classification and Regression Trees. New York: Chapman and Hall/CRC, 1984, pp. 93-126.

R. Timofeev.

K.J. Archer.

R. Piccarreta.

P. Radaelli, C.G. Borroni, M. Zenga.

G. Galimberti, G. Soffritti, M. Di Maso.

AnswerTree

K.P. Soman, S. Diwakar, V. Ajay. Insight into Data Mining: Theory and Practice. Delhi: Prentice-Hall of India Learning, 2006.

P.E. De Jong, R.T. Gansevoort, S.J. Bakker.

American Diabetes Association.

D.E. Busby, G.L. Bakris.

M. Unubol, M. Ayhan, E. Guney.

E. Guney, M. Unubol, V. Yazak, I. Kurt Omurlu.

E. Frank and M. Hall.

P.L.H. Yu, W.M. Wan, P.H. Lee.

J.W.T. Lee and D. Liu.

Y.S. Kim.

Y.J. Hu, T.H. Ku, R.H. Jan, K. Wang, Y.C. Tseng, S.F. Yang.

Downloads

Published

2014-07-18

How to Cite

Kurt Omurlu, I., Ture, M., Unubol, M., Katranci, M., & Guney, E. (2014). The Comparisons of Four Splitting Rules for Fitting a Classification Tree with Simulation and an Application Related to Albuminuria Data in Type 2 Diabetes Mellitus. International Journal of Sciences: Basic and Applied Research (IJSBAR), 17(1), 110–123. Retrieved from https://gssrr.org/index.php/JournalOfBasicAndApplied/article/view/2303

Issue

Section

Articles