Cluster Analysis on Longitudinal Data of Patients with Kidney Dialysis using a Smoothing Cubic B-Spline Model

Authors

  • Noor Seyala Department of Statistics, College of Administration and Economics, University of Baghdad, Iraq
  • Suhail Najm Abdullah Department of Statistics, College of Administration and Economics, University of Baghdad, Iraq

DOI:

https://doi.org/10.59543/ijmscs.v2i.8337

Keywords:

cubic B-spline, cubic spline penalty , ADMM algorithm, kidney failure, nonparametric pairwise grouping

Abstract

Longitudinal data analysis is gaining prominence, particularly in medicine and economics. This research centers on collecting and analyzing longitudinal data, specifically cluster analysis. The thesis emphasizes the nonparametric cubic B-spline model, known for its smoothness, flexibility, and ability to capture intricate patterns and data fluctuations due to the continuity of its derivatives. The penalization method was employed to accomplish clustering. It categorizes longitudinally balanced data by penalizing the pairwise distances between cubic B-spline model coefficients using a penalization function, such as the recently devised concave penalization function. The cubic spline penalty CSP, part of the pair distance penalty, employs the nonparametric pairwise grouping (NPG) method. Model selection criteria, like Bayesian Information Criteria (BIC), help determine the number of clusters. Optimization methods, including the alternative direction method of the multiplier ADMM algorithm, are applied to approximate solutions within the R statistical program. A simulation study generated balanced longitudinal data for 60 and 100 subjects with ten replicates each. The experiments demonstrated the effectiveness of the CSP penalty function in the clustering process. For practical application, the study involved the analysis of data from kidney failure patients collected from Ibn Sina Teaching Hospital for Dialysis in Mosul over seven consecutive months in 2023. The NPG aggregation method and CSP penalty functions were used, resulting in two groups based on the glomerular filtration rate of the kidneys. According to medical criteria, this rate determines the required dialysis frequency twice a week or thrice.

Downloads

Published

2023-11-26

How to Cite

Seyala, N., & Abdullah, S. N. (2023). Cluster Analysis on Longitudinal Data of Patients with Kidney Dialysis using a Smoothing Cubic B-Spline Model. International Journal of Mathematics, Statistics, and Computer Science, 2, 85–95. https://doi.org/10.59543/ijmscs.v2i.8337

Issue

Section

Original Research Articles