Cluster Analysis on Longitudinal Data of Patients with Kidney Dialysis using a Smoothing Cubic B-Spline Model

Authors

  • Noor Nawzat Ahmed Department of Statistics, College of Administration and Economics, University of Baghdad, Iraq. https://orcid.org/0009-0008-9717-9516 Author
  • Suhail Najm Abdullah Department of Statistics, College of Administration and Economics, University of Baghdad, Iraq. https://orcid.org/0009-0007-1753-5967 Author

DOI:

https://doi.org/10.59543/ijmscs.v2i.8337

Keywords:

cubic B-spline, cubic spline penalty (CSP), ADMM algorithm, kidney failure, nonparametric pairwise grouping (NPG)

Abstract

Longitudinal data analysis is gaining prominence, particularly in fields such as medicine and economics. This research concerns the collection and analysis of longitudinal data, with a specific focus on cluster analysis. It emphasizes the nonparametric cubic B-spline model, which is smooth and flexible and can capture intricate patterns and fluctuations in the data owing to the continuity of its derivatives.
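As a minimal illustration of the cubic B-spline basis mentioned above, the following Python sketch evaluates the basis functions with the standard Cox-de Boor recursion. The function names and the knot vector are hypothetical choices made for this example; they are not taken from the article, whose computations were carried out in R.

```python
def bspline_basis(i, k, knots, x):
    """Value at x of the i-th B-spline basis function of degree k (Cox-de Boor recursion)."""
    if k == 0:
        # Degree 0: indicator of the half-open knot span [t_i, t_{i+1}).
        return 1.0 if knots[i] <= x < knots[i + 1] else 0.0
    left = right = 0.0
    d1 = knots[i + k] - knots[i]
    if d1 > 0:  # skip terms with a zero-width span (repeated knots)
        left = (x - knots[i]) / d1 * bspline_basis(i, k - 1, knots, x)
    d2 = knots[i + k + 1] - knots[i + 1]
    if d2 > 0:
        right = (knots[i + k + 1] - x) / d2 * bspline_basis(i + 1, k - 1, knots, x)
    return left + right


def cubic_design_row(x, knots):
    """One row of the cubic B-spline design matrix: all basis functions evaluated at x."""
    n_basis = len(knots) - 4  # number of basis functions = number of knots - (degree + 1)
    return [bspline_basis(i, 3, knots, x) for i in range(n_basis)]


# Hypothetical clamped knot vector on [0, 1] with a single interior knot at 0.5.
knots = [0, 0, 0, 0, 0.5, 1, 1, 1, 1]
row = cubic_design_row(0.25, knots)
```

Stacking one such row per observation time gives a design matrix, and each subject's curve is then summarized by the coefficient vector that multiplies it. Inside the knot range the basis values are nonnegative and sum to one.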
To perform the clustering, a penalization method was employed. It groups balanced longitudinal data by penalizing the pairwise distances between cubic B-spline model coefficients with a penalty function, such as the recently devised concave penalty functions. The cubic spline penalty (CSP), one of the pairwise distance penalties, is used with the nonparametric pairwise grouping (NPG) method. Model selection criteria, such as the Bayesian information criterion (BIC), help determine the number of clusters. Optimization methods, including the alternating direction method of multipliers (ADMM) algorithm, are applied to obtain approximate solutions in the R statistical software.
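The idea of penalizing pairwise distances between coefficient vectors can be sketched as follows. This abstract does not give the formula of the CSP function, so the sketch swaps in the minimax concave penalty (MCP), a well-known concave penalty, purely as a stand-in. All function names, the tuning values `lam` and `gamma`, and the toy coefficients are assumptions made for illustration. The ADMM solver and the BIC-based choice of the number of clusters are not shown.

```python
import math


def mcp(t, lam, gamma):
    """Minimax concave penalty (MCP); a stand-in here, NOT the article's CSP function."""
    t = abs(t)
    if t <= gamma * lam:
        return lam * t - t * t / (2.0 * gamma)
    return 0.5 * gamma * lam * lam  # the penalty is flat beyond gamma * lam


def dist(a, b):
    """Euclidean distance between two coefficient vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def pairwise_penalty(coefs, lam, gamma):
    """Sum of the concave penalty over all pairs of subjects' coefficient vectors."""
    n = len(coefs)
    return sum(mcp(dist(coefs[i], coefs[j]), lam, gamma)
               for i in range(n) for j in range(i + 1, n))


def groups_from_coefs(coefs, tol=1e-6):
    """Put two subjects in the same cluster when their coefficients are fused (distance <= tol)."""
    n = len(coefs)
    labels = list(range(n))
    for i in range(n):
        for j in range(i + 1, n):
            if dist(coefs[i], coefs[j]) <= tol:
                old, new = labels[j], labels[i]
                labels = [new if lab == old else lab for lab in labels]
    return labels


# Toy fitted coefficients for five hypothetical subjects, already fused into two groups.
coefs = [[1.0, 2.0], [1.0, 2.0], [5.0, 0.0], [5.0, 0.0], [1.0, 2.0]]
penalty = pairwise_penalty(coefs, lam=1.0, gamma=3.0)
labels = groups_from_coefs(coefs)
```

Pairs with identical coefficients contribute nothing to the penalty, while well-separated pairs contribute a bounded amount. That is how a concave pairwise penalty lets distinct groups stay apart while pulling similar subjects together.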
In a simulation study, balanced longitudinal data for 60 and 100 subjects were generated, with ten replicates each. The experiments demonstrated the effectiveness of the CSP function in the clustering process.
For the practical application, the study analyzed data from kidney failure patients, collected at Ibn Sina Teaching Hospital for Dialysis in Mosul over seven consecutive months in 2023. The NPG method and the CSP function were used, yielding two groups based on the glomerular filtration rate of the kidneys. According to medical criteria, this rate determines the required dialysis frequency: either two or three times a week.

Published

2023-11-26

How to Cite

Noor Nawzat Ahmed, & Suhail Najm Abdullah. (2023). Cluster Analysis on Longitudinal Data of Patients with Kidney Dialysis using a Smoothing Cubic B-Spline Model. International Journal of Mathematics, Statistics, and Computer Science, 2, 85-95. https://doi.org/10.59543/ijmscs.v2i.8337

Section

Articles