Heart Failure Patients Classification Using ML Algos

This study presents an optimized machine learning approach using Gradient Boosting Machine (GBM) and Adaptive Inertia Weight Particle Swarm Optimization (AIW-PSO) to predict survival in heart failure patients, achieving a test accuracy of 94%. The methodology includes techniques for addressing class imbalance and feature selection, highlighting the importance of hyperparameter tuning for improved model performance. The findings emphasize the potential of AIW-PSO in enhancing clinical decision-making tools for heart failure management.


Received 28 January 2025, accepted 5 February 2025, date of publication 11 February 2025, date of current version 19 February 2025.

Digital Object Identifier 10.1109/ACCESS.2025.3541069

Predicting the Classification of Heart Failure Patients Using Optimized Machine Learning Algorithms
MARZIA AHMED 1,2, MOHD HERWAN SULAIMAN 2 (Senior Member, IEEE),
MD MARUF HASSAN 1,3 (Member, IEEE), AND TOUHID BHUIYAN 4
1 Department of Software Engineering, Daffodil International University (DIU), Daffodil Smart City, Dhaka 1216, Bangladesh
2 Faculty of Electrical and Electronics Engineering Technology, Universiti Malaysia Pahang Al-Sultan Abdullah (UMPSA), Pekan 26600, Malaysia
3 Department of Computer Science and Engineering, Southeast University, Dhaka, Bangladesh
4 School of IT, Washington University of Science and Technology (WUST), Alexandria, VA 22314, USA

Corresponding author: Marzia Ahmed ([email protected])

This work was supported by the School of IT, Washington University of Science and Technology (WUST), in Virginia, USA, Program for
Scientific Publication.

ABSTRACT Heart failure is a critical condition with a high mortality rate, making accurate survival
prediction essential for timely interventions. This study proposes an optimized machine learning approach
using Gradient Boosting Machine (GBM) and Adaptive Inertia Weight Particle Swarm Optimization (AIW-
PSO) to predict heart failure survival. The dataset, sourced from Kaggle, includes clinical features such as
age, ejection fraction, and serum creatinine levels for 299 heart failure patients. To address the imbalance
in survival outcomes, Synthetic Minority Over-sampling Technique (SMOTE) was employed to balance the
dataset, followed by SelectKBest and Chi-square feature selection methods to retain the most significant
predictors. The optimized hyperparameters for the GBM model were identified using the AIW-PSO
algorithm, which effectively balanced exploration and exploitation by adaptively adjusting inertia weights.
Model selection was further refined using information criteria, including Akaike Information Criterion (AIC)
and Bayesian Information Criterion (BIC), ensuring that the best-performing model was chosen based on
both predictive accuracy and model complexity. The optimized GBM model achieved a test accuracy of
94%, demonstrating superior performance compared to traditional machine learning models. The study
underscores the importance of hyperparameter tuning through metaheuristic algorithms and highlights the
potential of AIW-PSO in enhancing model performance for clinical prediction tasks. These findings have
significant implications for clinical decision-making, offering a reliable and interpretable tool for predicting
patient outcomes in heart failure management.
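
The adaptive-inertia idea described in the abstract can be illustrated with a minimal sketch. The update rule below, which sets the inertia weight from the fraction of particles that improved their personal best in the previous iteration, is one common adaptive scheme and is an assumption here, since this section does not state the exact rule used. The function name `aiw_pso`, its parameters and default values, and the toy `sphere` objective (a stand-in for a cross-validated GBM error) are likewise hypothetical.

```python
import random


def sphere(x):
    """Toy objective (assumption): stands in for a model's validation error."""
    return sum(v * v for v in x)


def aiw_pso(objective, bounds, n_particles=20, n_iters=60,
            w_min=0.4, w_max=0.9, c1=1.5, c2=1.5, seed=0):
    """Minimise `objective` inside the box `bounds` using PSO with an
    inertia weight adapted from the swarm's success rate (assumed rule)."""
    rng = random.Random(seed)
    dim = len(bounds)
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=pbest_val.__getitem__)
    gbest, gbest_val = pbest[g][:], pbest_val[g]
    w = w_max  # start with strong exploration

    for _ in range(n_iters):
        improved = 0
        for i in range(n_particles):
            for d, (lo, hi) in enumerate(bounds):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                # Clip the new position to the search box.
                pos[i][d] = min(max(pos[i][d] + vel[i][d], lo), hi)
            val = objective(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                improved += 1
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
        # Many improvements -> larger w (explore); few -> smaller w (exploit).
        w = w_min + (w_max - w_min) * improved / n_particles
    return gbest, gbest_val


if __name__ == "__main__":
    best, value = aiw_pso(sphere, [(-5.0, 5.0), (-5.0, 5.0)], seed=1)
    print(best, value)
```

In a hyperparameter-tuning setting, each position component would be decoded into one GBM hyperparameter (for example a learning rate or a number of trees) and `objective` would return a validation error for the resulting model. That mapping is not shown here and would depend on the search space chosen.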

INDEX TERMS Heart failure survival prediction, machine learning algorithms, hyperparameter
optimization, class imbalance handling, AIW-PSO optimization.

ABBREVIATIONS AND TERMS
• AIWPSO: Adaptive Inertia-Weight Particle Swarm Optimization
• GBM: Gradient Boosting Machine
• SMOTE: Synthetic Minority Over-sampling Technique
• ML: Machine Learning
• AUC: Area Under the Curve
• ROC: Receiver Operating Characteristic
• SVM: Support Vector Machine
• HF: Heart failure
• GA: Genetic Algorithm
• ADASYN: Adaptive Synthetic
• ANOVA: Analysis of Variance
• AIC: Akaike Information Criterion
• SBIC: Schwarz Bayesian Information Criterion
• HQIC: Hannan-Quinn Criterion

The associate editor coordinating the review of this manuscript and approving it for publication was Yiqi Liu.

© 2025 The Authors. This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
VOLUME 13, 2025

I. INTRODUCTION
Cardiovascular diseases (CVDs) include a range of disorders affecting the heart and blood vessels, such as coronary heart disease, stroke, and heart failure (HF). The World Health Organization (WHO) reports that CVDs are the leading cause of death globally, accounting for approximately 17.9 million deaths each year, which represents nearly 32% of all deaths worldwide.

Heart failure (HF) remains a formidable challenge in cardiovascular medicine, affecting an estimated 64.3 million people worldwide and accounting for a substantial portion of global healthcare expenditure [1]. This chronic, progressive condition is characterized by the heart’s inability to pump blood efficiently, leading to a cascade of symptoms that significantly impair quality of life and elevate mortality risk. It is frequently caused by underlying conditions like diabetes, hypertension, or other heart diseases [2]. Despite advancements in therapeutic interventions, the 5-year mortality rate for HF patients hovers around 50%, underscoring the urgent need for improved prognostic tools [3].

In recent years, the intersection of machine learning (ML) and clinical medicine has opened new avenues for enhancing patient care through data-driven decision support systems [4], [34]. The application of ML algorithms to predict HF outcomes has shown promise, yet challenges persist in model accuracy and generalizability [5]. A critical bottleneck in leveraging ML for clinical prediction tasks lies in the optimization of model hyperparameters, a process that can significantly influence predictive performance [6].

The advent of metaheuristic optimization algorithms, such as Particle Swarm Optimization (PSO), has provided a powerful framework for navigating the complex hyperparameter landscape of ML models [7]. However, the classical PSO algorithm often struggles with the delicate balance between exploration and exploitation, potentially leading to suboptimal solutions [8]. To address this limitation, we propose the application of Adaptive Inertia Weight Particle Swarm Optimization (AIW-PSO), an enhanced variant that dynamically adjusts its search behavior, to optimize the hyperparameters of a Gradient Boosting Machine (GBM) model for HF survival prediction.

Our study leverages a curated dataset of 299 HF patients, encompassing a rich tapestry of clinical features including left ventricular ejection fraction, serum creatinine levels, and comorbidities [9]. To mitigate the inherent class imbalance typical of survival data, we employ the Synthetic Minority Over-sampling Technique (SMOTE), ensuring a balanced representation of outcomes [10]. Feature selection is performed using the SelectKBest algorithm in conjunction with Chi-square statistical tests, distilling the most salient predictors from the feature space [11].

The novelty of our approach lies in the synergistic integration of AIW-PSO with GBM, a powerful ensemble learning method known for its robustness in handling complex, non-linear relationships [12]. By harnessing the adaptive capabilities of AIW-PSO, we aim to fine-tune the GBM model’s hyperparameters, potentially unlocking superior predictive performance compared to traditional, manually-tuned ML models.

This research is particularly timely given the growing emphasis on personalized medicine and the need for accurate risk stratification in HF management [13]. As healthcare systems worldwide grapple with resource allocation and treatment prioritization, especially in the wake of global health crises, refined prognostic tools could play a pivotal role in optimizing patient care pathways [14].

A. RESEARCH QUESTIONS
Our investigation seeks to address the following key questions:
1) How effective are optimized machine learning algorithms in predicting the survival of heart failure patients?
2) Which clinical and demographic factors most significantly impact the survival predictions for heart failure patients?
3) Does optimization improve the performance of machine learning models in heart failure survival prediction?

By exploring these questions, we aim to contribute to the evolving landscape of ML-assisted clinical decision support, potentially offering clinicians a more refined tool for prognostication in heart failure management.

B. KEY CONTRIBUTIONS
The key contributions of this work are summarized as follows:
1) Introduction of AIW-PSO and GBM Combination: This study introduces the novel combination of AIW-PSO and GBM for optimizing heart failure prediction models, demonstrating its potential to improve model performance by effectively tuning hyperparameters.
2) Performance Across Balanced and Imbalanced Datasets: The model’s performance is explored across both balanced and imbalanced datasets, showcasing its practical utility in real-world applications, particularly in dealing with class imbalance issues that are common in medical datasets.
3) Identification of Key Predictors: Through feature selection techniques, the study identifies critical predictors, such as ejection fraction and serum creatinine, which provide valuable insights for clinical decision-making and contribute to the model’s high accuracy.

Heart failure (HF) is a major public health concern that affects millions of people worldwide, resulting in high morbidity and mortality. The accurate prediction of survival in patients with heart failure is critical for guiding clinical practice. This study demonstrates a novel application of adaptive inertia weight particle swarm optimization (AIW-PSO) in conjunction with a Gradient Boosting Machine (GBM) for model performance improvement in prediction tasks. The proposed methodology shows the potential for improving