Mar Athanasius College of Engineering, Kothamangalam
Department of Computer Applications
Project Synopsis
Topic: Diabetes Prediction Using Machine Learning
This healthcare project uses machine learning to predict diabetes risk, addressing the
burden that the global diabetes epidemic places on healthcare. By analyzing patient data,
it aims to provide accurate predictions that support early intervention and improved
diabetes care. It is intended as a step toward data-driven healthcare.
The project applies predictive modeling to anticipate diabetes development, using machine
learning to analyze patient data and identify diabetes risk factors and outcomes. The
process starts with loading and reviewing the Diabetes Dataset, in which the 'Outcome'
column indicates diabetes status. The features are standardized so that they share a
uniform scale, and the dataset is divided into training (80%) and testing (20%) subsets
with a stratified split that preserves the class proportions. The training subset is used
to train three distinct models: Support Vector Machine (SVM), Random Forest (RF), and
k-nearest neighbors (KNN). The accuracy of these models is then compared on the testing
subset.
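The split, scaling, and evaluation steps described above can be sketched as follows. This is a minimal, dependency-free illustration under stated assumptions, not the project's implementation: synthetic two-feature data stands in for the Diabetes Dataset, every function name is hypothetical, and the scaler is fitted on the training subset only (a common precaution against information leakage, assumed here). Only KNN is shown; SVM and Random Forest would normally come from a library such as scikit-learn.

```python
import math
import random
from collections import Counter, defaultdict


def make_synthetic(n_negative=130, n_positive=70, seed=0):
    """Hypothetical stand-in data: two numeric features, label 1 = diabetic."""
    rng = random.Random(seed)
    rows, labels = [], []
    for label, count, centre in ((0, n_negative, (100.0, 25.0)),
                                 (1, n_positive, (150.0, 35.0))):
        for _ in range(count):
            rows.append([rng.gauss(centre[0], 15.0), rng.gauss(centre[1], 4.0)])
            labels.append(label)
    return rows, labels


def stratified_split(rows, labels, test_fraction=0.2, seed=42):
    """Split each class separately so both subsets keep the class ratio."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)
    train_idx, test_idx = [], []
    for indices in by_class.values():
        rng.shuffle(indices)
        n_test = round(len(indices) * test_fraction)
        test_idx.extend(indices[:n_test])
        train_idx.extend(indices[n_test:])
    train = ([rows[i] for i in train_idx], [labels[i] for i in train_idx])
    test = ([rows[i] for i in test_idx], [labels[i] for i in test_idx])
    return train, test


def fit_scaler(rows):
    """Per-feature mean and standard deviation, computed from the given rows."""
    columns = list(zip(*rows))
    means = [sum(col) / len(col) for col in columns]
    stds = [math.sqrt(sum((v - m) ** 2 for v in col) / len(col)) or 1.0
            for col, m in zip(columns, means)]
    return means, stds


def standardize(rows, means, stds):
    """Rescale every feature to zero mean and unit variance (z-score)."""
    return [[(v - m) / s for v, m, s in zip(row, means, stds)] for row in rows]


def knn_predict(train_rows, train_labels, query, k=5):
    """Majority vote among the k nearest training rows (Euclidean distance)."""
    nearest = sorted(zip(train_rows, train_labels),
                     key=lambda pair: math.dist(pair[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def accuracy(predicted, actual):
    """Fraction of predictions that match the true labels."""
    return sum(p == a for p, a in zip(predicted, actual)) / len(actual)


rows, labels = make_synthetic()
(train_x, train_y), (test_x, test_y) = stratified_split(rows, labels)
means, stds = fit_scaler(train_x)            # statistics from training data only
train_x = standardize(train_x, means, stds)
test_x = standardize(test_x, means, stds)    # reuse the training statistics
predictions = [knn_predict(train_x, train_y, query) for query in test_x]
test_accuracy = accuracy(predictions, test_y)
print(f"KNN test accuracy: {test_accuracy:.2f}")
```

The same accuracy function could score each of the three models on the identical test subset, which keeps the comparison fair.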
High prediction accuracy is crucial for identifying diabetes risk reliably, which enables
timely interventions and reduces long-term healthcare expenses. The predictions can inform
lifestyle adjustments and medical treatments, and the models offer data-driven insight
into risk factors. To access the dataset and more details, visit:
https://www.kaggle.com/datasets/uciml/pima-indians-diabetes-database.
Resources:
1. M. A. Sarwar, N. Kamal, W. Hamid and M. A. Shah, "Prediction of Diabetes Using Machine Learning
Algorithms in Healthcare," 2018 24th International Conference on Automation and Computing (ICAC),
Newcastle Upon Tyne, UK, 2018, pp. 1-6, doi: 10.23919/IConAC.2018.8748992.
2. S. Sivaranjani, S. Ananya, J. Aravinth and R. Karthika, "Diabetes Prediction using Machine Learning
Algorithms with Feature Selection and Dimensionality Reduction," 2021 7th International Conference
on Advanced Computing and Communication Systems (ICACCS), Coimbatore, India, 2021, pp. 141-146,
doi: 10.1109/ICACCS51430.2021.9441935.
3. A. C. Lyngdoh, N. A. Choudhury and S. Moulik, "Diabetes Disease Prediction Using Machine Learning
Algorithms," 2020 IEEE-EMBS Conference on Biomedical Engineering and Sciences (IECBES), Langkawi
Island, Malaysia, 2021, pp. 517-521, doi: 10.1109/IECBES48179.2021.9398759.
Submitted By:
Name: Gowtham Kumar
Reg No: M22CA014
S3 MCA, 2022 – 24 Batch
Faculty Guide:
Prof. Shinu S Kurian
MCA Department