Final Diabetes Prediction Documentation

This project report discusses predicting diabetes using data mining techniques. The objectives are to predict diabetes in the healthcare industry and identify factors that affect health conditions. An artificial neural network model is designed that can predict health performance based on predefined data for a particular health condition like diabetes. Diabetes occurs when the pancreas does not produce enough insulin or the body cannot use the insulin properly. Left uncontrolled, diabetes can damage nerves and blood vessels over time. Data mining techniques are used to explore data to find patterns and relationships between variables that can help predict diabetes.

Uploaded by

Shilpa Shahu

A

Project Report

Diabetes Prediction Using Data Mining

Submitted to

CHHATTISGARH SWAMI VIVEKANAND TECHNICAL UNIVERSITY

BHILAI

in partial fulfillment

for the award of the degree

Bachelor of Engineering

Computer Science and Engineering

Kankasha Naij, 300102218320

Shilpa Shahu, 300102218315

Under the Guidance of

Prof. Ashok Kumar Behera
Assistant Professor

Department of Computer Science and Engineering

Bhilai Institute of Technology,

Bhilai House , GE Road , Durg , Chhattisgarh 491001

Session: 2020– 2021

DECLARATION BY THE CANDIDATE(s)

We, the undersigned, solemnly declare that the report of the project work entitled Diabetes Prediction
Using Data Mining is based on our own work, carried out during the course of our study under the
supervision of Prof. Ashok Kumar Behera.

We assert that the statements made and conclusions drawn are an outcome of the project work. We
further declare that, to the best of our knowledge and belief, the report does not contain any part of
any work which has been submitted for the award of any other degree/diploma/certificate in this
University or any other University of India or any other country.

_________________ _________________
Kankasha Naij Shilpa Shahu
300102218320 300102218315
AS1108 AS0988
CERTIFICATE
This is to certify that the report of the project submitted is an outcome of the project work entitled Diabetes
Prediction Using Data Mining, carried out by Shilpa Shahu (300102218315, AS0988) and Kankasha Naij
(300102218320, AS1108) under my guidance and supervision for the award of the Degree of Bachelor of
Engineering in Computer Science from Chhattisgarh Swami Vivekanand Technical University, Bhilai (C.G).

To the best of my knowledge the report

1. Embodies the work of the candidate himself / herself,

2. Has duly been completed,
3. Fulfills the requirement of the Ordinance relating to the BE degree of the University,
4. Is up to the desired standard for the purpose of which is submitted.

Signature of Coordinator Signature of Coordinator Signature of Guide

Prof. Sumit Sar Prof. Shiv Dutta Mishra Prof. Ashok Kumar Behera

Associate Professor Assistant Professor Assistant Professor

Computer Sc. & Engg Computer Sc. & Engg. Computer Sc. & Engg.

The Project work as mentioned above is hereby being recommended and forwarded for examination and evaluation.

Dr. (Mrs.) M. V. Padmavati

Head of the Department

CERTIFICATE BY THE EXAMINERS

This is to certify that the project entitled

“Diabetes Prediction Using Data Mining”,

submitted by

Shilpa Shahu AS0988 300102218315

Kankasha Naij AS1108 300102218320

has been examined by the undersigned as a part of the examination for the award of the Bachelor of Engineering
degree in Computer Science and Engineering from Chhattisgarh Swami Vivekanand Technical University, Bhilai
(C.G).

(Internal Examiner) (External Examiner)

Name: Name:

Date: Date:
ACKNOWLEDGEMENT

We have great pleasure in submitting this project report entitled Diabetes Prediction Using Data
Mining in partial fulfillment of the degree of Bachelor of Engineering (CSE). While submitting this project report,
we take this opportunity to thank those directly or indirectly related to the project work.

We would like to thank our guide, Prof. Ashok Kumar Behera, who provided the opportunity and
organized the project for us. Without his active co-operation and guidance, it would have been very difficult to
complete the task in time.

We would like to express sincere thanks and gratitude to Dr. M.K Gupta, Principal of the Institution, and Dr.
(Mrs.) M. V. Padmavati, Head of the Department of Computer Science & Engineering, for their encouragement and
cordial support.

On submission of the project, we would also like to thank Prof. Sumit Sar and Prof. Shiv Dutta
Mishra, Project Coordinators, and the faculty and all the staff of the Department of Computer Science & Engineering,
Bhilai Institute of Technology, Durg, for their continuous help and guidance throughout the course of the project.

Acknowledgement is due to our parents, family members, friends and all those persons who have helped us
directly or indirectly in the successful completion of the project work.

Shilpa Shahu, Kankasha Naij

300102218315,300102218320

AS0988, AS1108
Content

1. INTRODUCTION
1.1 OBJECTIVE
1.2 PROJECT DESCRIPTION

2. SYSTEM STUDY
2.1 EXISTING AND PROPOSED SYSTEM
2.2 FEASIBILITY STUDY
2.3 TOOLS AND TECHNOLOGIES USED
2.4 HARDWARE AND SOFTWARE REQUIREMENTS

3. SOFTWARE REQUIREMENTS SPECIFICATION
3.1 USERS
3.2 FUNCTIONAL REQUIREMENTS
3.3 NON-FUNCTIONAL REQUIREMENTS

4. SYSTEM ANALYSIS AND DESIGN
4.1 SYSTEM PERSPECTIVE
4.2 DATABASE DESIGN (ER and/or Conceptual schema)
4.3 CONTEXT DIAGRAM (DFD)
4.4 USE CASE DIAGRAM
4.5 SEQUENCE DIAGRAMS
4.6 ACTIVITY DIAGRAM

5. IMPLEMENTATION
5.1 SCREENSHOTS

6. SOFTWARE TESTING
7. CONCLUSION
8. FUTURE ENHANCEMENTS
BIBLIOGRAPHY
Chapter 1
Introduction

1.1 Objective
• To predict diabetes in the healthcare industry using data mining.
• To predict and categorize the state of health.
• To identify appropriate factors that affect health conditions.
• To design an artificial neural network that can be used to predict health performance based on certain
predefined data for a particular health condition.

Diabetes is a chronic disease that occurs when the pancreas fails to produce enough insulin, or when the body
cannot use the insulin it produces efficiently. Insulin is a hormone that controls the level of sugar in the blood.
Hyperglycemia, or raised blood sugar, is a common result of uncontrolled diabetes and, over time, causes severe
damage to many organs, particularly nerves and blood vessels.

In 2015, 8.5% of adults aged 17 years or older had diabetes. In 2013, diabetes was the cause of 1.5 million deaths, and
high blood glucose caused 2.3 million deaths. The number of diabetes patients worldwide has doubled in the last ten
years. More than 200 million people are affected, and the prevalence of diabetes worldwide increases by about seven
percent annually. People have long suffered from diseases that, in some cases, could be diagnosed and treated;
unfortunately, when symptoms go undiagnosed for a long time, the patient's life may be threatened. Therefore, many
studies have addressed the prediction of various diseases, to the extent that decision support models and intelligent
methods are now used for prediction. One application of decision support models is in the medical field, in the
diagnosis of illnesses such as diabetes [1, 2]. Delay in the diagnosis and prediction of diabetes, with the resulting
insufficient control of blood glucose, increases the risk of macrovascular and capillary complications, ocular
diseases and kidney failure [1, 2].

Data Mining is an analytic process designed to explore data in search of consistent patterns and systematic
relationships between variables, and then to validate the results by applying the patterns found to a new subset of data.
Data mining is often described as the process of discovering patterns, correlations, trends or relationships by searching
through a large amount of data stored in repositories, databases, and data warehouses. Diabetes, often referred to by
doctors as diabetes mellitus, describes a group of metabolic diseases in which the person has high blood glucose
(blood sugar), either because insulin production is insufficient, or because the body's cells do not respond properly to
insulin, or both. This project helps in identifying whether a person has diabetes or not. If the person is predicted
to be diabetic, the project suggests measures for maintaining normal health; if not, it predicts the risk of becoming
diabetic. In this project a classification algorithm was used to classify the Pima Indian diabetes dataset. Results
have been obtained using an Android application.
Data mining, the extraction of hidden predictive information from large databases [1, 12], is a powerful new
technology with great potential to help companies focus on the most important information in their data warehouses.
Diabetes has become one of the most common diseases in today's world, so it is important for every individual to
take the precautionary measure of checking whether he or she is at risk of diabetes. For this purpose we use data
mining techniques to predict whether a person is diabetic or not. The approach is attractive because the results are
obtained through an Android application installed on a mobile device. The main reason for the accuracy of the results
is that only the most significant attributes associated with diabetes are considered for analysis.

Data mining tools predict future trends and behaviours, allowing businesses to make proactive, knowledge-driven
decisions. The automated, prospective analyses offered by data mining move beyond the analyses of past events
provided by retrospective tools typical of decision support systems. They scour databases for hidden patterns, finding
predictive information that experts may miss because it lies outside their expectations.

Diabetes (diabetes mellitus) [24] is classed as a metabolism disorder. Metabolism refers to the way our bodies use
digested food for energy and growth. Most of what we eat is broken down into glucose. Glucose is a form of sugar in
the blood; it is the principal source of fuel for our bodies.

A person with diabetes has a condition in which the quantity of glucose in the blood is too high (hyperglycemia).
This is because the body either does not produce enough insulin, produces no insulin, or has cells that do not respond
properly to the insulin the pancreas produces. This results in too much glucose building up in the blood. This excess
blood glucose eventually passes out of the body in urine. So, even though the blood has plenty of glucose,
the cells are not getting it for their essential energy and growth requirements.

We therefore propose a model to predict diabetes that can be useful for doctors and practitioners. In this
research, we used the following attributes: gender, age category (under 35 or above), existing diagnosis of
diabetes, thirst level, excess hunger, frequency of urination, weight loss, family history of diabetes,
high blood glucose, blurry vision, high blood pressure, consumption of tobacco products or smoking, consumption
of vegetables and fruits, physical activity, waist circumference, height, and weight.

According to Diabetes Research Center reports, the incidence of diabetes has doubled in the last ten years worldwide.
More than 200 million people are affected, and the worldwide prevalence of diabetes increases by about seven percent
annually.

Since diabetes is a chronic disease that can cause permanent damage to the limbs and vital organs, artificial
intelligence tools can improve detection methods and disease control, which will be of great help to
physicians. According to the Diabetes Research Center, early diagnosis of patients at risk can
prevent or delay 80 percent of the lasting complications of type II diabetes.

There are two types of diabetes, type I and type II. Type I diabetes is also called insulin-dependent diabetes, and
type II diabetes is characterized by relative insulin deficiency. Long-term complications of diabetes fall into two
main categories: vascular and non-vascular.
Vascular complications include microvascular (eye disease, neuropathy, nephropathy) and macrovascular
complications (coronary artery disease, peripheral vascular disease, cerebrovascular disease). Non-vascular
complications include gastroparesis, sexual dysfunction, and skin changes.

1.2 Project Description

Diabetes is one of the major international health problems. The World Health Organization reports that around 422
million people worldwide have diabetes. In the current pandemic, people with diabetes also face a higher chance
of experiencing serious complications from COVID-19. In general, people with diabetes are more likely to
experience severe symptoms and complications when infected with a virus.

By reviewing the literature closely and drawing on the experience of human experts on pathological conditions, a
number of factors have been identified that influence a patient's condition in the subsequent period.
These factors were carefully studied and encoded as numerical values suitable for the ANN modelling
environment. They were categorized as input variables and output variables, the latter reflecting the
possible levels of disease status in terms of the assessment system. The data were entered into the JNN tool
environment, the importance of each variable was determined using JNN (identifying the most influential factor on
diabetes), and the network was then trained, validated, and tested on the data.

Classification is one of the most important decision-making techniques in many real-world problems. In this work, the
main objective is to classify the data as diabetic or non-diabetic and to improve the classification accuracy. In many
classification problems, choosing a larger number of samples does not necessarily lead to higher classification
accuracy. In many cases, an algorithm performs well in terms of speed but its classification accuracy is low.
The main objective of our model is to achieve high accuracy. Classification accuracy can be increased by using most
of the data set for training and a small part for testing. This survey has analyzed various techniques for
classifying diabetic and non-diabetic data. It is observed that techniques like Support Vector Machine,
Logistic Regression, and Artificial Neural Network are most suitable for implementing the diabetes prediction system.

The system follows the flow of the project work. The first step in the process is the collection of the data needed
for the work; the dataset used here is the Pima Indian diabetes dataset. The next step is pre-processing of the data,
in which the raw data is converted into an understandable format. The pre-processed data is then classified using a
decision tree algorithm to predict whether a person is diabetic or not. The
user enters his or her details into an Android app installed on a mobile device to obtain the test result. The
attributes entered by the user are compared against the decision tree and the results are generated.

In this project a diabetes data set is considered. The data set was obtained by consulting a specialist doctor. It
consists of 13 attributes. The class label is binary and takes two values:
• Tested positive (1), which means diabetic
• Tested negative (0), which means non-diabetic
Chapter 2
System Study

2.1 Existing And Proposed System

A major challenge facing healthcare organizations (hospitals, medical centers) is the provision of quality services at
affordable costs. Quality service implies diagnosing patients correctly and administering treatments that are effective.
Poor clinical decisions can lead to disastrous consequences which are therefore unacceptable. Hospitals must also
minimize the cost of clinical tests. They can achieve these results by employing appropriate computer-based
information and/or decision support systems. Most hospitals today employ some sort of hospital information systems
to manage their healthcare or patient data. These systems typically generate huge amounts of data which take the form
of numbers, text, charts and images. Unfortunately, these data are rarely used to support clinical decision making.
There is a wealth of hidden information in these data that is largely untapped. This raises an important question: “How
can we turn data into useful information that can enable healthcare practitioners to make intelligent clinical
decisions?”

Although data mining has been around for more than two decades, its potential is only being realized now. Data
mining combines statistical analysis, machine learning and database technology to extract hidden patterns and
relationships from large databases. The two most common modelling objectives are classification and prediction.
Classification models predict categorical labels (discrete, unordered) while prediction models predict
continuous-valued functions. Decision Trees and Neural Networks are typically used for classification, while
Regression is used to predict continuous values; Association Rules and Clustering are descriptive techniques that
find patterns and groupings rather than predict a target.

Data preprocessing and data mining algorithms are used for the rest of the process in the project. Data
transformation, a preprocessing technique, is applied to the data set before the data mining algorithms are applied.
Decision tree and regression models are then built and used to predict the final binary target variable.
After running the different types of models, a model comparison is needed to select the best algorithm. The best
algorithm and model are selected on the basis of the highest accuracy rate.

Diabetes, often referred to by doctors as diabetes mellitus, describes a group of metabolic diseases in which the person
has high blood glucose (blood sugar), either because insulin production is insufficient, or because the body's cells do
not respond properly to insulin, or both.

This project helps in identifying whether a person has diabetes or not. If the person is predicted to be diabetic,
the project suggests measures for maintaining normal health; if not, it predicts the risk of becoming diabetic. In
this project a classification algorithm was used to classify the Indian diabetes dataset. Results have been obtained
using an Android application.
There is also a need to automate the overall process of diabetes prediction. Automating the diabetic database helps
identify the impact of diabetes on various human organs.

R. Ali, M. H. Siddiqi, M. Idris, B. H. Kang and S. Lee used data mining to develop a model for
classifying diabetic patient control level based on historical medical records. The authors were motivated by the
deaths caused by diabetes worldwide, which make it necessary to avoid the complications of the disease. They
developed a new predictive model using data mining techniques to classify diabetic patient control level based on
historical medical records. The research was carried out using three data mining techniques: Naïve Bayes, Logistic
and J48, and was implemented using the WEKA application. The results showed that the Logistic algorithm
gave an average precision of 0.73, recall of 0.744, F-measure of 0.653 and accuracy of 74.4%. Naïve Bayes gave an
average precision of 0.717, recall of 0.742, F-measure of 0.653 and accuracy of 74.2%. J48 gave an average precision
of 0.54, recall of 0.735, F-measure of 0.623 and accuracy of 73.5%. This indicated that the Logistic algorithm was
more accurate than the other two. The research was limited in that only type 2 diabetes was considered. The authors
also did not look into discovering appropriate features with minimal effort or validating the discovered features.

S. Abu Naser, I. Zaqout, M. A. Ghosh, R. Atallah and E. Alajrami developed a prediction model for
type II diabetes treatment plans using data mining. The authors were motivated by the highly dangerous
complications of the chronic disease, including complications that require amputation of a limb. They
developed a new model for classifying type 2 diabetes treatment plans, which could help control the blood glucose
level of diabetic patients. They used the J48 algorithm in an experiment on 318 medical records
collected from the JABER ABN ABU ALIZ clinic centre for diabetes in Sudan. The basic control information
showed that 59.1% of the records were assigned to oral hypoglycemic treatment, 35.5% to insulin and 5.3% to diet. The
evaluation was done using the WEKA application. The research work did not consider type 1 diabetes patients, who
could have been included with additional attributes. Nutrition and exercise could also have been included
to increase the accuracy of the system.

A. Elzamly, S. S. Abu Naser, B. Hussin and M. Doheir predicted diabetes mellitus using
boosting ensemble modelling. They were motivated by the aim of helping diabetes patients carry on with their
normal activities of life by predicting their condition early and tackling it. They intended to predict the diabetes
type of patients from physical and clinical information using a boosting ensemble technique
which internally uses a random committee classifier. The architecture was supported by
integrating data management, learning, and prediction components. The evaluation of the technique
showed a weighted average TP rate of 0.81, FP rate of 0.198, precision of 0.81, recall of 0.81,
F-measure of 0.82 and ROC area of 0.82 for diabetes types 1 and 2. The authors intend to extend the work in
future by integrating it into a cloud-based clinical decision support system for chronic diseases and by including a
feedback mechanism to increase the level of user satisfaction.
2.2 Feasibility Study

Diabetes, or diabetes mellitus, is a metabolic disorder. The disease either destroys the body's ability to
produce insulin, or the body develops resistance to insulin, so that the insulin produced
cannot do its normal job. The main role of insulin is to decrease blood sugar through several
mechanisms. There are two key types of diabetes. In type I diabetes, destruction of pancreatic beta cells impairs
insulin production; in type II, there is progressive insulin resistance in the body, which may ultimately lead to
the destruction of pancreatic beta cells and defects in insulin production. In type II diabetes, genetic
factors, obesity and lack of physical activity are known to play a vital part.
Even though the precise cause of type I diabetes is unknown, factors that may indicate a greater risk include the
following:
• Family history. A person's risk increases if a parent or sibling has a history of type I diabetes.
• Environmental factors. Circumstances such as exposure to a viral illness probably play some role in type I
diabetes.
• The presence of harmful immune system cells. Family members of a person with type I diabetes are sometimes
tested for the presence of diabetes autoantibodies. A person who has these autoantibodies has an
increased risk of developing type I diabetes. Nonetheless, not every person who has these
autoantibodies develops diabetes.
• Geography. Some countries, such as Sweden, have higher rates of type I diabetes.

Researchers do not completely understand why certain people develop pre-diabetes and type II diabetes and others
do not. It is clear, however, that some factors increase the risk:
• Weight. The more fatty tissue a person has, the more resistant that person's cells become to insulin.
• Inactivity. The less active a person is, the greater the risk. Physical activity helps a person control
his/her weight, uses up glucose as energy and makes the person's cells more sensitive to insulin.
• Family history. A person's risk increases if a parent or sibling has a history of type II diabetes.
• Age. A person's risk increases as he/she gets older. This may be because people tend to exercise less,
lose muscle mass and gain weight as they age. Nonetheless, type II diabetes is also increasing
among children, youths and adults.
• Gestational diabetes. If a woman developed gestational diabetes when she was pregnant, her risk of later
developing pre-diabetes and type II diabetes increases. If she gave birth to a baby weighing more than 4
kilograms, she is also at risk of type II diabetes.
• Polycystic ovary syndrome. For females, having polycystic ovary syndrome increases the risk of
diabetes.
• High blood pressure. Having blood pressure above 140/90 millimeters of mercury (mm Hg) is linked
to an increased risk of type II diabetes.
• Abnormal cholesterol and triglyceride levels. If a person has low levels of high-density lipoprotein, or good
cholesterol, his/her risk of type II diabetes is higher. Triglycerides are another type of fat carried
in the blood. A person with high levels of triglycerides has an increased risk of type II diabetes.
A practical approach to this type of problem is regression analysis, in which past data are fitted to
some function. The result is an equation in which each input $x_j$ is multiplied by a weight $w_j$; the sum of all
these products plus a constant then gives the output $y = \sum_{j=0}^{n} w_j x_j + \theta$, where $j = 0, \dots, n$
and $\theta$ is the constant term.
The difficulty lies in choosing an appropriate function that captures all the collected data and adjusts the output
automatically when more information is obtained, because the outcome depends on a number of
factors, and this dependence does not follow any clear regression model.
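The weighted-sum form of the regression equation above can be illustrated with a few lines of Python. This is only a sketch of the arithmetic: the function name `linear_output` and the weights, inputs and constant are hypothetical values chosen for illustration, not fitted to any diabetes data.

```python
def linear_output(weights, inputs, constant):
    """Return y = sum_j (w_j * x_j) + constant for one record."""
    if len(weights) != len(inputs):
        raise ValueError("weights and inputs must have the same length")
    return sum(w * x for w, x in zip(weights, inputs)) + constant


# Hypothetical weights and inputs, used only to show the calculation.
weights = [0.5, -1.0, 2.0]
inputs = [2.0, 1.0, 0.5]
y = linear_output(weights, inputs, constant=0.25)
print(y)
```

In a real regression the weights and the constant would be estimated from past data; here they are simply given.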

The artificial neural network, which emulates human thinking in solving a problem, is a more general approach
that can address this type of problem. This work therefore attempts to develop an adaptive system, an artificial
neural network, to predict and classify a patient's condition based on these factors.
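As a minimal sketch of the building block of such a network, the snippet below implements a single artificial neuron: a weighted sum of the inputs passed through an activation function. The sigmoid activation, the 0.5 decision threshold and the function names are assumptions made for illustration; the report does not specify the architecture of the network it used.

```python
import math


def sigmoid(z):
    """Squash any real number into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def neuron_output(weights, inputs, bias):
    """Weighted sum of the inputs plus a bias, passed through the sigmoid."""
    z = sum(w * x for w, x in zip(weights, inputs)) + bias
    return sigmoid(z)


def classify(weights, inputs, bias, threshold=0.5):
    """Return 1 (tested positive) if the neuron output reaches the threshold, else 0."""
    return 1 if neuron_output(weights, inputs, bias) >= threshold else 0
```

A full network stacks many such neurons in layers and learns the weights and biases from training data instead of fixing them by hand.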

Diabetes is affected by various factors such as height, weight, hereditary factors and insulin, but sugar
concentration is considered the major factor among them. Early identification is the only way to avoid the
complications. Many researchers are conducting experiments on diagnosing the disease using various machine learning
classification algorithms such as J48, SVM, Naive Bayes, Decision Tree and Decision Table, as
research has shown that machine learning algorithms work well in diagnosing different diseases. Data mining
and machine learning algorithms gain their strength from their ability to manage large amounts of data, to combine
data from several different sources and to integrate background information into the study. This research work
focuses on pregnant women suffering from diabetes. In this work, the Naive Bayes, SVM, and Decision Tree machine
learning classification algorithms are used and evaluated on the PIDD dataset to predict diabetes in a
patient. The experimental performance of all three algorithms is compared on various measures, and good
accuracy is achieved. The remainder of the discussion is organized as follows: Section-II reviews related work on
classification techniques for the prediction of diabetes, Section-III describes the methodology and briefly discusses
the dataset used, Section-IV discusses the evaluated results, and Section-V gives the conclusion of the research work.

Sernyak used logistic regression analysis to calculate the odds ratio relating atypical neuroleptic use to a
diagnosis of diabetes in each age group, controlling for the effects of demographics and diagnosis. Thirugnanam
improved diabetes prediction using fuzzy neural networks [10]. Hamid and others proposed hybrid intelligent systems
for the detection of microalbuminuria in patients with type 2 diabetes without measuring urinary albumin. Javad and
others proposed a method based on automatic learning for regulating blood sugar in type II diabetes.
2.3 Tools And Technologies Used
Proposed procedure is summarized in figure-1 below in the form of model diagram. The figure shows the flow of the
research conducted in constructing the model.

Fig. 1. Proposed Model Diagram

Brief Description of Algorithms Used:

Support Vector Machine (SVM): SVM is a standard supervised machine learning model used for
classification. Given a two-class training sample, the aim of a support vector machine is to find the maximum-margin
separating hyperplane between the two classes. For better generalization, the hyperplane should not lie close to the
data points of either class; the hyperplane selected should be far from the data points of each category.
The points that lie nearest to the margin of the classifier are the support vectors. The accuracy of the experiment
is evaluated using the WEKA interface. The SVM finds the optimal separating hyperplane by maximizing the distance
between the two decision boundaries. Mathematically, we maximize the distance between the hyperplane
defined by $w^T x + b = -1$ and the hyperplane defined by $w^T x + b = 1$. This distance is equal to
$2 / \lVert w \rVert$, so we want to solve $\max 2 / \lVert w \rVert$, or equivalently $\min \lVert w \rVert / 2$.
The SVM should also correctly classify all $x_i$, which means $y_i (w^T x_i + b) \ge 1$ for all
$i \in \{1, \dots, N\}$. The evaluated performance of the SVM algorithm for prediction of diabetes
using the confusion matrix is as follows:

Table 1. Confusion Matrix of SVM

Actual class         Classified as A   Classified as B
A-Tested Negative    500               0
B-Tested Positive    268               0
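The margin and the classification constraint from the SVM description can be checked numerically. The sketch below is not a trained SVM: it only evaluates the margin width 2/||w|| and the condition y_i (w^T x_i + b) >= 1 for a given w and b. The function names and the example vector are hypothetical, and class labels are assumed to be coded as -1 and +1.

```python
import math


def margin_width(w):
    """Distance 2/||w|| between the hyperplanes w.x + b = -1 and w.x + b = +1."""
    return 2.0 / math.sqrt(sum(wi * wi for wi in w))


def satisfies_constraints(w, b, points, labels):
    """True if y_i * (w.x_i + b) >= 1 for every point; labels must be -1 or +1."""
    return all(
        y * (sum(wi * xi for wi, xi in zip(w, x)) + b) >= 1
        for x, y in zip(points, labels)
    )


# Hypothetical separating hyperplane in two dimensions.
w, b = [3.0, 4.0], -6.0
print(margin_width(w))
print(satisfies_constraints(w, b, [(1.0, 1.0), (0.0, 0.0)], [1, -1]))
```

An SVM solver searches for the w and b that make the margin as wide as possible while keeping this constraint true for all training points.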

Naive Bayes Classifier: Naive Bayes is a classification technique based on the assumption that all features are
independent of and unrelated to each other; that is, the status of a specific feature in a class does not affect the
status of another feature. Since it is based on conditional probability, it is considered a powerful algorithm for
classification. It works well for data with class imbalance and missing values. Naive Bayes [24] is a
machine learning classifier which employs Bayes' theorem. Using Bayes' theorem, the posterior probability P(C|X) can
be calculated from P(C), P(X) and P(X|C): $P(C|X) = \frac{P(X|C)\,P(C)}{P(X)}$, where P(C|X) is the posterior
probability of the target class C given the predictor X, P(X|C) is the probability of the predictor given the class
(the likelihood), P(C) is the prior probability of class C, and P(X) is the prior probability of the
predictor. The evaluated performance of the Naive Bayes algorithm using the confusion matrix is as
follows:

Table 2. Confusion Matrix of Naive Bayes

Actual class         Classified as A   Classified as B
A-Tested Negative    422               78
B-Tested Positive    104               164
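To make the Bayes' theorem calculation concrete, the sketch below computes P(C|X) for both classes. The priors use the class counts implied by the row totals of the confusion matrix (500 tested negative, 268 tested positive); the likelihood values P(X|C) are hypothetical numbers chosen only to illustrate the arithmetic, and the function name `posterior` is likewise an illustration.

```python
def posterior(priors, likelihoods):
    """Return P(C|X) for each class C from the priors P(C) and likelihoods P(X|C)."""
    # P(X) by the law of total probability over all classes.
    evidence = sum(priors[c] * likelihoods[c] for c in priors)
    return {c: priors[c] * likelihoods[c] / evidence for c in priors}


# Priors from the class counts: 500 negative and 268 positive out of 768.
priors = {"negative": 500 / 768, "positive": 268 / 768}
# Hypothetical likelihoods of observing some feature value X in each class.
likelihoods = {"negative": 0.2, "positive": 0.6}

post = posterior(priors, likelihoods)
print(post)
```

The classifier predicts the class with the larger posterior. With several features, Naive Bayes multiplies the per-feature likelihoods, which is where the independence assumption is used.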

Decision Tree Classifier: Decision Tree is a supervised machine learning algorithm used to solve classification
problems. The main objective of using a Decision Tree in this research work is to predict the target class using
decision rules derived from prior data. It uses root, internal and leaf nodes for prediction and classification. The
root and internal nodes test the instances on different features and can have two or more branches, while the leaf
nodes represent the classification. At every stage, the decision tree chooses the attribute for each node by selecting
the one with the highest information gain among all
the attributes. The evaluated performance of the Decision Tree technique using the confusion matrix is as follows:

Table 3. Confusion Matrix of Decision Tree

                     A      B
A-Tested Negative    407    93
B-Tested Positive    108    160
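
The information-gain criterion described above can be sketched as follows. This is an illustrative example on hypothetical data, not the WEKA implementation: the attribute names `high_glucose` and `smoker` and the four rows are invented, and the code uses plain entropy-based information gain as the text describes.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(rows, labels, attribute):
    """Entropy of the labels minus the weighted entropy after splitting on attribute."""
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[attribute], []).append(label)
    remainder = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder

def best_attribute(rows, labels):
    """Attribute with the highest information gain, used as the next node."""
    return max(rows[0].keys(), key=lambda a: information_gain(rows, labels, a))

# Hypothetical toy data: high_glucose separates the classes, smoker does not.
rows = [
    {"high_glucose": "yes", "smoker": "yes"},
    {"high_glucose": "yes", "smoker": "no"},
    {"high_glucose": "no", "smoker": "yes"},
    {"high_glucose": "no", "smoker": "no"},
]
labels = ["positive", "positive", "negative", "negative"]

print(best_attribute(rows, labels))  # high_glucose
```

Here splitting on `high_glucose` removes all uncertainty (gain of 1 bit), while `smoker` leaves each branch evenly mixed (gain of 0), so the former is chosen.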

Dataset Used: In this work the WEKA tool is used to perform the experiment. WEKA is software developed at the
University of Waikato, New Zealand, which includes a collection of machine learning
methods for data classification, clustering, regression, visualization, etc. One of the biggest advantages of WEKA
is that it can be customized according to requirements. The main aim of this study is to predict whether a patient
is affected by diabetes, using the WEKA tool with the medical database PIDD.

Table-4 shows a brief description of the dataset

Database    No. of Attributes    No. of Instances
PIDD        8                    768

PIDD: Pima Indians Diabetes Dataset

The proposed methodology is evaluated on the Pima Indians Diabetes Dataset (PIDD),
which is taken from the UCI Repository. This dataset comprises the medical details of 768 instances, all of which are
female patients. The dataset has 8 numeric-valued attributes and a class variable, where the class value '0' is treated
as tested negative for diabetes and the class value '1' is treated as tested positive for diabetes. The dataset is
described in Table-4, and Table-5 gives the attribute descriptions.

Accuracy Measures: The Naive Bayes, SVM and Decision Tree algorithms are used in this research work. Experiments are
performed using 10-fold internal cross-validation. Accuracy, F-Measure, Recall, Precision and ROC (Receiver
Operating Characteristic) measures are used to evaluate the classifiers in this work.

Table 6. Accuracy Measures

Measures           Definitions                                                               Formula
1. Accuracy (A)    Accuracy determines how accurately the algorithm predicts instances.      A = (TP + TN) / (Total no. of samples)
2. Precision (P)   Precision measures the classifier's correctness.                          P = TP / (TP + FP)
3. Recall (R)      Recall measures the classifier's completeness or sensitivity.             R = TP / (TP + FN)
4. F-Measure       F-Measure is the harmonic mean of precision and recall.                   F = 2 * (P * R) / (P + R)
5. ROC             ROC (Receiver Operating Characteristic) curves are used to compare the    -
                   usefulness of tests.

Table 7. Comparative Performance of Classification Algorithms on Various Measures.

Classification Algorithms    Precision    Recall    F-Measure    Accuracy %    ROC
Naive Bayes                  0.759        0.763     0.760        76.30         0.819
SVM                          0.424        0.651     0.513        65.10         0.500
Decision Tree                0.735        0.738     0.736        73.82         0.751

The classifiers' performance in terms of Accuracy, Precision, F-measure, Recall and ROC values is listed in
Table-7, and their performance on the basis of classified instances is given in Table-8. Here TP denotes True
Positive, TN denotes True Negative, FP denotes False Positive and FN denotes False Negative.
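
The formulas in Table 6 can be applied directly to the confusion matrices in Tables 1 to 3. The sketch below assumes that rows are actual classes and columns are predicted classes, and that the Precision, Recall and F-Measure figures in Table 7 are class-weighted averages (each class's score weighted by its number of actual instances); that averaging scheme is an assumption, although the numbers it produces agree with Table 7. The function name `weighted_metrics` is hypothetical.

```python
def weighted_metrics(matrix):
    """matrix[i][j] = number of instances of actual class i classified as class j.

    Returns (accuracy, precision, recall, f_measure), where the last three are
    averaged over classes, weighted by the number of actual instances per class.
    """
    n_classes = len(matrix)
    total = sum(sum(row) for row in matrix)
    accuracy = sum(matrix[i][i] for i in range(n_classes)) / total
    precision = recall = f_measure = 0.0
    for i in range(n_classes):
        tp = matrix[i][i]
        actual = sum(matrix[i])                                  # TP + FN
        predicted = sum(matrix[r][i] for r in range(n_classes))  # TP + FP
        p = tp / predicted if predicted else 0.0
        r = tp / actual if actual else 0.0
        f = 2 * p * r / (p + r) if p + r else 0.0
        weight = actual / total
        precision += weight * p
        recall += weight * r
        f_measure += weight * f
    return accuracy, precision, recall, f_measure

naive_bayes = [[422, 78], [104, 164]]  # Table 2
svm = [[500, 0], [268, 0]]             # Table 1

print([round(v, 3) for v in weighted_metrics(naive_bayes)])  # [0.763, 0.759, 0.763, 0.76]
print([round(v, 3) for v in weighted_metrics(svm)])          # [0.651, 0.424, 0.651, 0.513]
```

The SVM row shows why accuracy alone can mislead: every instance is classified as negative, so the 65.10% accuracy simply reflects the share of negative instances in the dataset.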

2.4 Hardware And Software Requirements.

Software Requirements:
o Windows 7 or higher
o Android Studio
o SQL Server 2008
o Google Chrome Browser
o JDK 8 (32 bit)
Hardware Requirements:
o i3 Processor Based Computer or higher
o Memory: 8 GB RAM or Above
o Hard Drive: 1 TB or Above
Chapter 3.
Software Requirements Specification

3.1 Users
We convert the raw data into an understandable format. The pre-processed data is then used to build a decision tree that
predicts whether a person is diabetic or not. To learn the result of the test, the user enters his or her details
into an Android app installed on a mobile device. The attributes entered by the user are evaluated
against the decision tree and the result is generated.

Results have been obtained using Android Application.

User:

Check Diabetes (by providing details such as):

• Gender
• Age category (under 35 or above)
• Existing diagnosis of diabetes
• Thirst level
• Excess hunger
• Frequency of urination
• Weight loss
• Family history of diabetes
• High blood glucose
• Blurry vision
• High blood pressure
• Consumption of tobacco products or smoking
• Consumption of vegetables and fruits
• Physical activity
• Waist circumference
• Height
• Weight

Manage Diabetes By Entering Data

View Doctor Details
View Suggestions
View Information About Diabetes/ Pre-diabetes.
View Information Of Developer.
3.2 Functional Requirements

We have combined three classification algorithms through a voting mechanism to increase the accuracy of the
model. If one algorithm predicts incorrectly, the final prediction is not necessarily affected, because the system
also considers the predictions of the other two algorithms and returns the majority decision. This is intended to give
higher accuracy than a single algorithm.
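
The voting mechanism can be sketched in a few lines. The class labels and the example predictions below are hypothetical; the sketch only shows how a majority decision is taken once each of the three classifiers has produced its own prediction.

```python
from collections import Counter

def majority_vote(predictions):
    """Return the class label predicted by most classifiers."""
    return Counter(predictions).most_common(1)[0][0]

# Hypothetical predictions from the three classifiers for one patient.
votes = {
    "J48": "tested_positive",
    "NaiveBayes": "tested_negative",
    "SMO": "tested_positive",
}
print(majority_vote(votes.values()))  # tested_positive
```

With three classifiers and two classes a tie cannot occur, so one dissenting classifier never changes the final result.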

A. Decision Tree J48 Algorithm

A decision tree is a tree structure in the form of a flowchart (Han and Kamber, 2006). It can be used
as a method for classification and prediction, with a representation using nodes and internodes. The root and internal
nodes are the test cases; the leaf nodes represent the class variables. Figure 2 shows a sample decision tree structure.

Figure 2 : sample decision tree structure

Among classification methods in data mining, decision tree algorithms provide powerful techniques for prediction.
Among the ID3, C4.5, C5, J48 and CHAID decision tree algorithms, we selected J48 to develop our
model. J48 is the Java implementation of C4.5 provided in WEKA, and it works as follows. In order to classify a new
item, it first creates a decision tree based on the attribute values of the available training data set. Every node of
the decision tree is generated by selecting the attribute with the highest information gain. If an attribute gives an
unambiguous end result (an explicit classification of the class attribute), the branch for that attribute is
terminated and the target value is assigned to it.
We used the 12-fold cross-validation technique to build the model using this algorithm. It works as follows:
• Break the data into 12 sets of size n/12.
• Train on 11 sets and test on 1.
• Repeat 12 times and take the mean accuracy.

In 12-fold cross-validation, the original sample is randomly partitioned into 12 equal-sized subsamples. Of the 12
subsamples, a single subsample is retained as the validation data for testing the model, and the remaining (12 − 1)
subsamples are used as training data.
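
The three cross-validation steps above can be sketched as follows. This is a generic illustration rather than WEKA's own procedure: the helper names `k_fold_indices` and `cross_validate`, the fixed random seed, and the `evaluate` callback (which would train a model on the training indices and return its accuracy on the test indices) are all assumptions.

```python
import random

def k_fold_indices(n_samples, k, seed=0):
    """Yield (train_indices, test_indices) for each of the k folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)       # random partition
    folds = [indices[i::k] for i in range(k)]  # k near-equal subsamples
    for i in range(k):
        test = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test

def cross_validate(n_samples, k, evaluate):
    """Mean of evaluate(train, test) over the k folds."""
    scores = [evaluate(train, test) for train, test in k_fold_indices(n_samples, k)]
    return sum(scores) / len(scores)

splits = list(k_fold_indices(768, 12))
print(len(splits), len(splits[0][0]), len(splits[0][1]))  # 12 704 64
```

With 768 instances and 12 folds, each round trains on 704 instances and tests on the remaining 64, and every instance is used for testing exactly once.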

B. Naïve Bayes Algorithm

The Naïve Bayes classifier is based on the Bayes rule of conditional probability. It uses all the
attributes contained in the data and analyses them individually, as though they were equally important and
independent of each other. Various data mining solutions exist for finding relations between
diseases, their symptoms and their medications, but these algorithms have limitations of their own, such as
binning of continuous arguments, numerous iterations and high computational time. The Naïve Bayes classifier, in
contrast, affords fast, highly scalable model building and scoring. The build process for Naive Bayes is parallelized.
Because it omits complex iterative estimation of parameters, it can be
applied to a large dataset in real time. The formula used by the algorithm is:

P(C|X) = (P(X|C) P(C)) / P(X)

Here we used a 70:30 percentage split to build the model using the Naïve Bayes algorithm. This means 70
percent of the data set was used to train the model and the other 30 percent was used to test the
model.
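
A 70:30 percentage split can be sketched as below. The function name `percentage_split`, the shuffling step and the fixed seed are assumptions made for the illustration; the instance count of 768 is taken from the procedure described later in this chapter.

```python
import random

def percentage_split(rows, train_fraction=0.7, seed=42):
    """Shuffle the rows and split them into (training set, test set)."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]

# Row indices stand in for the actual patient records.
train, test = percentage_split(range(768))
print(len(train), len(test))  # 538 230
```

Unlike cross-validation, each instance is used either for training or for testing but never both, so the estimate depends on a single random partition.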

C. SMO (Sequential Minimal Optimization)

This algorithm is commonly used for solving the quadratic programming problems that arise during the training of
SVMs (Support Vector Machines). SMO uses heuristics to partition the training problem into smaller problems that can
be solved analytically. The SMO implementation used here replaces all missing values and transforms nominal attributes
into binary ones. It also normalizes all attributes by default, which helps to speed up the training process. We used
a 70:30 percentage split to train and test the data set with this model. We considered not only accuracy
but also the ability to handle missing values well, which this implementation provides by replacing them before
training. That is the main reason we selected this
algorithm.
3.3 Non-Functional Requirements
This section explains the overall design of the system and the process it follows in order to produce the
prediction.

Dataset Used:

The data set we used is a benchmark dataset, which allows the accuracy and efficiency
of our model to be compared. The data was obtained from the Pima Indians Diabetes Database, National Institute of
Diabetes and Digestive and Kidney Diseases.
Number of Instances: 600
Number of Attributes: 13 + (1 class attribute).
For Each Attribute: (all numeric-valued).

Inputs:
• Gender
• Age category (under 35 or above)
• Existing diagnosis of diabetes
• Thirst level
• Excess hunger
• Frequency of urination
• Weight loss
• Family history of diabetes
• High blood glucose
• Blurry vision
• High blood pressure
• Consumption of tobacco products or smoking
• Consumption of vegetables and fruits
• Physical activity
• Waist circumference
• Height
• Weight

Procedure:

• Load the previous data sets into the system (768 test cases).

• Pre-process the data using the integrated WEKA tool. The following operations are performed on the
dataset:

a. Replacement of missing values

b. Normalization of values

• The user then inputs data to the system in order to find out whether he or she has the disease.
• Build a model using the J48 Decision Tree algorithm and train it on the data set.
• Build a model using the Naïve Bayes algorithm and train it on the data set.
• Build a model using the SMO Support Vector Machine algorithm and train it on the data set.
• Test the data set using these three models.
• Get the evaluation results.
• Finally, collect the predicted votes from all classifiers and give the diagnostic result.
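
The two pre-processing operations in the procedure above can be sketched for a single numeric column. This is an assumption-laden illustration, not the WEKA filters themselves: it replaces missing values with the column mean and rescales values to the range [0, 1], and the sample glucose readings are hypothetical.

```python
def replace_missing_with_mean(column):
    """Replace None entries with the mean of the values that are present."""
    present = [v for v in column if v is not None]
    mean = sum(present) / len(present)
    return [mean if v is None else v for v in column]

def min_max_normalize(column):
    """Rescale values linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(column), max(column)
    if hi == lo:  # constant column: avoid division by zero
        return [0.0 for _ in column]
    return [(v - lo) / (hi - lo) for v in column]

# Hypothetical glucose readings with one missing value.
glucose = [148, None, 183, 89]
filled = replace_missing_with_mean(glucose)
normalized = min_max_normalize(filled)
print(filled)
print(normalized)
```

Normalizing after filling keeps the imputed value inside the observed range, so it maps to a point strictly between 0 and 1.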

Artificial Neural Network

An artificial neural network (ANN) is loosely modelled on the natural neural network of a brain. ANNs
typically consist of multiple layers or a cube design, and the signal path traverses from front to back. Back
propagation uses the error at the output, propagated backwards through the layers, to adjust the weights of the
"front" neural units; it is done in combination with training where the correct result is known. More modern networks
are a bit freer flowing in terms of stimulation and inhibition, with connections interacting in a much more chaotic
and complex fashion. Dynamic neural networks are the most advanced, in that they can, based on rules, dynamically form
new connections and even new neural units while disabling others.

Generally, an artificial neural network consists of layers and a network function. The
layers of the network are the input layer, the hidden layer and the output layer. The input neurons define all the
input attribute values for the data mining model. In our work, the number of input neurons is 7, since each item in
our data set has 7 attributes: Glucose, Blood Pressure, Skin Thickness, Insulin, BMI, Diabetes Pedigree Function, and
Age. In the hidden layer, hidden neurons receive inputs from input neurons and provide outputs to output neurons. The
hidden layer is where the various probabilities of the inputs are assigned weights. A weight describes the relevance or
importance of a particular input to the hidden neuron. Mathematically, a neuron's network function f(x) is defined as
a composition of other functions g_i(x), which can themselves be defined as compositions of other functions. The
important characteristic of the activation function is that it provides a smooth transition as input values change:
a small change in input produces a small change in output.

The tasks to which artificial neural networks are applied tend to fall within a few
broad categories. Application areas include system identification and control (vehicle control, trajectory
prediction, process control, natural resources management), quantum chemistry, game-playing and decision making
(backgammon, chess, poker), pattern recognition (radar systems, face identification, object recognition and more),
sequence recognition (gesture, speech, handwritten text recognition), medical diagnosis, financial applications (e.g.
automated trading systems), data mining (or knowledge discovery in databases, "KDD"), visualization and e-mail
spam filtering.
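
The layered structure and the smooth activation function described above can be sketched as a forward pass through a small network with 7 inputs, one hidden layer and one output neuron. Everything numeric here is hypothetical: the sigmoid activation, the two hidden neurons, the weights, the biases and the normalized patient values are assumptions chosen only to make the example runnable.

```python
import math

def sigmoid(z):
    """Smooth activation: small changes in z give small changes in output."""
    return 1.0 / (1.0 + math.exp(-z))

def neuron_output(inputs, weights, bias):
    """Weighted sum of the inputs passed through the activation function."""
    return sigmoid(sum(w * x for w, x in zip(weights, inputs)) + bias)

def forward(inputs, hidden_layer, output_neuron):
    """Input layer -> hidden layer -> single output neuron."""
    hidden = [neuron_output(inputs, w, b) for w, b in hidden_layer]
    w_out, b_out = output_neuron
    return neuron_output(hidden, w_out, b_out)

# Hypothetical normalized values for the 7 attributes: Glucose, Blood Pressure,
# Skin Thickness, Insulin, BMI, Diabetes Pedigree Function, Age.
patient = [0.74, 0.59, 0.35, 0.0, 0.50, 0.23, 0.48]
hidden_layer = [([0.2] * 7, -0.1), ([-0.3] * 7, 0.05)]  # (weights, bias) per hidden neuron
output_neuron = ([1.5, -1.0], 0.0)

print(forward(patient, hidden_layer, output_neuron))
```

The output lies between 0 and 1 and can be read as a score for the positive class; training would adjust the weights and biases, which are fixed by hand in this sketch.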

Support Vector Machine

The Support Vector Machine (SVM) was first proposed by Vapnik. SVMs are a set of related supervised learning
methods often used in medical diagnosis for classification and regression. An SVM simultaneously minimizes the
empirical classification error and maximizes the geometric margin, so SVMs are also called maximum margin classifiers.
SVM is a general algorithm based on the guaranteed risk bounds of statistical learning theory, the so-called
structural risk minimization principle. SVMs can efficiently perform nonlinear classification using what is called the
kernel trick, implicitly mapping their inputs into high-dimensional feature spaces. The kernel trick allows the
classifier to be constructed without explicitly knowing the feature space. Figure 3 below shows the structure of an SVM.

Fig 3: The structure of SVM.

Recently, SVM has attracted a high degree of interest in the machine learning research community. Several recent
studies have reported that SVMs are generally capable of delivering higher classification accuracy
than other data classification algorithms. SVM is well suited to binary
classification tasks, so we chose it to predict diabetes. SVM is well known for its discriminative
power in classification, especially in cases where a large number of features are involved; in our case
the dimension of the feature vector is 7.

Logistic Regression

In statistics, logistic regression is a regression model where the dependent variable is categorical. Here we consider
the binary case, where the dependent variable can take only two values, "0" and "1", which represent outcomes such as
pass/fail, win/lose, alive/dead or healthy/sick. Logistic regression is used in various fields, including machine
learning, most medical fields, and social sciences. For example, the Trauma and Injury Severity Score (TRISS), which is
widely used to predict mortality in injured patients, was originally developed using logistic regression. Many other
medical scales used to assess the severity of a patient's condition have been developed using logistic regression. The
technique can also be used in engineering, especially for predicting the probability of failure of a given process,
system or product. It is also used in marketing applications, such as predicting a customer's propensity to purchase a
product or halt a subscription. In economics it can be used to predict the likelihood of a person choosing to be in
the labor force, and a business application would be to predict the likelihood of a homeowner defaulting on a
mortgage. Conditional random fields, an extension of logistic regression to sequential data, are used in natural
language processing. In this paper, logistic regression was used to predict whether a patient suffers from diabetes,
based on seven observed characteristics of the
patient.
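
How logistic regression turns seven characteristics into a binary outcome can be sketched as follows. The coefficients, the intercept, the 0.5 threshold and the patient values are hypothetical and serve only to illustrate the model form; they are not fitted values from this work.

```python
import math

def predict_probability(features, coefficients, intercept):
    """P(y = 1 | x) = 1 / (1 + exp(-(b0 + b1*x1 + ... + bn*xn)))."""
    z = intercept + sum(c * x for c, x in zip(coefficients, features))
    return 1.0 / (1.0 + math.exp(-z))

def classify(features, coefficients, intercept, threshold=0.5):
    """Return 1 (diabetic) if the predicted probability reaches the threshold, else 0."""
    return 1 if predict_probability(features, coefficients, intercept) >= threshold else 0

# Hypothetical normalized values for seven characteristics and made-up coefficients.
patient = [0.74, 0.59, 0.35, 0.0, 0.50, 0.23, 0.48]
coefficients = [3.0, -0.5, 0.1, 0.2, 1.5, 0.8, 0.6]
intercept = -3.0

print(predict_probability(patient, coefficients, intercept))
print(classify(patient, coefficients, intercept))
```

A positive coefficient raises the predicted probability as its feature increases, which makes the fitted model straightforward to interpret.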
Chapter 4.
System Analysis And Design

4.1 System Perspective

Figure 4.1 below shows how the data is collected and processed by the classification algorithms:
the input is given by the user and the result is obtained through the Android application.

Fig 4.1 System Perspective

4.2 Context Diagram
Figure 4.2 below shows the overall data flow of the system.

Fig 4.2 Data Flow Diagram

Figure 4.2.1 below shows the level-1 data flow diagram in detail.
Fig 4.2.1 Data Flow Diagram (level-1)

4.3 Database Design
Figure 4.3 below shows the entities of the system and the relationships among them.

Fig 4.3 Entity Relationship Diagram

4.4 Use Case Diagram
Figure 4.4 below shows the use case diagram, which describes how the user interacts
with the application and which functions the user can operate or manage.

[Use-case diagram: the User actor is linked to the use cases Test Diabetes, Manage Diabetes, View List Of
Diabetologist, View Some Suggestion and View About Diabetes.]

Fig 4.4 Use-Case Diagram

4.5 Sequence Diagram
Figure 4.5 below shows the sequence of the procedures and processes carried out in the
whole system.

Fig 4.5 Sequence Diagram

4.6 Activity Diagram
Figure 4.6 below shows the workflow of the algorithms used in the proposed system.

Fig 4.6 Activity Diagram

Figure 4.6.1 below shows the activity in which diabetes is detected and the result is produced.

[Activity diagram: Diabetes Patient Data as input; Output: Risk Factor Of Diabetes Disease]

Fig 4.6.1 Activity Diagram 1

Chapter 5
Implementation
5.1 Screenshots

Fig 5.1.1 shows the Java module code

Fig 5.1.2 shows the module code of Database View

Fig 5.1.3 shows the module code of TestDiabetes

Fig 5.1.4 shows the module code of BuildConfig

Fig 5.1.5 shows the code of the animation module indicator_left_move.xml

Fig 5.1.6 shows the code of the animation module push_left_in.xml

Fig 5.1.7 shows the code of the animation module push_right_in.xml

Fig 5.1.8 shows the code of the shape module buttonlook.xml

Fig 5.1.9 shows the code structure of the ic_launcher_background.xml module

Fig 5.2.1 shows the code structure of the ic_launcher_foreground.xml module

Fig 5.2.2 shows the code structure of the layout module aboutus.xml

Fig 5.2.3 shows the code structure of the layout module doctorlist.xml

Fig 5.2.4 shows the code structure of the enterdata.xml module

Fig 5.2.5 shows the code structure of the splash.xml module

Fig 5.2.6 shows the code of the ic_launcher module

Fig 5.2.7 shows the code of the strings.xml module

Fig 5.2.8 shows the code of the styles.xml module

Fig 5.2.9 shows the code of the AndroidManifest.xml module

Fig 5.3.1 shows the code structure of the Java module About Diabetes
Chapter 6.
Software Testing

Fig 6.1 shows the menu page of the diabetes prediction software

Fig 6.1.1 shows the prediction symptoms in the test diabetes module

Fig 6.1.2 shows the user gender check in the test diabetes module

Fig 6.1.3 shows the symptoms check in the test diabetes module

Fig 6.1.4 shows the diabetes prediction symptoms

Fig 6.1.5 shows the manage diabetes menu

Fig 6.1.6 shows the data entry field for the user

Fig 6.1.7 shows the data entered by the user according to their glucose level

Fig 6.1.8 shows the list of diabetologists available in India

Fig 6.1.9 shows the diabetes suggestion module providing an introduction to the disease

Fig 6.2.1 shows the diabetes suggestion module describing type-1 diabetes

Fig 6.2.2 shows the type-2 level symptoms of diabetes

Fig 6.2.3 shows the moderate level of diabetes as an output after symptom detection

Fig 6.2.4 shows the low level of diabetes as an output

Fig 6.2.5 shows the high risk level of diabetes as an output after the prediction of the system
Chapter 7.
Conclusion

An application using a classification data mining algorithm has been developed to predict the occurrence
or recurrence of diabetes risks. The results of the application show that the prediction system is capable of
predicting diabetes effectively, efficiently and, most importantly, in a timely manner. This means the application is
capable of helping a physician make decisions about patient health risks. It generates results that are close to
real-life situations. This makes data mining more helpful in the health sector and shows that it is necessary for
knowledge discovery in healthcare. Beyond large savings in medical expenses,
lost duty time and usage of critical medical facilities, the naïve Bayes classifier based system is very useful for
the diagnosis of diabetes. The system can perform good prediction with little error, and this technique could be an
important tool for supplementing medical doctors in forming an expert diagnosis. In this method the efficiency of
forecasting was found to be around 95%.

This application would be a tremendous asset for doctors, who can have structured, specific and invaluable information
about their patients and others, so that they can ensure that their diagnoses or inferences are correct and
professional. Finally, the appreciation received from doctors for such software shows that in a place where
diseases are on the rise, such applications should be developed to cover the entire state. The common person stands
to benefit from doctors having such a tool, becoming more knowledgeable as far as personal health and
wellbeing are concerned.

The discovery of knowledge from datasets is important in order to make an effective diagnosis. The aim of data mining
is to extract information stored in a dataset and generate clear and understandable patterns. This study aims at the
discovery of a decision tree model for the prediction of diabetes. Pre-processing is used to improve the quality of
the data. During pre-processing, the significant attributes of the dataset are selected for the prediction of
diabetes; this is an important factor for consideration. The decision tree algorithm used for classification also
produces the maximum accuracy when compared to other classification algorithms. Finally, the results of the system
are delivered through an Android application, which is very useful for the present generation.
Chapter 8.
Future Enhancements

In future, this system can be designed for the prediction of other diseases such as cancer, thyroid and lung diseases;
an Android application for the prediction of such diseases would be of great use in the near future. Another future
enhancement would be to reduce the number of attributes considered for prediction: producing more accurate results
from fewer attributes is needed as an enhancement to the existing system.

The accuracy of the prediction can also be improved by increasing the amount of training data. Performance can be
further improved by identifying and incorporating other parameters and increasing the size of the training set.
Bibliography

[1] P. Yasodha, M. Kannan, “Analysis of a Population of Diabetic Patient Databases in Weka Tool”, International
Journal of Scientific & Engineering Research Volume 2, Issue 5, May-2011

[2] WEKA, by University of Waikato, http://www.cs.waikato.ac.nz/ml/weka

[3] Han, J., Kamber, M.: Data Mining; Concepts and Techniques, Morgan Kaufmann Publishers (2000) pp 132-133

[4] Gloria L.A. Beckles and Patricia E. Thompson-Reidy the authors of“ Diabetes and Women’s Health Across the
Life Stages”.

[5] Jiawei Han, Micheline Kamber, Jian Pei, “Data Mining Concepts and Techniques” Third edition . pp 125-129

[6] Folorunso O and Ogunde A. O (2004), “Data Mining as a Technique for Knowledge” pp 72-76

[7] Management in Business Process Redesign” The Electronic Journal of Knowledge Management Volume 2 Issue 1,
pp 33-44

[8] P.Yashoda, M.Kanan, Analysis of a population of diabetic patients databases in WEKA tool, IJSER, vol2, issue5,
may 2011 pp 21-72

[9] Mukesh kumari, Dr. Rajan Vohra ,Anshul arora Prediction of Diabetes Using Bayesian Network (IJCSIT)
International Journal of Computer Science and Information Technologies, Vol. 5 (4) , 2014, 5174-5178

[10] M. Khajehei, F. Etemady, "Data Mining and Medical Research Studies," cimsim, pp.119-122, 2010 Second
International Conference on Computational Intelligence, Modelling and Simulation, 2010 pp 109-110

[11] Kaur H, Wasan SK,” Empirical Study on Applications of Data Mining Techniques in Healthcare”, Journal of
Computer Science,2(2):194-200,2006

[12] Analysis of a Population of Diabetic Patients Databases with Classifiers using c4.5 Algorithm” World Academy
of Science, Engineering and Technology International Journal of Medical, Pharmaceutical Science and Engineering
Vol: 7 No: 8, 2013 pp 1115-1223

[13] Margaret H. Dunham,-“Data Mining Techniques and tice hall publishers

[14] P. Radha , Dr. B. Srinivasan Predicting Diabetes by cosequencing the various Data Mining Classification
Techniques IJISET - InternationalJournal of Innovative Science
[16] E.Knorr.E and R.Ng, “Algorithms forming distance -based outliers in large datasets”, in proceedings of 1998
International Conference on Very Large Data Bases (VldB’98), pp. 392-403 New York, 1998.

[15] E.Jiawei Hen and Micheline Kamber “DataMining Concepts and Techniques”, CA:Elsevier Inc,SanFranciso,
2006 pp 234-276

[16] U.M.Piatetsky-Shapiro and G.Smyth “From DataMining to Knowledge Discovery : An Overview”,1996, pp.1 -36

[17] S.C.Liao & M.Embrenchts, “Data Mining techniques applied to medical information”, Med.Inform, 2000, pp.81
102.

[18] L.Breiman, J.Friedman, J.Olsen C.Stone, “Classification and Re-gression Trees”, Chapman & Hal, 1984, 122-
134. Engineering & Technology, Vol. 1 Issue6, August 2014pp-124-139

[19] https://archive.ics.uci.edu/ml/datasets/Pima+Indians+Diabetes UCI MACHINE LEARNING REPOSITORY

[20] Szakacs-Simon, P. Dept. of Autom., “Transilvania” Univ., Brasov, Romania Moraru, S.A. ; Perniu, L.Android
application developed to extend health monitoring device range and real-time patient tracking International Journal of
Advanced Research in Computer Science and Software Engineering pp-34-67

[23] en.wikipedia.org/wiki/Diabetes_mellitus

22 pages
BEHAVIOR constraint theory
No ratings yet
BEHAVIOR constraint theory
1 page
NCP For Case Presentation (Acute Pain, Episiotomy)
100% (3)
NCP For Case Presentation (Acute Pain, Episiotomy)
3 pages
Kaki Oedema
No ratings yet
Kaki Oedema
6 pages
Chapter 3 - Project Management Process Groups
No ratings yet
Chapter 3 - Project Management Process Groups
38 pages
Sample Poster #4
No ratings yet
Sample Poster #4
1 page
7__PHYSIOLOGY_AND_INJURIES_IN_SPORTS_XII_2024-25
No ratings yet
7__PHYSIOLOGY_AND_INJURIES_IN_SPORTS_XII_2024-25
8 pages