ST1 4483 8995 Capstone PPT Template
PRESENTATION
• The dataset includes four features of each sample: sepal length, sepal width, petal length, and petal width. The
dataset is often used for classification tasks, where the goal is to predict the species of a new flower based on its
features.
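As a minimal sketch of what the dataset looks like in code, the snippet below loads the copy of Iris bundled with scikit-learn. This assumes the bundled copy can stand in for the Kaggle CSV listed in the references; the `species` column name is introduced here for illustration only.

```python
from sklearn.datasets import load_iris

# as_frame=True returns the data as pandas objects (requires pandas).
iris = load_iris(as_frame=True)
df = iris.frame  # four feature columns plus an integer "target" column

# Map the integer class codes (0, 1, 2) to species names for readability.
# "species" is an illustrative column name, not part of the original data.
df["species"] = df["target"].map(dict(enumerate(iris.target_names)))

print(df.shape)                      # rows x columns
print(list(iris.feature_names))      # the four measurements, in cm
print(df["species"].value_counts())  # samples per species
```

Each row is one flower, and the four feature columns are the inputs a classifier would use to predict `species`.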
3. EDA (Exploratory Data Analysis) Outcomes
• The average sepal length and width of the flowers are 5.8 cm and 3.1 cm, respectively. The average petal
length and width are 3.8 cm and 1.2 cm, respectively.
• The petal length and width are highly correlated (Pearson r of about 0.96), while the sepal length and width
are only weakly correlated, with a slightly negative coefficient (r of about -0.1) across the full dataset.
• The setosa species can be easily separated from the other two species based on its petal length and
width, while versicolor and virginica have some overlap in their feature distributions.
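The EDA figures above can be checked with a few lines of pandas. This is a sketch under the assumption that scikit-learn's bundled Iris copy matches the data used in the project; the variable names are illustrative.

```python
from sklearn.datasets import load_iris

iris = load_iris(as_frame=True)
features = iris.data  # DataFrame with the four measurements in cm
species = iris.target.map(dict(enumerate(iris.target_names)))

# Mean of each feature across all samples, rounded to one decimal place.
means = features.mean().round(1)

# Pearson correlation matrix between the four features.
corr = features.corr()

# Min and max petal length per species, to see how cleanly setosa separates
# and where versicolor and virginica overlap.
petal_range = features["petal length (cm)"].groupby(species).agg(["min", "max"])

print(means)
print(corr.round(2))
print(petal_range)
```

If the largest setosa petal length is below the smallest value for the other two species, a single threshold on that feature isolates setosa, which is what the separation claim amounts to.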
4. PDA (Predictive Data Analysis) Outcomes
• As our project focuses on classification of iris plants based on their sepal and
petal dimensions, the PDA outcomes involve the performance of different
machine learning models on the dataset.
• We evaluated four classification models (Logistic Regression, K-Nearest
Neighbors, Decision Tree, and Random Forest) using 10-fold
cross-validation. All four classified the iris plants with high accuracy;
Random Forest performed best, with a mean cross-validated accuracy of
96.67%.
• We also performed feature selection using Recursive Feature Elimination
(RFE) with Logistic Regression as the underlying model. The RFE results
showed that the most important features for iris classification were petal
length and petal width.
• Overall, the PDA outcomes suggest that machine learning models can
effectively classify iris plants based on their sepal and petal dimensions, and
that petal length and width are the most important features for this
classification task.
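The evaluation and feature-selection steps described above can be sketched with scikit-learn as follows. The slides do not state hyperparameters, scaling, or random seeds, so every such setting here (standardization, `max_iter=1000`, `random_state=0`, shuffled stratified folds) is an assumption, and the accuracies it prints may differ from the reported ones.

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import RFE
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

data = load_iris()
X, y = data.data, data.target
feature_names = list(data.feature_names)

# Assumed settings: the presentation does not specify any of these.
models = {
    "Logistic Regression": make_pipeline(
        StandardScaler(), LogisticRegression(max_iter=1000)
    ),
    "K-Nearest Neighbors": make_pipeline(StandardScaler(), KNeighborsClassifier()),
    "Decision Tree": DecisionTreeClassifier(random_state=0),
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=0),
}

# 10-fold stratified cross-validation keeps the class balance in every fold.
cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
scores = {
    name: cross_val_score(model, X, y, cv=cv, scoring="accuracy").mean()
    for name, model in models.items()
}
for name, acc in scores.items():
    print(f"{name}: mean accuracy {acc:.4f}")

# Recursive Feature Elimination with Logistic Regression as the estimator:
# refit repeatedly, dropping the weakest feature until two remain.
rfe = RFE(LogisticRegression(max_iter=1000), n_features_to_select=2)
rfe.fit(StandardScaler().fit_transform(X), y)
selected = [name for name, keep in zip(feature_names, rfe.support_) if keep]
print("Selected features:", selected)
```

Standardizing the features before RFE puts the Logistic Regression coefficients on a common scale, so their magnitudes are comparable when ranking features.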
5. Implementation and Deployment (TkInter/Flask/Streamlit) Plan and Status Update
Streamlit App
References / Bibliography
Brownlee, J. (2020). How to Develop a Multiclass Classification Model
for Iris Flower Species. Retrieved from
https://machinelearningmastery.com/how-to-develop-a-multiclass-classification-model-for-iris-flower-species/
Kaggle. Iris dataset (uciml/iris). Retrieved from
https://www.kaggle.com/datasets/uciml/iris?resource=download