
Stages of the AI Project Cycle

The document outlines the stages of an AI project cycle, including problem scoping, data acquisition, data exploration, modeling, and evaluation. It emphasizes the importance of understanding the problem, collecting reliable data, and using various modeling approaches such as rule-based and learning-based methods. Additionally, it discusses the significance of data visualization and the evaluation process to ensure the model's effectiveness before deployment.

Uploaded by

Kaira Yadav

Introduction

Project:- A project is a set of operations carried out with given resources within a specific schedule to achieve defined objectives.
Project Cycle:- A project cycle is a step-by-step process for solving a problem using proven scientific methods.
Stages in AI project Cycle:-
Stages of the project cycle are the steps that contribute to
completing the project. The stages of AI Project cycle are:-

● Problem Scoping- Understanding the problem

● Data Acquisition- Collecting accurate and reliable data

● Data Exploration- Arranging the data uniformly

● Modelling- Creating models for data

● Evaluation- Evaluating the project


Problem Scoping:-
Problem scoping refers to understanding a problem, finding out the various factors
that affect it, and defining the goal or aim of the project.
The 4Ws Canvas is a problem-framing tool:-
❖ Who:- Who is facing the problem? Who are the stakeholders [affected directly or indirectly]?
❖ What:- What is the nature of the problem? Can you explain it simply?
❖ Where:- In which context, location or situation do people experience the problem?
❖ Why:- Why do you think the problem is worth solving?
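As a rough illustration, the four answers of the canvas can be kept together in one small record so that a blank block is easy to spot. The class name `FourWsCanvas`, its field names and the sample answers are hypothetical and are not part of the original material.

```python
from dataclasses import dataclass, fields


@dataclass
class FourWsCanvas:
    """Hypothetical container for the four answers of the 4Ws canvas."""
    who: str    # stakeholders affected directly or indirectly
    what: str   # nature of the problem
    where: str  # context, location or situation
    why: str    # why the problem is worth solving

    def missing_blocks(self):
        """Return the names of the blocks that are still blank."""
        return [f.name for f in fields(self) if not getattr(self, f.name).strip()]


# Sample answers invented purely for illustration.
canvas = FourWsCanvas(
    who="Students who travel to school by bus",
    what="Buses often arrive late",
    where="On city routes during the morning rush",
    why="",
)
print(canvas.missing_blocks())  # ['why']
```

Here the "Why" block was left empty on purpose, so the check reports that the problem is not fully scoped yet.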
Problem Statement Template:-

The Problem Statement Template helps us summarize all the key points in a single template, so that whenever there is a need to look back at the basis of the problem in the future, we can look at the template and understand its key elements.
[Figure: Problem Statement Template]
Questions:-
1. The 4Ws Problem Canvas helps in identifying the key elements related to the problem. The 4Ws Problem Canvas is a part of…
a. Problem Scoping b. Data Acquisition c. Modelling d. Evaluation

2. The ________ block of the 4Ws Problem Canvas helps in analyzing the people who are affected directly or indirectly by the problem.
a. Who b. What c. Where d. Why

3. Choose the five stages of the AI project cycle in the correct order.
a. Evaluation -> Problem Scoping -> Data Exploration -> Data Acquisition -> Modelling
b. Problem Scoping -> Data Exploration -> Data Acquisition -> Evaluation -> Modelling
c. Data Acquisition -> Problem Scoping -> Data Exploration -> Modelling -> Evaluation
d. Problem Scoping -> Data Acquisition -> Data Exploration -> Modelling -> Evaluation
Data Acquisition:-
The process of collecting accurate and reliable data to work with.

Data Features:- Refers to the type of data you want to collect.

Big Data:- It includes unstructured types of data whose size exceeds the capacity of traditional software to process within an acceptable time and value.

To interpret patterns and trends in large quantities of data, visual forms such as charts, graphs, maps (heat maps, density maps), tree diagrams, Venn diagrams and histograms are preferred.
Data Acquisition:-

There are two types of data sets:

Training Data Set:- The data set that is used to train an algorithm.
Testing Data Set:- The data set that is used with the AI algorithm to correlate the input data with the processed output, that is, to test the model after it is trained.

Base | Training Set | Testing Set
Use | Used for training the model | Used for testing the model after it is trained
Size | Bigger than the testing set; constitutes about 70% to 80% of the data | Smaller than the training set; constitutes about 20% to 30% of the data
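The split described above can be sketched in a few lines. This is a minimal illustration, assuming an 80/20 split and a fixed random seed; the function name `split_dataset` and the sample records are hypothetical.

```python
import random


def split_dataset(records, train_fraction=0.8, seed=0):
    """Shuffle a copy of the records and split it into training and testing sets."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)  # fixed seed keeps the split repeatable
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


data = list(range(100))  # stand-in for 100 collected records
train_set, test_set = split_dataset(data, train_fraction=0.8)
print(len(train_set), len(test_set))  # 80 20
```

Shuffling before cutting is a common precaution so that both sets contain a similar mix of records, and no record appears in both sets.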
Data Set:-
Data features:-

Data Characteristics:- Data is crucial for the success of any AI project.
★ Relevance:- Do you really need this information?
★ Accuracy:- Is the information correct in every detail?
★ Completeness:- How comprehensive is the information?
★ Timeliness:- How up to date is the information?
★ Reliability:- Does the information contradict other sources?
★ Validity:- Is the information compliant with the requirements?
Type of Data:-
❖ Structured Data:- It has a purposely designed, predefined structure based on an existing data model, such as a 2D spreadsheet or table.
❖ Unstructured Data:- It is not organised in a predefined way. Examples are social media posts and surveillance camera footage.
Data Source:-
❖ Web Scraping:- Collecting data from websites using tools called web scrapers. These tools are used for monitoring prices, news, etc.
❖ Sensors:- Sensors collect physical data and detect changes. They are used to measure parameters such as temperature, humidity and blood pressure.
❖ Cameras:- Capture visual information or graphical data such as raw video and raw images.
❖ Application Programming Interfaces (APIs):- Collect specific types of data pertaining to users, for example through the Twitter API or Google Search API.
❖ Observation:- When we observe something carefully, we get some information. It is a time-consuming data source.

❖ The Internet
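To make the web scraping source more concrete, here is a small sketch that pulls prices out of a page using Python's standard `html.parser` module. The page content, the `price` class name and the `PriceParser` class are assumptions made for illustration; a real scraper would first download the page and should respect the website's terms of use.

```python
from html.parser import HTMLParser


class PriceParser(HTMLParser):
    """Collect the text inside every <span class="price"> element."""

    def __init__(self):
        super().__init__()
        self.in_price = False
        self.prices = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the opening tag.
        if tag == "span" and ("class", "price") in attrs:
            self.in_price = True

    def handle_endtag(self, tag):
        if tag == "span":
            self.in_price = False

    def handle_data(self, data):
        if self.in_price:
            self.prices.append(float(data))


# Hypothetical page content standing in for a downloaded web page.
page = """
<ul>
  <li>Pen <span class="price">10.5</span></li>
  <li>Notebook <span class="price">45.0</span></li>
</ul>
"""

parser = PriceParser()
parser.feed(page)
print(parser.prices)  # [10.5, 45.0]
```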
Questions:-
1. During Data Acquisition, feeding previous data into the machine is called…
a. Training Data b. Testing Data c. Predicting Data d. Evaluating Data

2. ________ involves collecting data from various authentic sources such as reliable websites, observation and surveys.
a. Data Acquisition b. Data Exploration c. Data Testing d. Evaluating Data
Data Exploration:-
Data exploration is the process of arranging the gathered data uniformly for a better understanding. The data can be arranged in a table, plotted as a chart or stored in a database.

Graphical representation makes the data understandable for humans, as we can discover trends and patterns.
Data Exploration:-

❖ Table
❖ Line Chart
❖ Pie Chart
❖ Histogram
❖ Bar Graph
❖ Heat Map
❖ Node Link Diagram
❖ Word Cloud
Data Exploration:-

To analyse the data quickly, you need to visualize it so that you can:

❖ Quickly get a sense of the trends, relationships and patterns in the data

❖ Define a strategy for which model to use at a later stage

❖ Communicate the same to others effectively
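Even without a charting tool, arranging raw answers into a frequency table already reveals a trend. The sketch below uses only the Python standard library; the survey answers are invented for illustration.

```python
from collections import Counter

# Hypothetical survey answers gathered during data acquisition.
answers = ["bus", "walk", "bus", "cycle", "bus", "walk"]

# Arrange the raw answers uniformly as a frequency table.
frequency = Counter(answers)

# Print a simple text bar chart, most common category first.
for category, count in frequency.most_common():
    print(f"{category:<6} {'#' * count} ({count})")
```

The longest bar shows at a glance which category dominates, which is the kind of pattern a proper chart would make visible on a larger data set.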


Data Visualization Tools:-
The tools used to visualise the acquired data are known as data visualization or data exploration tools.
A few data visualization tools are:
★ Google Charts
★ Tableau
★ FusionCharts
★ Highcharts
Data Modeling:-
AI modelling refers to developing algorithms, also called models, which can be trained to produce intelligent outputs. In other words, it means writing code to make a machine artificially intelligent.

❖ The candidate models are tested and analysed, and the most suitable AI model is selected.
❖ Once the most efficient model is chosen, the AI algorithm is developed around it.

“During this stage, training and testing data are also selected.”

Training Data Set:- A data set used to train an AI algorithm (model).

Testing Data Set:- A set of observations used to validate the developed model after training is completed.
AI Modeling:-

Modelling is the process in which different models can be created and checked for their advantages and disadvantages.
2 Types of Modelling Approaches:-
1. Rule Based Approach [Model Driven]
2. Learning Based Approach [Data Driven]
Rule Based Approach:-
● The rule based approach refers to AI modelling where the relationships or patterns in the data are defined by the developer.
● This means the machine works on the rules and information given by the developer and performs the task accordingly.
● Rule based AI is used when we have a known or labelled dataset.
Ex. You train your model with 100 images of apples and bananas. Now if you test it by showing an apple, it will figure out and tell you whether it is an apple or not. Here, labelled images of apples and bananas were fed to the model, due to which it could detect the fruit.
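A minimal sketch of developer-defined rules is shown below. It assumes each fruit is described by two hypothetical features, colour and shape, instead of an image; the function name `classify_fruit` and the rules themselves are made up for illustration.

```python
def classify_fruit(colour, shape):
    """Apply hand-written rules; nothing here is learned from data."""
    if colour == "yellow" and shape == "long":
        return "banana"
    if colour in ("red", "green") and shape == "round":
        return "apple"
    return "unknown"  # no rule covers this input


print(classify_fruit("red", "round"))     # apple
print(classify_fruit("yellow", "long"))   # banana
print(classify_fruit("purple", "round"))  # unknown
```

The last call shows the static nature of the approach: an input that no rule anticipates cannot be handled until the developer writes a new rule.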
Rule Based Approach Drawbacks:-

❖ A lot of manual work

❖ Consumes a lot of time

❖ Suitable only for less complex domains

❖ Limited ability and learning capacity

❖ Static and not scalable


Learning Based Approach:-
In the learning based approach, the machine learns from experience with the data fed to it [a huge amount of input data].

Machine Learning [ML]:- Machine learning is a subset of artificial intelligence (AI) that gives a machine the ability to learn automatically and improve from experience based on input data, without being explicitly programmed for it.

In ML, no patterns, rules or relationships are predefined by the developer. Instead, the machine learns with each new input and comes up with its own algorithm.
Machine Learning[ML] :-

3 types of machine learning:-

★ Supervised Learning #Task Driven
○ Regression
○ Classification
★ Unsupervised Learning #Data Driven
○ Clustering
○ Dimensionality Reduction
○ Association
★ Reinforcement Learning #Feedback Driven
Supervised Learning:-

Supervised learning is where a computer algorithm is trained on input data that has been labelled for a particular output.

2 types of Supervised Learning:-
○ Regression
○ Classification
Classification:-
Data is categorized under different labels according to some parameters given in the input, and then the labels are predicted for new data.
It can classify new unlabelled test inputs into categories on the basis of the training data.
Classification models work with non-continuous, i.e. discrete, data.
Ex. To predict which fruit is an apple and which is a banana.
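One simple way to see classification at work is a nearest-neighbour classifier, which gives a new input the label of the most similar training example. This technique is chosen here only for illustration and is not named in the original material; the weights and lengths below are invented.

```python
import math

# Hypothetical labelled training data: (weight in grams, length in cm) -> label.
training_data = [
    ((150, 7), "apple"),
    ((170, 8), "apple"),
    ((120, 18), "banana"),
    ((130, 20), "banana"),
]


def classify(sample):
    """Return the label of the closest training example (1-nearest neighbour)."""
    nearest = min(training_data, key=lambda item: math.dist(item[0], sample))
    return nearest[1]


print(classify((160, 7)))   # apple
print(classify((125, 19)))  # banana
```

The output is always one of a fixed set of labels, which is what makes this a classification task and not a regression task.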
Regression:-
It is used to predict a continuous value.
It is based on a mathematical approach used to find the relationship between two or more variables and predict the outcome.
Ex. To predict your next salary based on your previous salary, increment, etc.
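The salary example can be sketched with a straight-line fit. This is a minimal illustration using ordinary least squares with one input variable; the salary figures and the function name `fit_line` are hypothetical.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical salary history: year of service -> salary (in thousands).
years = [1, 2, 3, 4]
salaries = [30, 34, 38, 42]

slope, intercept = fit_line(years, salaries)
next_salary = slope * 5 + intercept  # predict the salary for year 5
print(round(next_salary, 2))  # 46.0
```

The prediction is a number on a continuous scale, not a category, which is the defining feature of regression.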
Unsupervised Learning:-
It works with unlabelled data and creates clusters of items having similar features, characteristics or patterns.
It is used to identify relationships, patterns and trends in the data that is fed into it.
It helps the user in understanding
- What the data is about
- What the major features identified by the machine are
Ex. A boy observes someone performing a trick with a ball, so he also learns the trick by himself.
Clustering:-
Clustering groups unknown data according to the patterns or trends identified in it.
The patterns observed may already be known to the developer, or they may be new.
Note: Classification ≈ Division, Clustering ≈ Grouping
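Grouping without labels can be illustrated with a simplified k-means procedure on one-dimensional data. K-means is used here only as an example of clustering and is not named in the original material; the heights and the starting centroids are assumptions.

```python
def k_means_1d(values, centroids, iterations=10):
    """Group numbers around the given starting centroids (simple k-means)."""
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for value in values:
            # Assign each value to its closest centroid.
            index = min(range(len(centroids)), key=lambda i: abs(value - centroids[i]))
            clusters[index].append(value)
        # Move each centroid to the mean of its cluster (keep it if the cluster is empty).
        centroids = [
            sum(cluster) / len(cluster) if cluster else centroid
            for cluster, centroid in zip(clusters, centroids)
        ]
    return clusters, centroids


# Hypothetical unlabelled data: heights in cm with two natural groups.
heights = [150, 152, 149, 180, 182, 178]
clusters, centroids = k_means_1d(heights, centroids=[140.0, 190.0])
print(clusters)  # [[150, 152, 149], [180, 182, 178]]
```

No labels were supplied at any point; the two groups emerge from the similarity of the values alone.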
Dimensionality Reduction:-
● We can visualize up to 3 dimensions only.
● To reduce the number of dimensions and still be able to make sense of the data, we use dimensionality reduction.
● The ball in our hand is 3-dimensional. But if we click its picture, the data is transformed to 2-D.
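The picture-of-a-ball idea can be sketched by simply dropping one coordinate. This is a deliberately simplified illustration: practical dimensionality reduction methods choose which directions to keep based on the data, whereas this sketch always discards the depth axis. The function name and the sample points are hypothetical.

```python
def project_to_2d(points_3d):
    """Drop the depth (z) coordinate, like a photograph flattening a scene."""
    return [(x, y) for x, y, _z in points_3d]


# Hypothetical 3-D points on a ball of radius 1 centred at the origin.
ball_points = [(1, 0, 0), (0, 1, 0), (0, 0, 1)]
print(project_to_2d(ball_points))  # [(1, 0), (0, 1), (0, 0)]
```

The third point lands on the centre of the picture, which shows that some information (here, depth) is lost when dimensions are reduced.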
Reinforcement Learning:-
● Learning through feedback, or the trial and error method, is called Reinforcement Learning.
● The system works on a reward or penalty policy. An agent performs an action in the environment, the environment changes its state in response, and the agent receives a reward if the action was positive or a penalty if it was negative.
Reinforcement Learning:-
Example:- A very good example of this is a vending machine.

Suppose you put a coin (action) in a juice vending machine (environment). The system detects the amount of the coin given (state), and you get the drink corresponding to the amount (reward). If the coin is damaged or there is any other problem, you get nothing (penalty).
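The reward and penalty loop can be sketched with a tiny agent that learns which coin to use. This is a simplified trial and error illustration, assuming rewards of +1 and -1 and only two possible actions; the function and variable names are hypothetical.

```python
def vending_machine(coin):
    """Environment: return +1 (reward) for a valid coin, -1 (penalty) otherwise."""
    return 1 if coin == "valid" else -1


actions = ["damaged", "valid"]
value = {action: 0.0 for action in actions}  # estimated reward per action
counts = {action: 0 for action in actions}

for step in range(10):
    if step < len(actions):
        action = actions[step]                # trial: try every action once
    else:
        action = max(actions, key=value.get)  # then repeat the best one so far
    reward = vending_machine(action)
    counts[action] += 1
    # Update the running average reward for the chosen action.
    value[action] += (reward - value[action]) / counts[action]

print(max(actions, key=value.get))  # valid
```

After one penalty for the damaged coin, the agent keeps choosing the valid coin because its estimated reward is higher.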
Evaluation and Deployment:-

In the Evaluation stage, the developed model is evaluated for accuracy and performance using new data, so as to determine whether the developed model is deployable or not.
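One common way to measure performance is accuracy, the fraction of predictions that match the actual labels. The sketch below assumes a classification model and uses invented labels; other measures may suit other kinds of models better.

```python
def accuracy(predictions, actual):
    """Fraction of predictions that match the actual labels."""
    correct = sum(p == a for p, a in zip(predictions, actual))
    return correct / len(actual)


# Hypothetical results on testing data the model has not seen before.
actual = ["apple", "banana", "apple", "banana", "apple"]
predictions = ["apple", "banana", "banana", "banana", "apple"]

score = accuracy(predictions, actual)
print(score)  # 0.8
```

Four of the five predictions are correct, so the accuracy is 0.8; whether that is good enough to deploy depends on the requirements set during problem scoping.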
