Evaluation Question Answers
Artificial Intelligence
Unit 7: Evaluation
Questions and Answers
In the Modelling stage we can build different types of models, but how do we check whether one is
better than another? That is where Evaluation comes into play.
1. What is evaluation?
Evaluation is the process of understanding the reliability of an AI model. It is done by feeding a test
dataset into the model and comparing the model's outputs with the actual answers.
2. Define Evaluation.
Before deploying the model in the real world, we test it in as many ways as possible. This stage of
testing the model is known as EVALUATION.
OR
Evaluation is a process that critically examines a program. It involves collecting and analyzing
information about a program’s activities, characteristics, and outcomes. Its purpose is to make
judgments about a program, to improve its effectiveness, and/or to inform programming decisions.
3. Why are we not using the same training data for testing (evaluation) purpose?
This is because our model would simply remember the whole training set, and would therefore always
predict the correct label for any point in the training set. The resulting score would be misleadingly
high and would tell us nothing about how the model performs on new, unseen data.
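A minimal sketch of this idea, assuming scikit-learn is available and using its built-in Iris dataset purely for illustration: the data is split so that evaluation happens on points the model has never seen.

```python
# Sketch: keep evaluation data separate from training data.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)

# Hold back 20% of the data; the model never sees it during training.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

model = DecisionTreeClassifier(random_state=42)
model.fit(X_train, y_train)

# Accuracy on the training data is inflated: the model has "seen" these points.
print("Train accuracy:", accuracy_score(y_train, model.predict(X_train)))
# Accuracy on the held-out test data is the honest estimate.
print("Test accuracy:", accuracy_score(y_test, model.predict(X_test)))
```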
(Precision, that is, takes into account the True Positives and the False Positives.)
14. What is Recall? Mention its formula.
Recall is defined as the fraction of positive cases that are correctly identified.
Recall = True Positives / (True Positives + False Negatives) = TP / (TP + FN)
When we have a value of 1 (that is, 100%) for both Precision and Recall, the F1 Score is also an ideal
1 (100%), which is known as the perfect value for the F1 Score. As the values of both Precision and
Recall range from 0 to 1, the F1 Score also ranges from 0 to 1.
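A small illustrative sketch (the counts below are made-up numbers, not from any real model) showing how Precision, Recall and F1 Score are computed from the four confusion-matrix counts:

```python
# Illustrative sketch: Precision, Recall and F1 Score from hypothetical counts.
TP, FP, FN, TN = 40, 10, 20, 130   # made-up values for demonstration only

precision = TP / (TP + FP)         # fraction of predicted positives that are correct
recall = TP / (TP + FN)            # fraction of actual positives that are identified
f1 = 2 * precision * recall / (precision + recall)   # harmonic mean of the two

print(f"Precision = {precision:.2f}")   # 0.80
print(f"Recall    = {recall:.2f}")      # 0.67
print(f"F1 Score  = {f1:.2f}")          # 0.73
```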
17. Which evaluation metric would be crucial in the following cases? Justify your answer.
a. Mail Spamming
b. Gold Mining
c. Viral Outbreak
Here, Mail Spamming and Gold Mining are cases where the FALSE POSITIVE is the costly error.
A Viral Outbreak, however, is a case where the FALSE NEGATIVE is the costly error: a missed outbreak
infects a lot of people, harming their health and also leading to heavy expenditure on check-ups and
treatment.
So, the False Negative case (VIRAL OUTBREAK) is more crucial and dangerous when compared to the
FALSE POSITIVE cases.
(OR)
a. If the model always predicts that the mail is spam, people would not look at it and eventually
might lose important information. False Positive condition would have a high cost. (predicting
the mail as spam while the mail is not spam)
b. A model saying that there exists treasure at a point and you keep on digging there but it turns
out that it is a false alarm. False Positive case is very costly. (predicting there is a treasure but
there is no treasure)
c. A deadly virus has started spreading and the model which is supposed to predict a viral
outbreak does not detect it. The virus might spread widely and infect a lot of people. Hence,
False Negatives can be dangerous here.
18. What are the possible reasons for an AI model not being efficient? Explain.
Reasons for an AI model not being efficient:
a. Lack of Training Data: If the data is not sufficient for developing an AI model, or if some data is
missing while training the model, the model will not be efficient.
b. Unauthenticated Data / Wrong Data: If the data is not authenticated and correct, the model will
not give good results.
c. Inefficient Coding / Wrong Algorithms: If the written algorithms are not correct and relevant, the
model will not give the desired output.
d. Not Tested: If the model is not tested properly, it will not be efficient.
e. Not Easy: If the model is not easy to implement in production, or is not scalable, it is not efficient.
f. Less Accuracy: A model is not efficient if it gives low accuracy scores in production or on test
data, or if it is not able to generalize well on unseen data.
19. Answer the following:
Give an example where High Accuracy is not usable.
SCENARIO: An expensive robotic chicken crosses a very busy road a thousand times per day.
An ML model evaluates traffic patterns and predicts when this chicken can safely cross the
street with an accuracy of 99.99%.
Explanation: A 99.99% accuracy value on a very busy road strongly suggests that the ML
model is far better than chance. In some settings, however, the cost of making even a small
number of mistakes is still too high. 99.99% accuracy means that the expensive chicken will
need to be replaced, on average, every 10 days. (The chicken might also cause extensive
damage to cars that it hits.)
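The "every 10 days" figure follows from simple arithmetic, sketched below using the numbers given in the scenario (1,000 crossings per day, 99.99% accuracy):

```python
# Sketch of the arithmetic behind the chicken example.
crossings_per_day = 1000
accuracy = 0.9999

error_rate = 1 - accuracy                           # 1 mistake per 10,000 crossings
mistakes_per_day = crossings_per_day * error_rate   # 0.1 mistakes per day
days_between_mistakes = 1 / mistakes_per_day        # ~10 days per fatal mistake

print(round(days_between_mistakes))                 # 10
```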
Give an example where High Precision is not usable.
Example: “Predicting a mail as Spam or Not Spam”
False Positive: Mail is predicted as “spam” but it is “not spam”.
False Negative: Mail is predicted as “not spam” but it is “spam”.
Of course, too many False Negatives will make the spam filter ineffective: spam keeps reaching the
inbox even though every mail the filter does flag really is spam. Hence a high Precision alone is not
usable here; Recall must also be considered. (False Positives, by contrast, would cause important
mails to be missed, which is what Precision guards against.)
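A small sketch with made-up counts shows how a filter can have perfect Precision and still be almost useless because of its low Recall:

```python
# Hypothetical spam filter results (made-up counts for illustration only).
TP = 5    # spam correctly flagged as spam
FP = 0    # no legitimate mail flagged as spam
FN = 95   # spam that slipped into the inbox

precision = TP / (TP + FP)   # 1.00 -> every flagged mail really is spam
recall = TP / (TP + FN)      # 0.05 -> but 95% of the spam gets through

print(precision, recall)     # 1.0 0.05
```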
Four (04) Mark Questions
1. Deduce the formula of F1 Score. What is the need for its formulation?
The F1 Score, also called the F score or F measure, is a measure of a test’s accuracy. It
is calculated from the precision and recall of the test, where the precision is the
number of correctly identified positive results divided by the number of all positive
results, including those not identified correctly, and the recall is the number of
correctly identified positive results divided by the number of all samples that should
have been identified as positive. The F1 score is defined as the weighted harmonic
mean of the test’s precision and recall. This score is calculated according to the
formula.
Formula:
F1 Score = 2 × (Precision × Recall) / (Precision + Recall)
Need for its formulation:
F-Measure provides a single score that balances both the concerns of precision and recall in one
number.
A good F1 score means that you have low false positives and low false negatives, so you’re
correctly identifying real threats, and you are not disturbed by false alarms.
An F1 score is considered perfect when it’s 1, while the model is a total failure when it’s 0.
F1 Score is a better metric to evaluate our model on real-life classification problems and when
imbalanced class distribution exists.
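The following sketch (with made-up values) shows why the harmonic mean is used: unlike a simple average, it collapses towards the smaller of the two values, so a model cannot hide a very poor Recall behind a very good Precision.

```python
# Why F1 uses the harmonic mean: made-up, very unbalanced precision/recall.
precision, recall = 1.0, 0.02

arithmetic_mean = (precision + recall) / 2              # 0.51, looks deceptively decent
f1 = 2 * precision * recall / (precision + recall)      # ~0.04, exposes the weak recall

print(f"Arithmetic mean: {arithmetic_mean:.2f}, F1: {f1:.2f}")
```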
Confusion Matrix:
A Confusion Matrix is a table that is often used to describe the performance of a classification
model (or "classifier") on a set of test data for which the true values are known.
A 2x2 matrix denoting the right and wrong predictions might help us analyse the rate of success.
This matrix is termed the Confusion Matrix.
Evaluation of the performance of a classification model is based on the counts of test records
correctly and incorrectly predicted by the model.
Therefore, the Confusion Matrix provides a more insightful picture, showing not only the overall
performance of a predictive model but also which classes are being predicted correctly or incorrectly,
and what types of errors are being made.
The confusion matrix is useful for measuring Recall (also known as Sensitivity), Precision,
Accuracy and F1 Score.
The following confusion matrix table illustrates how the four classification outcomes (TP, FP, FN, TN)
are arranged, comparing the predicted value with the actual value:

                        Actual: Positive         Actual: Negative
Predicted: Positive     True Positive (TP)       False Positive (FP)
Predicted: Negative     False Negative (FN)      True Negative (TN)

Let's decipher the matrix:
True Positive (TP): the actual value was positive and the model also predicted a positive value.
True Negative (TN): the actual value was negative and the model also predicted a negative value.
False Positive (FP): the actual value was negative but the model predicted a positive value. Also
known as the Type 1 error.
False Negative (FN): the actual value was positive but the model predicted a negative value. Also
known as the Type 2 error.
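As an illustrative sketch (the label lists below are invented), the four counts can be tallied directly from the actual and predicted labels; with scikit-learn, sklearn.metrics.confusion_matrix does the same job.

```python
# Tally TP, TN, FP, FN from actual vs predicted labels (1 = positive, 0 = negative).
actual    = [1, 0, 1, 1, 0, 0, 1, 0, 0, 1]   # made-up ground truth
predicted = [1, 0, 0, 1, 0, 1, 1, 0, 0, 0]   # made-up model output

TP = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 1)
TN = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 0)
FP = sum(1 for a, p in zip(actual, predicted) if a == 0 and p == 1)   # Type 1 error
FN = sum(1 for a, p in zip(actual, predicted) if a == 1 and p == 0)   # Type 2 error

print(f"TP={TP}, TN={TN}, FP={FP}, FN={FN}")   # TP=3, TN=4, FP=1, FN=2
```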
Example:
Case: Loan (Good loan & Bad loan)
The result of TP will be that bad loans are correctly predicted as bad loans.
The result of TN will be that good loans are correctly predicted as good loans.
The result of FP will be that (actual) good loans are incorrectly predicted as bad loans.
The result of FN will be that (actual) bad loans are incorrectly predicted as good loans.
The banks would lose a bunch of money if the actual bad loans are predicted as good loans due to
loans not being repaid. On the other hand, banks won't be able to make more revenue if the
actual good loans are predicted as bad loans. Therefore, the cost of False Negatives is much
higher than the cost of False Positives.
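A small sketch with purely hypothetical costs makes the bank's trade-off concrete: each False Negative (a bad loan approved) loses the whole loan amount, while each False Positive (a good loan rejected) only forgoes the interest income.

```python
# Hypothetical costs for the loan example (all figures invented for illustration).
cost_per_FN = 100_000   # principal lost when a bad loan is approved and not repaid
cost_per_FP = 10_000    # interest revenue missed when a good loan is rejected

FN, FP = 5, 20          # made-up error counts from a batch of loan decisions

print("Cost of False Negatives:", FN * cost_per_FN)   # 500000 -> dominates
print("Cost of False Positives:", FP * cost_per_FP)   # 200000
```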