Class 22TT – Term I/2024-2025
Course: CS420 – Artificial Intelligence
Homework 04
Submission Notices:
Conduct your homework by filling answers into the placeholders given in this file (in Microsoft Word format).
Questions are shown in black color, instructions/hints are shown in italic and blue color, and your content
should use any color that is different from those.
After completing your homework, prepare the file for submission by exporting the Word file (filled with
answers) to a PDF file, whose filename follows the following format,
<StudentID-1>_<StudentID-2>_HW04.pdf (Student IDs are sorted in ascending order)
E.g., 2312001_2312002_HW04.pdf
and then submit the file to Moodle directly WITHOUT any kinds of compression (.zip, .rar, .tar, etc.).
Note that you will get zero credit for any careless mistake, including, but not limited to, the following things.
1. Wrong file/filename format, e.g., not a PDF file, using “-” instead of “_” as the separator, etc.
2. Disordered arrangement of problems and answers
3. Homework not written in English
4. Cheating, i.e., copying other students’ work or letting other student(s) copy your work.
Problem 1. (2.5pts) Answer each of the following questions with a detailed explanation.
Please write your answers in the table.
Questions (0.5pt each) and Answers

1. Name and briefly describe three advanced technologies in AI mentioned in the slides.
+ Generative AI: AI models that can generate new, original content using generative models.
+ Text generation: models that generate relevant text based on a given prompt.
+ Text-to-image: text-to-image AI converts natural-language descriptions into artistic images.
2. What is the key difference between supervised learning and unsupervised learning?
A supervised learning model learns from labeled examples to map inputs to outputs, while an unsupervised learning model learns from unlabeled examples to describe patterns and insights in the data.
3. What is the primary function of the perceptron algorithm in machine learning, and how does it update its weights during the training process?
The perceptron algorithm is mainly used for binary classification: it divides the n-dimensional input space into two decision regions by a hyperplane defined by a linear separating function. During training, each time an example (x, y) is misclassified, the weights are moved toward classifying it correctly: w ← w + y·x.
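The update rule above can be illustrated with a minimal sketch (a toy example of my own, not taken from the slides): a perceptron for labels in {−1, +1} that only touches the weights when a point is misclassified.

```python
import numpy as np

def perceptron_train(X, y, epochs=10):
    """Perceptron for labels in {-1, +1}; the bias is folded into the weights."""
    Xb = np.hstack([np.ones((len(X), 1)), X])  # prepend a bias column
    w = np.zeros(Xb.shape[1])
    for _ in range(epochs):
        for xi, yi in zip(Xb, y):
            if yi * (w @ xi) <= 0:  # misclassified (or exactly on the hyperplane)
                w += yi * xi        # move the hyperplane toward the example
    return w

# Linearly separable toy data: the sign of the first coordinate decides the class.
X = np.array([[2.0, 1.0], [1.5, -1.0], [-2.0, 0.5], [-1.0, -1.5]])
y = np.array([1, 1, -1, -1])
w = perceptron_train(X, y)
preds = np.sign(np.hstack([np.ones((len(X), 1)), X]) @ w)
```

On linearly separable data such as this, the update rule is guaranteed to converge to a separating hyperplane.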
4. In the context of backpropagation, explain how the chain rule of calculus is applied to compute the gradients of the loss function with respect to each weight in a neural network. How does this process help in updating the weights during the training of a multi-layer neural network using gradient descent?
The chain rule of calculus is essential for this process, as it allows the gradients to be calculated layer by layer in a multi-layer network. It computes the gradient of the loss function with respect to each weight by breaking it into smaller components.
+ Forward pass: the network computes the output y by applying weights and activation functions to the input data.
+ Backward pass: to update a weight w, we calculate the gradient of the loss with respect to w. Using the chain rule, we can break that gradient into a product of partial derivatives:
∂L/∂w = (∂L/∂z) · (∂z/∂w),
where z is the weighted input to the next layer.
The gradients are then propagated backward layer by layer: the gradient at each layer depends on the gradients of the subsequent layers. Gradient descent then uses these gradients to move every weight in the direction that decreases the loss.
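The layer-by-layer factorisation can be checked numerically. Below is a minimal sketch (a hypothetical two-weight network of my own, not from the slides): the gradients obtained via the chain rule are compared against finite-difference approximations.

```python
import numpy as np

# Tiny two-layer network: h = tanh(w1 * x), yhat = w2 * h, loss L = (yhat - y)^2.
def loss(x, y, w1, w2):
    return (w2 * np.tanh(w1 * x) - y) ** 2

def gradients(x, y, w1, w2):
    z = w1 * x
    h = np.tanh(z)
    yhat = w2 * h
    dL_dyhat = 2 * (yhat - y)           # dL/dyhat
    dL_dw2 = dL_dyhat * h               # chain rule: dL/dw2 = dL/dyhat * dyhat/dw2
    dL_dz = dL_dyhat * w2 * (1 - h**2)  # propagate back through tanh
    dL_dw1 = dL_dz * x                  # chain rule: dL/dw1 = dL/dz * dz/dw1
    return dL_dw1, dL_dw2

x, y, w1, w2, eps = 0.5, 1.0, 0.3, -0.7, 1e-6
g1, g2 = gradients(x, y, w1, w2)
num1 = (loss(x, y, w1 + eps, w2) - loss(x, y, w1 - eps, w2)) / (2 * eps)
num2 = (loss(x, y, w1, w2 + eps) - loss(x, y, w1, w2 - eps)) / (2 * eps)
```

The analytic and numerical gradients agree to high precision, which is exactly the layer-by-layer decomposition described above.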
5. Explain the differences between batch gradient descent, stochastic gradient descent (SGD), and mini-batch gradient descent.
The difference between these algorithms lies in how much data is used for each parameter update.
+ Batch gradient descent: it computes the gradient over the entire dataset to perform a single update per iteration.
+ Stochastic gradient descent: it computes the gradient using a single randomly chosen sample per update.
+ Mini-batch gradient descent: it computes the gradient using a small batch of k samples per update.
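The three variants can be sketched with a single training loop in which only the batch size changes (a noiseless least-squares toy problem of my own, not from the slides):

```python
import numpy as np

rng = np.random.default_rng(0)

# Noiseless least-squares toy problem: minimise mean((X w - y)^2).
X = rng.normal(size=(100, 3))
w_true = np.array([1.0, -2.0, 0.5])
y = X @ w_true

def gradient(w, Xb, yb):
    return 2 * Xb.T @ (Xb @ w - yb) / len(yb)

def train(batch_size, steps=1000, lr=0.05):
    w = np.zeros(3)
    for _ in range(steps):
        idx = rng.choice(len(X), size=batch_size, replace=False)
        w -= lr * gradient(w, X[idx], y[idx])
    return w

w_batch = train(batch_size=len(X))  # batch GD: the whole dataset per update
w_sgd = train(batch_size=1)         # SGD: one random sample per update
w_mini = train(batch_size=16)       # mini-batch GD: k = 16 samples per update
```

All three recover w_true here; they differ in the cost per update and in how noisy each step is.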
Problem 2. (2pts) For each of the statements below, choose either true or false, and then provide a detailed explanation.
Please fill in your answers in the table below.
Statements (0.5pt each), True/False, and Explanations

1. In reinforcement learning, a reward is a number returned at a certain step of the Markov Decision Process. The reward is not allowed to depend on state and action.
False. The reward can depend on the state and the action: the reward function determines the immediate feedback received in state s after taking action a and transitioning to state s’.
2. In reinforcement learning, a human user is typically needed to provide feedback to assess whether the predicted value is correct or incorrect. This feedback helps the algorithm learn and refine its understanding of the problem space.
False. A human user is not typically needed to provide feedback during the learning process: the algorithm learns by interacting with the environment and receiving rewards as feedback.
3. Inverse reinforcement learning (which allows helicopters to fly autonomously) focuses mainly on learning the expert’s reward function.
True. Inverse reinforcement learning primarily aims to learn the expert’s reward function from observed trajectories; the core idea is to understand what motivates the expert’s actions.
4. Reinforcement contingencies significantly influence which behaviors individuals are likely to engage in voluntarily.
True. Reinforcement contingencies play a crucial role in shaping voluntary behavior, because individuals are more likely to engage in actions that lead to positive outcomes or rewards.
Problem 3. (2.5pts) The following table is a dataset used for training a decision tree. The labels of each
sample can be “yes” or “no” given three features X, Y, and Z. Your task is to build an ID3 decision tree
by splitting by information gain (draw the resulting tree), then answer the questions.
X Y Z Label
0 0 0 yes
0 0 1 yes
0 1 0 yes
0 1 1 no
1 0 0 no
1 0 1 no
1 1 0 yes
1 1 1 no
a. (1pt) Draw the decision tree below:
Your tree (reconstructed from the computations in parts b and c; at ties in information gain, either of the tying features may be chosen):

X
├─ 0 → Y
│      ├─ 0 → yes
│      └─ 1 → Z
│             ├─ 0 → yes
│             └─ 1 → no
└─ 1 → Y
       ├─ 0 → no
       └─ 1 → Z
              ├─ 0 → yes
              └─ 1 → no
b. (1pt) Which features can be the root of the tree? Explain why.
Answer:
H(S) = −(4/8)·log2(4/8) − (4/8)·log2(4/8) = 1

AE_X = (4/8)·[−(3/4)·log2(3/4) − (1/4)·log2(1/4)] + (4/8)·[−(1/4)·log2(1/4) − (3/4)·log2(3/4)] = 0.81127
I_X = 1 − 0.81127 = 0.18873

AE_Y = (4/8)·[−(2/4)·log2(2/4) − (2/4)·log2(2/4)] + (4/8)·[−(2/4)·log2(2/4) − (2/4)·log2(2/4)] = 1
I_Y = 1 − 1 = 0

AE_Z = (4/8)·[−(3/4)·log2(3/4) − (1/4)·log2(1/4)] + (4/8)·[−(1/4)·log2(1/4) − (3/4)·log2(3/4)] = 0.81127
I_Z = 1 − 0.81127 = 0.18873

(AE denotes the average entropy after splitting on the feature; I denotes the information gain.)
We can see that the information gains of “X” and “Z” are both 0.18873, the highest, while “Y” gives 0. Therefore, either X or Z can be the root; I choose “X” as the root of the decision tree.
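The hand computation above can be cross-checked in code. This sketch recomputes each feature's information gain on the Problem 3 dataset:

```python
import numpy as np

# Problem 3 dataset: columns X, Y, Z, label (1 = yes, 0 = no).
data = np.array([
    [0, 0, 0, 1], [0, 0, 1, 1], [0, 1, 0, 1], [0, 1, 1, 0],
    [1, 0, 0, 0], [1, 0, 1, 0], [1, 1, 0, 1], [1, 1, 1, 0],
])

def entropy(labels):
    p = np.bincount(labels, minlength=2) / len(labels)
    p = p[p > 0]                  # 0 * log2(0) is taken as 0
    return float(-(p * np.log2(p)).sum())

def info_gain(col):
    feat, labels = data[:, col], data[:, 3]
    gain = entropy(labels)        # H(S) = 1 for this dataset
    for v in (0, 1):
        mask = feat == v
        gain -= mask.mean() * entropy(labels[mask])
    return gain

gains = {name: info_gain(i) for i, name in enumerate("XYZ")}
print(gains)   # X and Z tie at ~0.1887, Y gives 0
```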
c. (0.5pt) How many edges are there on the longest branch? Indicate the branch.
Answer:
There are 3 edges on the longest branch: the path that goes from X to Y along edge “0”, from Y to Z along edge “1”, and from Z to the leaf “no” along edge “1”.
Problem 4. (3pts) Answer the following question about Gradient Descent.
a. (1pt) Let ϕ(x): R ↦ R^d, w ∈ R^d. Consider the following objective function (loss function):

Loss(x, y, w) = 1 − 2(w·ϕ(x))y,  if (w·ϕ(x))y ≤ 0;
Loss(x, y, w) = (1 − (w·ϕ(x))y)²,  if 0 < (w·ϕ(x))y ≤ 1;
Loss(x, y, w) = 0,  otherwise,

where y ∈ R. Compute the gradient ∇_w Loss(x, y, w).
Answer:
Case 1: (w·ϕ(x))y ≤ 0
Loss(x, y, w) = 1 − 2(w·ϕ(x))y
∇_w Loss(x, y, w) = −2ϕ(x)y

Case 2: 0 < (w·ϕ(x))y ≤ 1
Loss(x, y, w) = (1 − (w·ϕ(x))y)²
∇_w Loss(x, y, w) = −2ϕ(x)y(1 − (w·ϕ(x))y)

Case 3: (w·ϕ(x))y > 1
Loss(x, y, w) = 0
∇_w Loss(x, y, w) = 0
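As a sketch, the three cases can be implemented directly (the function name and the example values below are my own choices):

```python
import numpy as np

def loss_and_grad(x, y, w, phi):
    """The piecewise loss above and its gradient with respect to w."""
    f = phi(x)
    m = (w @ f) * y                      # the margin (w . phi(x)) y
    if m <= 0:                           # case 1: linear part
        return 1 - 2 * m, -2 * f * y
    if m <= 1:                           # case 2: quadratic part
        return (1 - m) ** 2, -2 * f * y * (1 - m)
    return 0.0, np.zeros_like(f)         # case 3: zero loss, zero gradient

# Example with phi(x) = [1, x] and w = [0, 1/2] (the values used later in this problem):
phi = lambda x: np.array([1.0, x])
w = np.array([0.0, 0.5])
loss1, grad1 = loss_and_grad(-2.0, 1.0, w, phi)    # margin -1  -> case 1
loss2, grad2 = loss_and_grad(-1.0, -1.0, w, phi)   # margin 1/2 -> case 2
```

Note that the loss and its gradient are continuous at the case boundaries (m = 0 and m = 1), which is what makes this a smoothed variant of the hinge loss.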
b. (1pt) Write out the Gradient Descent update rule for the function TrainingLoss (w):R d ↦ R .
Answer:
We need to find the gradient of the loss function TrainingLoss(w):

∇_w TrainingLoss(w) = ∂TrainingLoss(w)/∂w

After that, we update the weights using the formula:

w ← w − η ∇_w TrainingLoss(w),

where η is the learning rate.
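The update rule can be sketched as a loop; the quadratic training loss below is an assumption of mine, purely for illustration:

```python
import numpy as np

# Assumed example loss: TrainingLoss(w) = ||w - w_star||^2, whose gradient
# is 2 (w - w_star); the update rule drives w toward the minimiser w_star.
w_star = np.array([1.0, -3.0])

def training_loss_grad(w):
    return 2 * (w - w_star)

eta = 0.1                                 # learning rate
w = np.zeros(2)
for _ in range(100):
    w = w - eta * training_loss_grad(w)   # w <- w - eta * grad
```

After enough steps, w converges to the minimiser w_star.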
c. (1pt) Let d = 2, and ϕ(x) = [1, x]. Consider the following loss function:

TrainingLoss(w) = (1/2)(Loss(x1, y1, w) + Loss(x2, y2, w)).
Compute ∇_w TrainingLoss(w) for the following values of x1, y1, x2, y2, w:

w = [0, 1/2], x1 = −2, y1 = 1, x2 = −1, y2 = −1.
Answer:
We compute ϕ(x1) and ϕ(x2):
ϕ(x1) = [1, −2]
ϕ(x2) = [1, −1]

Then we compute w·ϕ(x1) and w·ϕ(x2):
w·ϕ(x1) = [0, 1/2]·[1, −2] = 0·1 + (1/2)·(−2) = −1
w·ϕ(x2) = [0, 1/2]·[1, −1] = 0·1 + (1/2)·(−1) = −1/2
Then we compute the loss and its gradient in each case.

For x1, y1, w:
(w·ϕ(x1))y1 = (−1)·1 = −1 ≤ 0 → case 1
Loss(x1, y1, w) = 1 − 2(w·ϕ(x1))y1 = 1 − 2·(−1)·1 = 3
∇_w Loss(x1, y1, w) = −2ϕ(x1)y1 = −2·[1, −2]·1 = [−2, 4]

For x2, y2, w:
(w·ϕ(x2))y2 = (−1/2)·(−1) = 1/2 → case 2
Loss(x2, y2, w) = (1 − (w·ϕ(x2))y2)² = (1 − 1/2)² = 1/4
∇_w Loss(x2, y2, w) = −2ϕ(x2)y2(1 − (w·ϕ(x2))y2) = −2·[1, −1]·(−1)·(1 − 1/2) = [1, −1]
Finally, we compute the gradient of TrainingLoss(w):

∇_w TrainingLoss(w) = (1/2)(∇_w Loss(x1, y1, w) + ∇_w Loss(x2, y2, w)) = (1/2)([−2, 4] + [1, −1]) = [−1/2, 3/2]
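The hand computation can be verified numerically; this self-contained sketch recomputes both case gradients and their average:

```python
import numpy as np

# Values from the problem: w = [0, 1/2], phi(x) = [1, x].
w = np.array([0.0, 0.5])
phi = lambda x: np.array([1.0, x])

def grad_loss(x, y):
    f = phi(x)
    m = (w @ f) * y                  # margin (w . phi(x)) y
    if m <= 0:                       # case 1
        return -2 * f * y
    if m <= 1:                       # case 2
        return -2 * f * y * (1 - m)
    return np.zeros_like(f)          # case 3

g1 = grad_loss(-2.0, 1.0)            # case 1 gradient
g2 = grad_loss(-1.0, -1.0)           # case 2 gradient
g_total = (g1 + g2) / 2              # gradient of TrainingLoss
print(g_total)                       # [-0.5  1.5]
```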