
EECS708: Machine Learning

Classification – part 2
Dr. Ioannis Patras
School of EECS

Slide credit: Dr. Tim Hospedales


Course Context
• Supervised Learning
– (Linear) regression
– (Linear) Classifiers and Logistic Regression
– Neural Networks
• Unsupervised
– Clustering
– Density Estimation
– Dimensionality reduction (partial)
• Advanced topics
– Deep Learning, Convolutional Neural Networks
– Ensemble Learning
Classification: Overview

• Decision Trees
• Naïve Bayes
• Practical Issues and performance metrics
Decision Trees: Play Tennis Dataset

[Table: Play Tennis training examples with attributes Outlook, Temperature, Humidity, Wind and label Play]
Decision Trees: Contingency Tables
• For every combination of attributes, record how frequently it occurs
• Check the cube to predict new data
– Would be slow
– A decision tree can compress the cube

[Figure: 3-D contingency cube over Outlook (Sunny / Overcast / Rainy), Humidity (Humid / Not Humid) and class (Play / No Play)]
Decision Trees: Model Structure & Test-Time Procedure
• Internal nodes:
– Test the value of a particular attribute: Equality /
Inequality.
– Branch according to the result
• Leaf nodes:
– Specify the class f(x)
• Test time:
Classify x* by sending it down the tree
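
To make this concrete, here is a minimal Python sketch of the test-time procedure (the Leaf/Node classes and the boolean-attribute convention are illustrative assumptions, not code from the module):

class Leaf:
    def __init__(self, label):
        self.label = label            # class f(x) assigned at this leaf

class Node:
    def __init__(self, attr, left, right):
        self.attr = attr              # attribute tested at this internal node
        self.left = left              # subtree followed when x[attr] == 0
        self.right = right            # subtree followed when x[attr] == 1

def classify(tree, x):
    """Send x down the tree until it reaches a leaf."""
    while isinstance(tree, Node):
        tree = tree.right if x[tree.attr] else tree.left
    return tree.label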
Decision Tree: How to Grow/Train
• What algorithm do you think can construct a
tree from data?
– Hint: It’s recursive.
• Suppose you have a magic pick-best attribute
function?
Decision Tree: How to Grow/Train

Recursive Algorithm:
• Grow(T)
– if All y=0, return Leaf(0)
– elseif All y=1, return Leaf(1)
– else
• xj = ChooseBestAttribute(T)
• T0 = <x,y> in T with xj=0
• T1 = <x,y> in T with xj=1
• Return Node(xj, Grow(T0), Grow(T1))
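
A runnable Python version of this recursion might look as follows (a sketch reusing the Leaf/Node classes above; choose_best_attribute is defined after the information-gain slide, and guards for empty splits are omitted for brevity):

def grow(T):
    """T is a list of (x, y) pairs: boolean feature dicts x and labels y in {0, 1}."""
    labels = [y for _, y in T]
    if all(y == 0 for y in labels):
        return Leaf(0)
    if all(y == 1 for y in labels):
        return Leaf(1)
    j = choose_best_attribute(T)                  # the "magic" picker
    T0 = [(x, y) for x, y in T if x[j] == 0]      # left branch data
    T1 = [(x, y) for x, y in T if x[j] == 1]      # right branch data
    return Node(j, grow(T0), grow(T1))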
Grow/Train: How to Choose the Best Attribute?
• Pick the attribute that greedily maximizes accuracy?
– In this example, x1
• j = ChooseBestAttribute(T)
– Choose j to minimize:
#examples <x,y> in T0 with y≠0  +  #examples <x,y> in T1 with y≠1
• (In practice, maximize information gain instead)
Entropy and Information Gain
Entropy of a random variable Y:

H(Y) = −Σ_y p(y) log p(y)

A measure of how uniform/peaked a distribution is: near-uniform distributions have high entropy, sharply peaked distributions have low entropy.


Information Gain in Decision Trees
Entropy of a random variable Y, before the split:

H(Y) = −Σ_y p(y) log p(y)

Y_l : distribution at the left branch
Y_r : distribution at the right branch

Information gain of Y given a split A = {l, r}:

IG(Y|A) = H(Y) − Σ_a p_a H(Y_a)

where p_a is the proportion of the data in branch a ∈ {l, r}.
Training with non-boolean features

• Nominal
– Test one value versus all the others
(Outlook=Sunny)
– Group into disjoint subsets. (Postcode = W1)
• Continuous
– Threshold inequality xj > th
Decision Tree: What Can They Represent? (Nominal Data)
• Depth-1 tree
– Any Boolean function of 1 feature
• Depth-2 tree
– Any Boolean function of two features

[Figure: example depth-1 and depth-2 trees testing attributes p and q, with false/true branches]
• DT can represent any boolean function
– (But worst case 2^N leaves)
Decision Tree: What Can They Represent? (Continuous Data)
• If Length > L1
– Then Salmon
• Else
– If Lightness > L2, then Cod
– Else, Salmon
• Represent:
– Axis parallel cuts.
– Can approximate but not exactly
represent diagonal boundaries.
– Can become arbitrarily complex
with enough data
Decision Tree: Over-fitting & Regularization

• Suppose one unusual day: [Sunny, Hot, Normal, Strong, Play=No]
– What happens to the tree?
– New (noisy) nodes will be grown under Sunny–Normal–…
Decision Tree: Over-fitting
• Overfitting, formally:
– Train error (known): E(M, Dtrain)
– Future error (unknown): E(M, Dall)
– Model M overfits if there is some other model M’ with:
• E(M, Dtrain) < E(M’, Dtrain)
• E(M, Dall) > E(M’, Dall)
Decision Tree: Regularization
Avoiding Over-fitting for a decision tree. Ideas?
1. Grow full tree then prune
– How to guide pruning?
• Measure performance on train data?
• Measure performance on validation data?
2. Add regularizer to split objective
– xj = ChooseBestAttribute(T)
– If error improvement < λ* #nodes
• Then skip
– (Determine λ by validation)
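
As an illustration, the regularized split test could be folded into the growing procedure like this (a sketch: the λ value, the majority-vote error measure, and the two-node cost are all assumptions of this sketch):

LAMBDA = 0.01   # regularization strength, chosen by validation (assumed value)

def majority_error(labels):
    """Training error of predicting the majority class."""
    return min(sum(labels), len(labels) - sum(labels)) / len(labels)

def should_split(T, j, extra_nodes=2):
    """Skip the split if its error improvement is below lambda * #new nodes."""
    before = majority_error([y for _, y in T])
    after = 0.0
    for value in (0, 1):
        branch = [y for x, y in T if x[j] == value]
        if branch:
            after += len(branch) / len(T) * majority_error(branch)
    return (before - after) >= LAMBDA * extra_nodes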
Decision Tree: Summary
• Decision Tree Classifier:
– Representation: Tree
– Evaluation: Accuracy
– Train: Greedy, recursive
– Test: Traverse tree
– Prevent overfit: regularize on # of nodes, or prune
• Properties:
– Good: mixed-type data (no 1-of-N encoding!)
– Good: high dimensions
– Classification at test time can take < O(d)! (Cf. NN: O(dn), MaxEnt: O(d))
– Frequently used in industry
– May be interpretable
– Finding the optimal tree is NP-complete: practical trees are not optimal, but good enough
– Some pathological problems can’t be represented as trees
Classification: Overview

• Decision Trees
• Naïve Bayes
– Bayes Theorem
– ML fitting
• Practical Issues
Naïve Bayes – Bayes Theorem Recap
• Bayes Theorem
– P(A) “Prior probability of A”
– P(B|A) “Probability of B given A”

p(A|B) = p(B|A) p(A) / p(B)
p(B|A) p(A) = p(A, B)
p(A) = Σ_B p(A, B)
Naïve Bayes – Bayes Theorem Recap
• Bayes Theorem:
– P(H) “Prior probability of hypothesis H”
– P(D|H) “Probability of data D given hypothesis H”

p(H|D) = p(D|H) p(H) / p(D)
p(D|H) p(H) = p(D, H)
p(D) = Σ_H p(D, H)
Naïve Bayes – Bayes Theorem Recap
• Bayes Theorem example:
– Hypothesis = {C, !C}, Data = {+, −}
– P(C) = 0.008, P(!C) = 0.992
– P(+|C) = 0.98, P(−|C) = 0.02
– P(+|!C) = 0.03, P(−|!C) = 0.97

p(C|+) = 0.98 × 0.008 / (0.98 × 0.008 + 0.03 × 0.992)
p(C|+) ≈ 0.2
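
A quick check of this arithmetic in Python:

p_c, p_not_c = 0.008, 0.992            # prior over hypotheses
p_pos_c, p_pos_not_c = 0.98, 0.03      # likelihood of a positive test

posterior = p_pos_c * p_c / (p_pos_c * p_c + p_pos_not_c * p_not_c)
print(round(posterior, 3))             # 0.209: C is still unlikely after a positive test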
Bayes Theorem - Visualization
• Set interpretation
– P(A) is size of set A in the world
– P(A,B) is the size of the intersection of set A&B
– P(A|B) is the fraction of the space where B is
true that A is also true
H = “Have a headache”
F = “Coming down with Flu”

P(H) = 1/10
P(F) = 1/40
P(H|F) = 1/2

“Headaches are rare and flu is rarer, but if you’re coming down with flu there’s a 50/50 chance you’ll have a headache.”
Bayes Theorem - Visualization
• Set interpretation
– P(A) is size of set A in the world
– P(A,B) is the size of the intersection of set A&B
– P(A|B) is the fraction of the space where B is
true that A is also true
H = “Have a headache”
F = “Coming down with Flu”

P(H) = 1/10
P(F) = 1/40
P(H|F) = 1/2

Good reasoning?! One day you wake up with a headache. You think: “Drat! 50% of flus are associated with headaches, so I must have a 50-50 chance of coming down with flu.”
Wrong: think of the fraction of H occupied by F, not the fraction of F occupied by H. By Bayes, P(F|H) = P(H|F) P(F) / P(H) = (1/2 × 1/40)/(1/10) = 1/8.
From Bayes Theorem to Naïve Bayes
p(H|D) = p(D|H) p(H) / p(D)

• Bayes Theorem:
– P(H) “Prior probability of hypothesis H”
– P(D|H) “Probability of data D given hypothesis H”
• What if we have two or more sources of data?
– Recall: p(A, B) = p(A) p(B) iff independent
• Then either
p(H|D1, D2) = p(D1, D2|H) p(H) / p(D1, D2)
or
p(H|D1, D2) = p(D1|H) p(D2|H) p(H) / p(D1, D2)
• Using the latter regardless is known as the “Naïve” assumption
From Bayes Theorem to Naïve Bayes

• A direct Bayesian classifier would have to model:
– P(“Viagra”=1, “Cheap”=1, …|Spam) = …
– P(“Viagra”=0, “Cheap”=1, …|Spam) = …
– P(“Viagra”=1, “Cheap”=0, …|Spam) = …
– P(“Viagra”=0, “Cheap”=0, …|Spam) = …
– …
– P(“Viagra”=1, “Cheap”=1, …|Ham) = …
– P(“Viagra”=0, “Cheap”=1, …|Ham) = …
– P(“Viagra”=1, “Cheap”=0, …|Ham) = …
– P(“Viagra”=0, “Cheap”=0, …|Ham) = …

p(H|D1, D2) = p(D1, D2|H) p(H) / p(D1, D2)

• Clearly the table size, and hence the data requirement, is exponential in the size of the dictionary. Cf. 2^80,000!
Naïve Bayes Classifier

• Naïve Bayes spam classification:

p(H|D_1..N) = Π_i p(D_i|H) p(H) / Π_i p(D_i)

• P(“Viagra”|Spam) = 90%
• P(“Viagra”|Ham) = 5%
• P(“Cheap”|Spam) = 60%
• P(“Cheap”|Ham) = 30%
• P(Spam) = 10%
• P(Ham) = 90%

• P(Spam|Cheap) = p(C|S) p(S)/Z = 0.6×0.1/(0.6×0.1 + 0.3×0.9) = 18%
• P(Spam|Viagra) = p(V|S) p(S)/Z = 0.9×0.1/(0.9×0.1 + 0.05×0.9) = 67%
• P(Spam|Cheap, Viagra) = p(V|S) p(C|S) p(S)/Z = 0.6×0.9×0.1/(0.6×0.9×0.1 + 0.3×0.05×0.9) = 80%
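
These posteriors are easy to reproduce in Python (a sketch; here the normalizer Z sums the class-conditional joints over both classes, which yields the same numbers):

# Likelihoods p(word|class) and priors, taken from the slide.
p_word = {"Spam": {"Viagra": 0.90, "Cheap": 0.60},
          "Ham":  {"Viagra": 0.05, "Cheap": 0.30}}
prior = {"Spam": 0.10, "Ham": 0.90}

def p_spam_given(words):
    """Naive Bayes posterior p(Spam | observed words)."""
    joint = dict(prior)
    for c in joint:
        for w in words:
            joint[c] *= p_word[c][w]           # naive independence assumption
    return joint["Spam"] / (joint["Spam"] + joint["Ham"])

print(p_spam_given(["Cheap"]))                 # ~0.18
print(p_spam_given(["Viagra"]))                # ~0.67
print(p_spam_given(["Cheap", "Viagra"]))       # 0.80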
Naïve Bayes Classifier: Continuous Data
• For continuous data, often model p(D|H) as a Gaussian:

p(x|μ, σ) = (1/(σ√(2π))) exp(−(x − μ)²/(2σ²))

• P(S|x*_col, x*_len) = p(x*_col|S) p(x*_len|S) p(S)/K
• P(C|x*_col, x*_len) = p(x*_col|C) p(x*_len|C) p(C)/K

[Figure: class-conditional Gaussians p(Len|Salmon), p(Len|Cod), p(Color|Salmon), p(Color|Cod), with a query point x*]
Learning Naïve Bayes Classifier: Discrete
• To learn the NB classifier, we need to fit probability distributions.
• Observe a coin with H,H,H,T,T:
– p(Heads|Coin) = 3/5 = 60%, p(Tails|Coin) = 2/5 = 40%
• Roll a die 60 times, observing 12×1, 8×2, 11×3, 9×4, 14×5, 6×6:
– P(1|Die) = 20%, …, P(6|Die) = 10%
• This is called a binomial/multinomial distribution.
• The parameter tells you the bias: [0.6, 0.4] and [0.2, 0.13, 0.18, 0.15, 0.23, 0.1]
• Find the parameter that maximizes the probability of the data:
– W_coin = argmax p(D|W_coin)
– w_j = N_j / Σ_j N_j (where N_j counts the number of outcomes of type j)
Learning Naïve Bayes Classifier: Discrete: Math and Pseudocode
• Find the parameter that maximizes the probability of the data:
– W = argmax p(D|W)

w_jk = N_jk / Σ_j N_jk
N_jk = Σ_i I(x_ik = j)

• Pseudocode:
– Foreach attribute k
• Foreach data item i
– Foreach state j
• N(j,k) += 1 if x_ik = j
• Make N(:,k) sum to 1
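
The same fit as a minimal Python sketch (assuming each data item is a tuple of discrete attribute values; in NB you would run one such fit per class):

from collections import Counter

def fit_discrete(X):
    """MLE w_jk = N_jk / sum_j N_jk for each attribute k; X is a list of tuples."""
    n = len(X)
    params = []
    for k in range(len(X[0])):                 # foreach attribute k
        counts = Counter(x[k] for x in X)      # N_jk: count each state j
        params.append({j: c / n for j, c in counts.items()})  # normalize
    return params

# Example: the die from the previous slide, as a single-attribute dataset.
rolls = [(v,) for v, times in [(1, 12), (2, 8), (3, 11), (4, 9), (5, 14), (6, 6)]
         for _ in range(times)]
print(fit_discrete(rolls))   # [{1: 0.2, 2: 0.133, 3: 0.183, 4: 0.15, 5: 0.233, 6: 0.1}]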
Learning Naïve Bayes Classifier: Continuous
• To learn the NB classifier, independently find the parameter that maximizes the probability of the training data.
• For a Gaussian: {μ, σ} = argmax p(D|μ, σ)

p(x|μ, σ) = (1/(σ√(2π))) exp(−(x − μ)²/(2σ²))
μ = (1/N) Σ_i x_i
σ² = (1/(N−1)) Σ_i (x_i − μ)²

• D = {<x_l, x_c, fish>} = {<0.1, 0.3, cod>, <0.2, 0.4, cod>, …, <0.3, 0.2, salm>, <0.4, 0.3, salm>}
• Then:
– Cod: μ_len = (0.1 + 0.2 + …)/N
– Salmon: μ_len = (0.3 + 0.4 + …)/N, etc.

[Figure: fitted class-conditional Gaussians p(Len|Salmon), p(Len|Cod), p(Color|Salmon), p(Color|Cod), with a query point x*]
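
A sketch of the Gaussian fit and the resulting class score in Python (illustrative variable names; uses the N−1 normalized variance as on the slide):

import math

def fit_gaussian(xs):
    """Maximum-likelihood mean and (N-1 normalized) variance of a 1-D sample."""
    n = len(xs)
    mu = sum(xs) / n
    var = sum((x - mu) ** 2 for x in xs) / (n - 1)
    return mu, var

def gaussian_pdf(x, mu, var):
    return math.exp(-(x - mu) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)

# Fit p(len|class) and p(color|class) per class, then score a query x*:
# score(class) = gaussian_pdf(x_len, *len_params[class]) \
#              * gaussian_pdf(x_col, *col_params[class]) * prior[class]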
Naïve Bayes Classifier: Over-fitting
• What if you had ten spams and no real emails with “Viagra”?
– Our parameter estimate equation: p(x_j) = N_j / Σ_j N_j
– P(Viagra|Spam) = 10/(10+0) = 100%
– P(Viagra|Ham) = 0/(10+0) = 0%
• Now you get a long email from a friend that happens to mention Viagra:
– The spam evidence from one “Viagra” overrides every other indication of ham from the email (multiply by zero).
– How to fix?

p(H|D_1..N) = Π_i p(D_i|H) p(H) / Π_i p(D_i)
Naïve Bayes Classifier: Regularization
• What if you had ten spams and no real emails with “Viagra”?
– MLE learning: w_j = N_j / Σ_j N_j
• P(Viagra|Spam) = 10/(10+0) = 100%
• P(Viagra|Ham) = 0/(10+0) = 0%
– Regularized learning, λ = 1: w_j = (N_j + λ) / Σ_j (N_j + λ)
• P(Viagra|Spam) = (10+1)/(11+1) = 92%
• P(Viagra|Ham) = 1/(11+1) = 8%
• Now, with enough positive evidence, an email could be Ham despite including Viagra.
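
In code, the fix is a one-line change to the estimator; a sketch of this additive smoothing (often called Laplace smoothing):

def fit_smoothed(counts, lam=1.0):
    """Regularized estimate w_j = (N_j + lambda) / sum_j (N_j + lambda)."""
    total = sum(n + lam for n in counts.values())
    return {j: (n + lam) / total for j, n in counts.items()}

print(fit_smoothed({"viagra": 10, "no_viagra": 0}))
# {'viagra': 0.9166..., 'no_viagra': 0.0833...} -- no more hard zeros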
NB Classifier: What can it classify?
• For continuous data:
– Naïve Bayes models a line (or quadratic curve)
• For discrete data:
– Naïve Bayes models a line (just like MaxEnt!)
• => A simpler boundary than a DT

[Figure: class-conditional Gaussians p(Len|Salmon), p(Len|Cod), p(Color|Salmon), p(Color|Cod), with a query point x*]
Naïve Bayes Issues: Overconfidence
• Naïve assumption:
– Counts each piece of evidence equally
– Does not exploit attribute correlation

p(D1, D2|H) = p(D1|H) p(D2|H)
p(D_1..N|H) = Π_i p(D_i|H)

• You could attack a spam filter by listing all the fish species below your Viagra ad…
Naïve Bayes Classifier: Relation to MaxEnt
• Both classifiers have simple boundaries
• For data D = {y_i, x_i}:
– MaxEnt: w* = argmax E_MCL(w, D) = Π_i p(y_i|x_i, w)
– Naïve Bayes: w* = argmax E_ML(w, D) = Π_i p(x_i|y_i, w)

p(H|D) = p(D|H) p(H) / p(D)

• NB learning decouples the prior:
– You can take your NB cancer classifier to Chernobyl and it will still work…
– You can move your NB fish classifier from the UK to Norway…
– Your MaxEnt cancer classifier will have to re-train from scratch
An Aside: Online Learning

• Sometimes you want to learn from a data stream instead of from a pre-existing static database:
1. Because you want to keep your model very up-to-date.
2. Because your database is too huge to fit in memory, and you don’t want to read it off disk more than once.
– This task is known as online learning.
• Any algorithm can be re-trained from scratch every time a new row is added from the stream.
– E.g., for MaxEnt you repeat your O(dn) training for each of n data items.
– Inefficient!! Leads to n·O(dn) = O(dn²)
• An algorithm that can update the model from the stream in O(1) (i.e., without revisiting the old database) has the incremental property.
Naïve Bayes Classifier: Online Learning

• Naïve Bayes is naturally online and incremental!
• If you want to learn from a continuous stream of observations:
– Maintain your sufficient statistics N_j
• (i.e., how many times each token j is associated with the current class)
– Add +1 to the appropriate N_j for each new observation x_j
• There is a corresponding regularization for the continuous version:

w_j|D = (N_j + λ) / Σ_j (N_j + λ)
w_j|D, D' = (N_j + N'_j + λ) / Σ_j (N_j + N'_j + λ)
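
A sketch of the incremental update in Python (token counts as the sufficient statistics; smoothed probabilities are recomputed on demand, and the extra unseen-token bucket in the denominator is an assumption of this sketch):

class OnlineNB:
    """Per-class token counts N_j; each update is O(1), no old data revisited."""
    def __init__(self, lam=1.0):
        self.lam = lam
        self.counts = {}                        # class -> {token: N_j}

    def observe(self, token, cls):
        """Add +1 to the appropriate N_j for a new observation."""
        c = self.counts.setdefault(cls, {})
        c[token] = c.get(token, 0) + 1

    def prob(self, token, cls):
        """Smoothed w_j = (N_j + lambda) / sum_j (N_j + lambda)."""
        c = self.counts.get(cls, {})
        total = sum(c.values()) + self.lam * (len(c) + 1)   # +1 unseen bucket
        return (c.get(token, 0) + self.lam) / total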
Naïve Bayes Classifier: Summary
• Naïve Bayes Classifier:
– Representation: Likelihoods + Bayes theorem
– Evaluation: Likelihood
– Train: exact maximum likelihood (each attribute independently); set the prior manually or by ML
– Test: Maximum A-Posteriori

p(H|D1, D2) = p(D1|H) p(D2|H) p(H) / p(D1, D2)

• Properties:
– Train complexity: O(dn)
– Test complexity: O(d)
– Good in high dimensions, even d > n
– Good for Big Data: incremental, online, one-pass
– Can change priors
– Good for mixed-type data
Case Studies: View Angle Classification (EECS work! ☺)
A decision tree classifies face pixels and predicts the view direction.
Classification: Overview

• Decision Trees
• Naïve Bayes
• Performance Metrics
Performance Metrics
• The right metric to use depends on the application.
– Misuse of a metric can be very misleading, so better to understand them!
• Accuracy
• Confusion Matrix
• Expected Utility
• ROC Curve
Performance Metrics: Accuracy
• If the classifier makes predictions y_est and the true values are y_tru:
• Accuracy: percentage of correct answers

Acc = (1/N) Σ_i I(y_i^est = y_i^tru)

• Advantages:
– Easy, single number
• Limitations:
– Doesn’t account for imbalanced data:
• E.g., loans: 90% of people overall pay back their loan
• A bank classifies good/bad borrowers to make lending decisions
• If it classifies all as good => 90% “accurate” …but useless!
– Doesn’t account for which mistakes are made
– Doesn’t account for classifier calibration
Performance Metrics: Confusion Matrix
– The confusion matrix compares how many instances of each actual category are predicted as each estimated category.
• The sum of the confusion matrix diagonal gives the accuracy.
– (Accuracy = % of correct answers, 7/10 = 70% in this example)

              Actual 1  Actual 0               Actual 1  Actual 0
Predicted 1      TP        FP      Predicted 1     4         1
Predicted 0      FN        TN      Predicted 0     2         3
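
A small Python sketch reproducing this example's counts and accuracy:

def confusion(y_true, y_pred):
    """Binary confusion counts (1 = positive class)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    return tp, fp, fn, tn

y_true = [1, 1, 1, 1, 1, 1, 0, 0, 0, 0]    # 6 positives, 4 negatives
y_pred = [1, 1, 1, 1, 0, 0, 1, 0, 0, 0]    # matches the slide's 4/1/2/3 matrix
tp, fp, fn, tn = confusion(y_true, y_pred)
print((tp + tn) / len(y_true))             # 0.7 -- the diagonal over N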
Performance Metrics: Confusion Matrix
– The confusion matrix compares how many instances of each actual category are predicted as each estimated category.
• Sometimes which mistakes you make matters more than the total number of mistakes.
– E.g., loans: predicting good/bad credit.
• Consider two classifier results:
– Accuracy = 50% in each case
– Both classifiers get the bank 3 loans worth of interest payments
– But which is more useful?
– Classifier A: Lost business: 1, Bad loans: 4
– Classifier B: Lost business: 4, Bad loans: 1

              A: Actual G  Actual B       B: Actual G  Actual B
Predicted G        3           4               3           1
Predicted B        1           2               4           2
Performance Metrics: Confusion Matrix
• Accuracy results can mislead if you have imbalanced data.
– A normalised confusion matrix can reveal this.
– E.g., assume 90% of loans are good, so an “accurate” classifier can simply predict that all loans are good. Overall accuracy = 9/10 = 90%. But it’s useless.
– True Positive Rate (fraction of positives identified as positive)
– True Negative Rate (fraction of negatives identified as negative)
– Old diagonal: (TP+TN)/N = 90%. New diagonal: (TPR+TNR)/2 = 50%.

              Actual 1  Actual 0          Actual G  Actual B          Actual G  Actual B
Predicted 1      TP        FP     Pred G     9         1      Pred G  TPR=1.0   FPR=1.0
Predicted 0      FN        TN     Pred B     0         0      Pred B  FNR=0.0   TNR=0.0
Performance Metrics: Confusion Matrix
• Different applications care about different parts of the confusion matrix.
– E.g., a bank cares more about minimizing FPR (bad loans) than FNR (lost business).
– E.g., a high-security system cares more about minimizing FNR (permitted break-ins) than FPR (false alarms).
• Why? Because each outcome has a different cost.

              Actual 1  Actual 0          Actual G  Actual B          Actual G  Actual B
Predicted 1      TP        FP     Pred G     3         1      Pred G     3/4       1/4
Predicted 0      FN        TN     Pred B     4         2      Pred B     4/6       2/6
Performance Metrics: Expected Value
• Which loan classifier is better, and by how much?
– A makes more good loans, but B makes fewer bad loans.
– An Expected Value calculation gives a single number, given a confusion matrix and a cost matrix:

EV = P(Outcome1)·Val(Outcome1) + P(Outcome2)·Val(Outcome2) + …

Costs:       Actual G  Actual B
Pred G          $2       −$4
Pred B        −$0.1     −$0.1

A:           Actual G  Actual B     EV_A = (2×3 − 4×4 − 0.1×1 − 0.1×2)/10 ≈ −$1 per customer
Pred G           3         4
Pred B           1         2

B:           Actual G  Actual B     EV_B = (2×2 − 4×0 − 0.1×2 − 0.1×6)/10 = $0.32 per customer
Pred G           2         0
Pred B           2         6
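
The same calculation as a short Python sketch:

def expected_value(confusion, cost, n):
    """Average value per customer: sum of count * cost over all cells."""
    return sum(confusion[i][j] * cost[i][j]
               for i in range(2) for j in range(2)) / n

cost = [[2.0, -4.0],      # predicted Good: actual Good, actual Bad
        [-0.1, -0.1]]     # predicted Bad:  actual Good, actual Bad

print(expected_value([[3, 4], [1, 2]], cost, 10))   # -1.03 -> classifier A
print(expected_value([[2, 0], [2, 6]], cost, 10))   #  0.32 -> classifier B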
Performance Metrics: ROC
• All of the metrics discussed so far depend on classifier calibration.
– You are correct if your final estimate is y_est = y_tru.
– But good binary classifiers can output a confidence as well as a class…
– By default the classifier says: y_est = 1 if p(y) > 0.5
• If you are worried about FPs, you could say: y_est = 1 if p(y) > 0.75
• If you want to maximize TPs, you could say: y_est = 1 if p(y) > 0.25
• This threshold will change the distribution in the confusion matrix.
– Since the threshold is user/business-context dependent…
• Is there a way to evaluate a classifier independently of the threshold?
– So we can evaluate independently of the end user.

              Predicted 1  Predicted 0
Actual 1          TP           FN
Actual 0          FP           TN
Performance Metrics: ROC
• Consider a variety of thresholds:
– Each threshold defines a TPR and an FPR.

[Figure: score distributions P(Good|x) for bad and good loans with a sliding threshold, and the corresponding (FPR, TPR) point on ROC axes]
Performance Metrics: ROC
• Consider a variety of thresholds:
– Each threshold defines a TPR and an FPR.
– The ROC curve is the graph of TPRs and FPRs.
• (Receiver Operating Characteristic)

[Figure: confusion matrices at two thresholds and the resulting points on the ROC curve]
Performance Metrics: ROC
• Consider a variety of thresholds:
– Each threshold defines a TPR and an FPR.
– The ROC curve is the graph of TPRs and FPRs.
• (Receiver Operating Characteristic)
– Better ROC curves approach the top left.
– The area under the ROC curve is a threshold-independent measure of goodness.
• (AUROC: Perfect: 1, Worst: 0, Random: 0.5)

[Figure: ROC curves, with the perfect classifier hugging the top-left corner]
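
A sketch of how the ROC curve and its area could be computed from classifier scores (assumes a higher score means a more confident positive; ties between scores are not handled specially):

def roc_points(y_true, scores):
    """(FPR, TPR) points swept from the strictest threshold to the loosest."""
    pos = sum(y_true)
    neg = len(y_true) - pos
    tp = fp = 0
    points = [(0.0, 0.0)]
    for _, y in sorted(zip(scores, y_true), reverse=True):
        if y == 1:
            tp += 1
        else:
            fp += 1
        points.append((fp / neg, tp / pos))
    return points

def auroc(points):
    """Trapezoidal area under the ROC curve (1 perfect, 0.5 random)."""
    return sum((x2 - x1) * (y1 + y2) / 2
               for (x1, y1), (x2, y2) in zip(points, points[1:]))

pts = roc_points([1, 1, 0, 1, 0], [0.9, 0.8, 0.7, 0.6, 0.2])
print(auroc(pts))   # ~0.83 for this toy scoring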
Summary

• Understand the meaning and uses of different performance metrics:
– Accuracy, confusion matrix, expected value and
ROC curve
Summary: You Should Know

• What is the process for classification using a Decision Tree?
• Sketch an algorithm to learn a Decision Tree
• What is the process for classification using Naïve Bayes?
• Sketch the algorithm to learn Naïve Bayes
• Pros and cons of Decision Trees versus Naïve Bayes
• Understand the meaning and uses of different performance metrics:
– Accuracy, confusion matrix, expected value and ROC
curve
