Outliers, Variances, Probability Distributions (1) (Read-Only)

This document defines and explains outliers, variance, probability distributions, and correlations. It provides definitions for outliers as distant data points from the rest of a dataset. Variance measures how widely data points vary from the expected value. Probability distributions show the distribution of probability values as a function of variables. Correlation analyzes the strength and direction of relationships between two variables on a 0-100 scale.

Uploaded by

Anagha M

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

120 views8 pages

Outliers, Variances, Probability Distributions (1) (Read-Only)

Uploaded by

Anagha M

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 8

OUTLIERS, VARIANCES,

PROBABILITY DISTRIBUTIONS,
AND CORRELATIONS
Amisha Sarika Gowda (1GA19CS011)
Amulya V D (1GA19CS013)
Anagha M (1GA19CS014)
Anagha S (1GA19CS015)
OUTLIERS
 Outliers are data points that are numerically far distant
from the rest of the points in a dataset.
 There are several reasons for the presence of outliers in
relationships. Some of these are:
 Anomalous situation
 Presence of a previously unknown fact
 Human error
 Sampling error
VARIANCE
 Variance measures by the sum of squares of the
difference in values of a variable with respect to the
expected value.
 Variance indicates how widely data points in a dataset
vary.
 A high variance indicates that the data in the dataset is
very much spread out over a large area (random dataset),
whereas a low variance indicates that the data is very
similar in nature.
PROBABILISTIC DISTRIBUTION
 Probability distribution is the distribution of P values as
a function of all possible independent values, variables,
situations, distances or variables.
 The standard normal distribution formula is:
Normal distribution
 It relates to Gaussian function. Figure shows distribution
around , standard deviation and variance
 The figure also shows the percentages of areas in five
regions with respect to the total area under the curve for
P(x).
 The variance for probability distribution represents how
individual data points relate to each other within a
dataset.
 The variance is the average of the squared differences
between each data value and the mean.
CORRELATION
 Correlation means analysis which lets us find the
association or the absence of the relationship between
two variables, x and y.
 Correlation gives the strength of the relationship
between the model and the dependent variable on a
convenient 0-100% scale.
 Correlation is a statistical technique that measures and
describes the 'strength' and 'direction’ of the relationship
between two variables.
CORRELATION
 The correlation r between the two variables x and y is:

 where n is the number of observations in the sample, xi

is the x value for observation i, x dash is the sample
mean of x, yi is the y value for observation i, y dash is
the sample mean of y, sx is the sample standard deviation
of x, and sy is the sample standard deviation of y.
THANK YOU

5th Sem BCS515B - AI - Module3
No ratings yet
5th Sem BCS515B - AI - Module3
113 pages
Message Authentication
No ratings yet
Message Authentication
47 pages
Graph Mining Techniques Overview
No ratings yet
Graph Mining Techniques Overview
23 pages
Lab 1: Preprocessing Using Python
No ratings yet
Lab 1: Preprocessing Using Python
5 pages
Cryptography & Network Security Course
No ratings yet
Cryptography & Network Security Course
5 pages
Unit 1 Introduction To Datascience
No ratings yet
Unit 1 Introduction To Datascience
14 pages
Pressman CH 3 Prescriptive Process Models
No ratings yet
Pressman CH 3 Prescriptive Process Models
36 pages
Avanthi'S Research &technological Academy: Data Mining Lab
No ratings yet
Avanthi'S Research &technological Academy: Data Mining Lab
50 pages
Unit I - Sensor Classification, Characteristics and Signal Types
No ratings yet
Unit I - Sensor Classification, Characteristics and Signal Types
51 pages
OOAD Lab Manual: UML & Diagrams
0% (1)
OOAD Lab Manual: UML & Diagrams
199 pages
Exploratory Data Analysis in Data Science
No ratings yet
Exploratory Data Analysis in Data Science
31 pages
DVT Paper
No ratings yet
DVT Paper
1 page
MCA Mathematical Foundation For Computer Application 05
No ratings yet
MCA Mathematical Foundation For Computer Application 05
25 pages
Data Science Techniques Overview
No ratings yet
Data Science Techniques Overview
5 pages
2 Binning Techniques in Data Mining With Examples
No ratings yet
2 Binning Techniques in Data Mining With Examples
10 pages
Unit Ii
No ratings yet
Unit Ii
61 pages
Product Metrics
100% (1)
Product Metrics
15 pages
Web Analytics
No ratings yet
Web Analytics
6 pages
Product Metrics For Software
No ratings yet
Product Metrics For Software
29 pages
4.gilb's Approach
100% (1)
4.gilb's Approach
22 pages
Python Unit-1
No ratings yet
Python Unit-1
73 pages
Optimization Technique Course Objective
No ratings yet
Optimization Technique Course Objective
1 page
CN Unit-5
No ratings yet
CN Unit-5
72 pages
Python Programming with Django Framework
No ratings yet
Python Programming with Django Framework
2 pages
Iot Unit 3
No ratings yet
Iot Unit 3
4 pages
Halstead's Operators and Operands Guide
100% (6)
Halstead's Operators and Operands Guide
5 pages
Bottom-Up Parsing Techniques Explained
No ratings yet
Bottom-Up Parsing Techniques Explained
31 pages
2020 CS300 Lecture01 IntroductionToAI
100% (1)
2020 CS300 Lecture01 IntroductionToAI
46 pages
Module 3
No ratings yet
Module 3
43 pages
Python Programming: Unit 5
No ratings yet
Python Programming: Unit 5
19 pages
Ids Unit 5 Final
No ratings yet
Ids Unit 5 Final
25 pages
Social Network Analysis Notes
No ratings yet
Social Network Analysis Notes
8 pages
Branching and Looping in C Programming
No ratings yet
Branching and Looping in C Programming
33 pages
Software Engineering
No ratings yet
Software Engineering
3 pages
Lab Assignment1 Mongodb
100% (1)
Lab Assignment1 Mongodb
2 pages
FSD Notes
No ratings yet
FSD Notes
47 pages
Data Discretization Techniques
No ratings yet
Data Discretization Techniques
21 pages
Knowledge Representation Issue
No ratings yet
Knowledge Representation Issue
18 pages
Operator
No ratings yet
Operator
29 pages
Unit - 5 Multivariate Analysis
No ratings yet
Unit - 5 Multivariate Analysis
29 pages
Module 1
No ratings yet
Module 1
96 pages
Business Analytics Local Author Book 1
No ratings yet
Business Analytics Local Author Book 1
233 pages
NPTEL Domain Certification Overview
No ratings yet
NPTEL Domain Certification Overview
1 page
Ad3411 - Student
No ratings yet
Ad3411 - Student
27 pages
3 Unit - Dspu
No ratings yet
3 Unit - Dspu
23 pages
1 Need, Features & Advantages of C
No ratings yet
1 Need, Features & Advantages of C
17 pages
Ocs353 DSF Unit III Notes
No ratings yet
Ocs353 DSF Unit III Notes
11 pages
DSBDAL - Assignment No 9
No ratings yet
DSBDAL - Assignment No 9
12 pages
Shell Script To Find Largest and Smallest of Given 3 Numbers
No ratings yet
Shell Script To Find Largest and Smallest of Given 3 Numbers
8 pages
Daa Ktu Notes
No ratings yet
Daa Ktu Notes
112 pages
Soft Computing
No ratings yet
Soft Computing
13 pages
Department of Computer Science: Pachamuthu College of Arts and Science For Women Dharmapuri
No ratings yet
Department of Computer Science: Pachamuthu College of Arts and Science For Women Dharmapuri
128 pages
SPM Lecture Notes 2023 (R20 III-I)
No ratings yet
SPM Lecture Notes 2023 (R20 III-I)
76 pages
PHP PDF Generation with Graphics Concepts
No ratings yet
PHP PDF Generation with Graphics Concepts
6 pages
Evolution of Big Data
No ratings yet
Evolution of Big Data
21 pages
HMAC and CMAC: Cryptographic Overview
No ratings yet
HMAC and CMAC: Cryptographic Overview
14 pages
Data Structures: Hashing & Search
No ratings yet
Data Structures: Hashing & Search
55 pages
Module5 Bigdata Analytics
No ratings yet
Module5 Bigdata Analytics
110 pages
AIML
No ratings yet
AIML
14 pages
Stastics For Data Science1 (Quiz1 Notes)
No ratings yet
Stastics For Data Science1 (Quiz1 Notes)
2 pages
Car Showroom Final Report
No ratings yet
Car Showroom Final Report
41 pages
Car Driving Schhol Project Report
No ratings yet
Car Driving Schhol Project Report
48 pages
Keylogger
No ratings yet
Keylogger
6 pages
Advanced Keylogger For Ethical Hacking
No ratings yet
Advanced Keylogger For Ethical Hacking
6 pages