Course Notes - Basic Probability

This document provides an introduction to key concepts in probability that are important for statistics and data science. It defines probability as the likelihood of an event occurring, and explains how to calculate it using the probability formula. It also discusses expected values, frequency distributions, complements, and how understanding these probability concepts is crucial for mastering machine learning and data analysis.

Uploaded by

Anshul

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

191 views

Course Notes - Basic Probability

Uploaded by

Anshul

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

PROBABILITY FOR STATISTICS AND DATA

SCIENCE
Introduction to Probability: Cheat Sheet
Probability Formula | Sample Space | Expected Values | Complements
Words of welcome

ML
Dataset
Insight

You are here because you want to comprehend the basics of probability before you can dive into the world of statistics
and machine learning. Understanding the driving forces behind key statistical features is crucial to reaching your goal of
mastering data science. This way you will be able to extract important insight when analysing data through supervised
machine learning methods like regressions, but also fathom the outputs unsupervised or assisted ML give you.

Bayesian Inference is a key component heavily used in many fields of mathematics to succinctly express complicated
statements. Through Bayesian Notation we can convey the relationships between elements, sets and events.
Understanding these new concepts will aid you in interpreting the mathematical intuition behind sophisticated data
analytics methods.

Distributions are the main way we lie to classify sets of data. If a dataset complies with certain characteristics, we can
usually attribute the likelihood of its values to a specific distribution. Since many of these distributions have elegant
relationships between certain outcomes and their probabilities of occurring, knowing key features of our data is
extremely convenient and useful.
What is probability?

Probability is the likelihood of an event occurring. This event can be pretty much anything – getting heads, rolling a 4 or even
bench pressing 225lbs. We measure probability with numeric values between 0 and 1, because we like to compare the relative
likelihood of events. Observe the general probability formula.

𝑃𝑟𝑒𝑓𝑒𝑟𝑟𝑒𝑑 𝑜𝑢𝑡𝑐𝑜𝑚𝑒𝑠
P(X)=
𝑆𝑎𝑚𝑝𝑙𝑒 𝑆𝑝𝑎𝑐𝑒

Probability Formula:
• The Probability of event X occurring equals the number of preferred outcomes over the number of outcomes in the
sample space.
• Preferred outcomes are the outcomes we want to occur or the outcomes we are interested in. We also call refer to such
outcomes as “Favorable”.
• Sample space refers to all possible outcomes that can occur. Its “size” indicates the amount of elements in it.

If two events are independent:

The probability of them occurring simultaneously equals the product of them occurring on their own.
P(A ) = P(A) . P( )
Expected Values
Trial – Observing an event occur and recording the outcome.
Experiment – A collection of one or multiple trials.
Experimental Probability – The probability we assign an event, based on an experiment we conduct.
Expected value – the specific outcome we expect to occur when we run an experiment.

Example: Trial Example: Experiment

Flipping a coin and recording the outcome. Flipping a coin 20 times and recording the 20 individual outcomes.

In this instance, the experimental probability for getting heads would equal the number of heads we record over the course of
the 20 outcomes, over 20 (the total number of trials).

The expected value can be numerical, Boolean, categorical or other, depending on the type of the event we are interested in. For
instance, the expected value of the trial would be the more likely of the two outcomes, whereas the expected value of the experiment
will be the number of time we expect to get either heads or tails after the 20 trials.

Expected value for categorical variables. Expected value for numeric variables.
𝑛

𝐸 𝑋 =𝑛×𝑝 𝐸 𝑋 = ෍ 𝑥𝑖 × 𝑝𝑖
𝑖=1
Probability Frequency Distribution

What is a probability frequency distribution?:

A collection of the probabilities for each possible outcome of an
event.

Why do we need frequency distributions?:

We need the probability frequency distribution to try and predict
future events when the expected value is unattainable.

What is a frequency?:
Frequency is the number of times a given value or outcome
appears in the sample space.

What is a frequency distribution table?:

The frequency distribution table is a table matching each distinct
outcome in the sample space to its associated frequency.

How do we obtain the probability frequency distribution

from the frequency distribution table?:
By dividing every frequency by the size of the sample space.
(Think about the “favoured over all” formula.)
Complements

The complement of an event is everything an event is not. We denote the complement of an event with an apostrophe.

A’ = Not A
complement original event
opposite

Characteristics of complements:
• Can never occur simultaneously.
• Add up to the sample space. (A + A’ = Sample space)
• Their probabilities add up to 1. (P(A) + P(A’) = 1)
• The complement of a complement is the original event. ((A’)’ = A)
Example:
• Assume event A represents drawing a spade, so P(A) = 0.25.
• Then, A’ represents not drawing a spade, so drawing a club, a diamond or a heart. P(A’) = 1 – P(A), so P(A’) = 0.75.

Practice-Exam-for-Design-of-Experiments-DOE
100% (1)
Practice-Exam-for-Design-of-Experiments-DOE
30 pages
Pro100 Shortcuts
No ratings yet
Pro100 Shortcuts
2 pages
MITx 6.86x Notes - MD
No ratings yet
MITx 6.86x Notes - MD
91 pages
Notes For 18.6501x, Fundamentals of Statistics: v0.2 (2019 April 24)
100% (1)
Notes For 18.6501x, Fundamentals of Statistics: v0.2 (2019 April 24)
14 pages
Homework 8 - Solution
No ratings yet
Homework 8 - Solution
8 pages
Flask Cheat Sheet
No ratings yet
Flask Cheat Sheet
3 pages
Formulas in Inferential Statistics
No ratings yet
Formulas in Inferential Statistics
4 pages
Algorithm Analysis Cheat Sheet PDF
0% (1)
Algorithm Analysis Cheat Sheet PDF
2 pages
Science Sample Lesson Plan: Title: Pollution Affects Bodies of Water and Living Organisms Grade: 2 Materials
100% (1)
Science Sample Lesson Plan: Title: Pollution Affects Bodies of Water and Living Organisms Grade: 2 Materials
3 pages
Probability Cheat Sheet
No ratings yet
Probability Cheat Sheet
1 page
R Package Recommendation
No ratings yet
R Package Recommendation
4 pages
Matlab To Numpy PDF
No ratings yet
Matlab To Numpy PDF
14 pages
UNIX Command Cheat Sheets
No ratings yet
UNIX Command Cheat Sheets
11 pages
Categorical Data Analysis
No ratings yet
Categorical Data Analysis
11 pages
Hypothesis Testing: Categorical Data Analysis
No ratings yet
Hypothesis Testing: Categorical Data Analysis
54 pages
A Glossary of Statistics-001A
No ratings yet
A Glossary of Statistics-001A
24 pages
Statistics Probability Midterm Cheat Sheet
0% (1)
Statistics Probability Midterm Cheat Sheet
5 pages
1.2 Matrices Notes Presentation
No ratings yet
1.2 Matrices Notes Presentation
5 pages
R Packages For Machine Learning
No ratings yet
R Packages For Machine Learning
3 pages
Letters.: /double For /fraktur For
No ratings yet
Letters.: /double For /fraktur For
4 pages
Important Statistics Formulas
No ratings yet
Important Statistics Formulas
7 pages
Chapter 2
No ratings yet
Chapter 2
34 pages
Matrices: Definition. A
No ratings yet
Matrices: Definition. A
5 pages
Diagonalization: Definition. A Matrix
No ratings yet
Diagonalization: Definition. A Matrix
5 pages
Markdown Cheatsheet PDF
No ratings yet
Markdown Cheatsheet PDF
2 pages
The Three MS: Analysis Data
No ratings yet
The Three MS: Analysis Data
5 pages
GATE:linear Algebra SAMPLE QUESTIONS
No ratings yet
GATE:linear Algebra SAMPLE QUESTIONS
14 pages
Linear Algebra Final Review
No ratings yet
Linear Algebra Final Review
7 pages
Grey Fox 2014 Schedule
No ratings yet
Grey Fox 2014 Schedule
6 pages
11 Parameter Estimation
No ratings yet
11 Parameter Estimation
6 pages
Tips For Using Word Equation Editor
No ratings yet
Tips For Using Word Equation Editor
3 pages
Introduction and Basic Operations
No ratings yet
Introduction and Basic Operations
26 pages
Estimation of Parameters: Example
No ratings yet
Estimation of Parameters: Example
2 pages
Hypothesis Testing On A Single Population Proportion
No ratings yet
Hypothesis Testing On A Single Population Proportion
1 page
Formula Stables
No ratings yet
Formula Stables
29 pages
MATRICES
No ratings yet
MATRICES
1 page
Laptop Review PDF
No ratings yet
Laptop Review PDF
12 pages
Tutorial Sheet
No ratings yet
Tutorial Sheet
2 pages
Formulae Sheet
No ratings yet
Formulae Sheet
11 pages
Chandan Mandal 11
100% (1)
Chandan Mandal 11
14 pages
340 Printable Course Notes
No ratings yet
340 Printable Course Notes
184 pages
Beta Distribution
No ratings yet
Beta Distribution
8 pages
Matrices Determinants
No ratings yet
Matrices Determinants
3 pages
Matrices
No ratings yet
Matrices
3 pages
If Are Partitions of Probability Space S: AB A B AB A B
No ratings yet
If Are Partitions of Probability Space S: AB A B AB A B
4 pages
50 Keyboard Shortcuts For Google Earth
No ratings yet
50 Keyboard Shortcuts For Google Earth
4 pages
2.4 Transition Matrices
No ratings yet
2.4 Transition Matrices
9 pages
Statistics For Management and Economics, Tenth Edition Formulas
No ratings yet
Statistics For Management and Economics, Tenth Edition Formulas
11 pages
Sampling and Surveying Handbook
100% (1)
Sampling and Surveying Handbook
72 pages
Algorithem Cheat Sheet
No ratings yet
Algorithem Cheat Sheet
25 pages
Categorical Data Analysis With Graphics
No ratings yet
Categorical Data Analysis With Graphics
104 pages
Statistics Packet
No ratings yet
Statistics Packet
17 pages
Energy Eigenvalues and Eigenstates
No ratings yet
Energy Eigenvalues and Eigenstates
4 pages
ST1131 Cheat Sheet Page 1
0% (1)
ST1131 Cheat Sheet Page 1
1 page
(Hadley Wickham) Ggplot Book
No ratings yet
(Hadley Wickham) Ggplot Book
50 pages
Assignment 1 Ver 2.0
No ratings yet
Assignment 1 Ver 2.0
3 pages
Table of Math Symbols
No ratings yet
Table of Math Symbols
16 pages
1 Basic Math Symbols
No ratings yet
1 Basic Math Symbols
13 pages
10.2 Generalized Eigenvectors
No ratings yet
10.2 Generalized Eigenvectors
4 pages
Big-O Algorithm Complexity Cheat Sheet
No ratings yet
Big-O Algorithm Complexity Cheat Sheet
4 pages
Course Notes Basic Probability
No ratings yet
Course Notes Basic Probability
6 pages
Probability - Course Notes
No ratings yet
Probability - Course Notes
58 pages
Research Designs: Dr. Khalid Manzoor Butt
No ratings yet
Research Designs: Dr. Khalid Manzoor Butt
22 pages
Natalie Nava 2f Lab Write-Up Template
No ratings yet
Natalie Nava 2f Lab Write-Up Template
4 pages
Huraian Sukatan Pelajaran Fizik Tingkatan 4
No ratings yet
Huraian Sukatan Pelajaran Fizik Tingkatan 4
57 pages
Capstone Chapters 1 3
No ratings yet
Capstone Chapters 1 3
18 pages
Preliminary Activity For A Local Weather Study: Experiment
No ratings yet
Preliminary Activity For A Local Weather Study: Experiment
8 pages
Motion of A Pendulum Practical: Key Concepts and Background Knowledge
No ratings yet
Motion of A Pendulum Practical: Key Concepts and Background Knowledge
6 pages
Methods of Educational Psychology
50% (2)
Methods of Educational Psychology
4 pages
Form 1 C1 Science Is Part of Daily Life
No ratings yet
Form 1 C1 Science Is Part of Daily Life
1 page
Q4_18_04_10
No ratings yet
Q4_18_04_10
27 pages
Year 8 Biology Exam
No ratings yet
Year 8 Biology Exam
4 pages
Scientific Revolution Module
100% (1)
Scientific Revolution Module
10 pages
Research Methodology Final
100% (2)
Research Methodology Final
70 pages
GCE Applied Art and Design: April 2007
No ratings yet
GCE Applied Art and Design: April 2007
17 pages
Design of Experiment
No ratings yet
Design of Experiment
25 pages
NCM 108 Nuremberg Code, Declaration of Helsinki, Belmont Report
No ratings yet
NCM 108 Nuremberg Code, Declaration of Helsinki, Belmont Report
9 pages
With Answer Review 1
No ratings yet
With Answer Review 1
5 pages
Proposal Research
No ratings yet
Proposal Research
19 pages
MCA1PRA
0% (1)
MCA1PRA
45 pages
Exam 3 Review
No ratings yet
Exam 3 Review
16 pages
Campuchia BiochemLab ThuPM Report03
No ratings yet
Campuchia BiochemLab ThuPM Report03
16 pages
History of FBA
No ratings yet
History of FBA
23 pages
3250-Article Text-6299-1-10-20190531 (1)
No ratings yet
3250-Article Text-6299-1-10-20190531 (1)
10 pages
102 Occupational Therapy For Physical Dysfunction
No ratings yet
102 Occupational Therapy For Physical Dysfunction
61 pages
Diss Lesson 1
No ratings yet
Diss Lesson 1
55 pages
Sains - Integrated Curriculum For Primary School
100% (6)
Sains - Integrated Curriculum For Primary School
17 pages
Research Methods & Design Outline: - Types of Research Design - How To Choose A Research Design - Issues in Research Design
No ratings yet
Research Methods & Design Outline: - Types of Research Design - How To Choose A Research Design - Issues in Research Design
27 pages
I J A N: Indian Journal OF Animal Nutrition
No ratings yet
I J A N: Indian Journal OF Animal Nutrition
100 pages
Human Biology
No ratings yet
Human Biology
677 pages