Lecture 2 - Statistical Inference - EDA and DS Process - 02032023 111156am 1 - 1 27022024 012412pm
Lecture 2 - Statistical Inference - EDA and DS Process - 02032023 111156am 1 - 1 27022024 012412pm
What is data ?
Data uncertainty and randomness
Understanding of Descriptive and Statistical Inference
Populations and samples
Fitting a model
Data Science Process
What is exploratory data analysis?(EDA)
Python and its basic fuctions
Numpy and Pandas
Data represents the traces of the real-world processes.
OR
A datum is an abstraction of real world entity.
The terms variable, feature, and attribute are often used
interchangeably to denote an individual abstraction.
Each entity is typically described by a number of attributes
A data set consists of the data relating to a collection of
entities, with each entity described in terms of a set of
attributes
Randomness and uncertainty are inherent aspects of data
analysis
Effective data analysis involves understanding and
appropriately accounting for these factors to make informed
decisions and draw reliable conclusions.
A mathematical model for uncertainty and randomness is
offered by probability theory.
Uncertainty refers to the lack of complete Knowledge about
the outcome of a measurement or observation.
It arises from various sources, including measurement error,
variability in the data, and incomplete information
https://www.linkedin.com/pulse/model-fitting-i4data/
“Exploratory data analysis” is an attitude, a state of
flexibility, a willingness to look for those things that we
believe are not there, as well as those we believe to be
there. — John Tukey
import numpy as np
std_dev = np.sqrt(variance)
variance = np.var(data)
Takeinput string name from the user and display the names
having more than 5 letters