Statistics Introduction

The document discusses the mtcars data set in R, which contains information on 32 cars. It describes that the data set has 32 observations and 11 variables, and provides definitions of the variables. Examples are given to find the dimensions, variable names, row names, and print, sort, and analyze values of variables.

Uploaded by

Rukmaninambi

Statistics Introduction

Statistics is the science of analyzing, reviewing, and drawing conclusions from data.

Some basic statistical numbers include:

• Mean, median and mode
• Minimum and maximum value
• Percentiles
• Variance and standard deviation
• Covariance and correlation
• Probability distributions

The R language was developed by two statisticians, Ross Ihaka and Robert Gentleman. It has many built-in functions for statistical analysis, and additional libraries extend it further for the same purpose.
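As a rough illustration of those built-in functions, the sketch below computes most of the statistical numbers listed above for the mtcars data set that ships with R. The variable names (mpg, wt, mode_cyl and so on) are only illustrative. Base R has no function for the statistical mode, so the table() approach shown here is one possible workaround rather than a standard function.

```r
# Basic statistical numbers for variables of the built-in mtcars data set.
mpg <- mtcars$mpg   # miles per (US) gallon
wt  <- mtcars$wt    # weight in 1000 lbs

mean_mpg   <- mean(mpg)
median_mpg <- median(mpg)
range_mpg  <- range(mpg)            # minimum and maximum
p75_mpg    <- quantile(mpg, 0.75)   # 75th percentile
var_mpg    <- var(mpg)              # sample variance
sd_mpg     <- sd(mpg)               # sample standard deviation
cov_mpg_wt <- cov(mpg, wt)          # covariance of mpg and weight
cor_mpg_wt <- cor(mpg, wt)          # correlation of mpg and weight

# mode() in R returns the storage type, not the statistical mode,
# so count the values with table() and keep the most frequent one(s).
counts   <- table(mtcars$cyl)
mode_cyl <- as.numeric(names(counts)[counts == max(counts)])

print(c(mean = mean_mpg, median = median_mpg, sd = sd_mpg))
print(mode_cyl)
```

Probability distributions are left out here; base R covers them with function families such as dnorm(), pnorm(), qnorm() and rnorm().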

R Data Set
Data Set
A data set is a collection of data, often presented in a table.

R has a popular built-in data set called "mtcars" (Motor Trend Car Road
Tests), whose data was extracted from the 1974 Motor Trend US magazine.

In the examples below (and in the following chapters), we will use the mtcars data set
for statistical purposes:

Example
# Print the mtcars data set
mtcars

Result:

                     mpg cyl  disp  hp drat    wt  qsec vs am gear carb
Mazda RX4           21.0   6 160.0 110 3.90 2.620 16.46  0  1    4    4
Mazda RX4 Wag       21.0   6 160.0 110 3.90 2.875 17.02  0  1    4    4
Datsun 710          22.8   4 108.0  93 3.85 2.320 18.61  1  1    4    1
Hornet 4 Drive      21.4   6 258.0 110 3.08 3.215 19.44  1  0    3    1
Hornet Sportabout   18.7   8 360.0 175 3.15 3.440 17.02  0  0    3    2
Valiant             18.1   6 225.0 105 2.76 3.460 20.22  1  0    3    1
Duster 360          14.3   8 360.0 245 3.21 3.570 15.84  0  0    3    4
Merc 240D           24.4   4 146.7  62 3.69 3.190 20.00  1  0    4    2
Merc 230            22.8   4 140.8  95 3.92 3.150 22.90  1  0    4    2
Merc 280            19.2   6 167.6 123 3.92 3.440 18.30  1  0    4    4
Merc 280C           17.8   6 167.6 123 3.92 3.440 18.90  1  0    4    4
Merc 450SE          16.4   8 275.8 180 3.07 4.070 17.40  0  0    3    3
Merc 450SL          17.3   8 275.8 180 3.07 3.730 17.60  0  0    3    3
Merc 450SLC         15.2   8 275.8 180 3.07 3.780 18.00  0  0    3    3
Cadillac Fleetwood  10.4   8 472.0 205 2.93 5.250 17.98  0  0    3    4
Lincoln Continental 10.4   8 460.0 215 3.00 5.424 17.82  0  0    3    4
Chrysler Imperial   14.7   8 440.0 230 3.23 5.345 17.42  0  0    3    4
Fiat 128            32.4   4  78.7  66 4.08 2.200 19.47  1  1    4    1
Honda Civic         30.4   4  75.7  52 4.93 1.615 18.52  1  1    4    2
Toyota Corolla      33.9   4  71.1  65 4.22 1.835 19.90  1  1    4    1
Toyota Corona       21.5   4 120.1  97 3.70 2.465 20.01  1  0    3    1
Dodge Challenger    15.5   8 318.0 150 2.76 3.520 16.87  0  0    3    2
AMC Javelin         15.2   8 304.0 150 3.15 3.435 17.30  0  0    3    2
Camaro Z28          13.3   8 350.0 245 3.73 3.840 15.41  0  0    3    4
Pontiac Firebird    19.2   8 400.0 175 3.08 3.845 17.05  0  0    3    2
Fiat X1-9           27.3   4  79.0  66 4.08 1.935 18.90  1  1    4    1
Porsche 914-2       26.0   4 120.3  91 4.43 2.140 16.70  0  1    5    2
Lotus Europa        30.4   4  95.1 113 3.77 1.513 16.90  1  1    5    2
Ford Pantera L      15.8   8 351.0 264 4.22 3.170 14.50  0  1    5    4
Ferrari Dino        19.7   6 145.0 175 3.62 2.770 15.50  0  1    5    6
Maserati Bora       15.0   8 301.0 335 3.54 3.570 14.60  0  1    5    8
Volvo 142E          21.4   4 121.0 109 4.11 2.780 18.60  1  1    4    2
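Printing all 32 rows is often more than is needed for a first look. As a small sketch, head() and tail() from base R show only the first or last rows; the variable names first_rows and last_rows are just illustrative.

```r
# Preview only the first rows instead of printing all 32.
first_rows <- head(mtcars, n = 3)
print(first_rows)

# tail() works the same way for the last rows.
last_rows <- tail(mtcars, n = 3)
print(last_rows)
```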
Information About the Data Set
You can use the question mark (?) to get information about the mtcars data set:

Example
# Use the question mark to get information about the data set

?mtcars

Result:

mtcars {datasets} R Documentation

Motor Trend Car Road Tests

Description
The data was extracted from the 1974 Motor Trend US magazine, and comprises
fuel consumption and 10 aspects of automobile design and performance for 32
automobiles (1973-74 models).

Usage
mtcars

Format
A data frame with 32 observations on 11 (numeric) variables.

[, 1]  mpg   Miles/(US) gallon
[, 2]  cyl   Number of cylinders
[, 3]  disp  Displacement (cu.in.)
[, 4]  hp    Gross horsepower
[, 5]  drat  Rear axle ratio
[, 6]  wt    Weight (1000 lbs)
[, 7]  qsec  1/4 mile time
[, 8]  vs    Engine (0 = V-shaped, 1 = straight)
[, 9]  am    Transmission (0 = automatic, 1 = manual)
[,10]  gear  Number of forward gears
[,11]  carb  Number of carburetors

Note
Henderson and Velleman (1981) comment in a footnote to Table 1: 'Hocking
[original transcriber]'s noncrucial coding of the Mazda's rotary engine as a straight
six-cylinder engine and the Porsche's flat engine as a V engine, as well as the
inclusion of the diesel Mercedes 240D, have been retained to enable direct
comparisons to be made with previous analyses.'

Source
Henderson and Velleman (1981), Building multiple regression models
interactively. Biometrics, 37, 391-411.

Examples
require(graphics)
pairs(mtcars, main = "mtcars data", gap = 1/4)
coplot(mpg ~ disp | as.factor(cyl), data = mtcars,
       panel = panel.smooth, rows = 1)
## possibly more meaningful, e.g., for summary() or bivariate plots:
mtcars2 <- within(mtcars, {
   vs <- factor(vs, labels = c("V", "S"))
   am <- factor(am, labels = c("automatic", "manual"))
   cyl  <- ordered(cyl)
   gear <- ordered(gear)
   carb <- ordered(carb)
})
summary(mtcars2)
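The help page describes the format in words. As a quick sketch, str() from base R gives a compact overview of the same information directly in the console, and a short check confirms that all variables are numeric. The name all_numeric is illustrative only.

```r
# ?mtcars is shorthand for help("mtcars"); both open the same help page.
# str() prints a compact overview: number of observations, variables and types.
str(mtcars)

# Check the "Format" statement: every one of the 11 variables is numeric.
all_numeric <- all(sapply(mtcars, is.numeric))
print(all_numeric)
```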

Get Information
Use the dim() function to find the dimensions of the data set, and
the names() function to view the names of the variables:

Example
# Store the mtcars data set in a variable for better organization
Data_Cars <- mtcars

# Use dim() to find the dimensions of the data set
dim(Data_Cars)

# Use names() to find the names of the variables in the data set
names(Data_Cars)

Result:

[1] 32 11
 [1] "mpg"  "cyl"  "disp" "hp"   "drat" "wt"   "qsec" "vs"   "am"   "gear"
[11] "carb"
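The two numbers returned by dim() can also be read separately. The following sketch uses nrow(), ncol() and colnames() from base R; the names n_obs, n_vars and same_names are only illustrative.

```r
Data_Cars <- mtcars

n_obs  <- nrow(Data_Cars)   # number of observations (rows)
n_vars <- ncol(Data_Cars)   # number of variables (columns)

# dim() returns both numbers at once, rows first
print(c(n_obs, n_vars))

# For a data frame, colnames() gives the same result as names()
same_names <- identical(colnames(Data_Cars), names(Data_Cars))
print(same_names)
```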

Use the rownames() function to get the name of each row, which is the name of each car.
Row names are labels attached to the data frame and are not one of its 11 variables:

Example
Data_Cars <- mtcars

rownames(Data_Cars)

Result:

 [1] "Mazda RX4"           "Mazda RX4 Wag"       "Datsun 710"
 [4] "Hornet 4 Drive"      "Hornet Sportabout"   "Valiant"
 [7] "Duster 360"          "Merc 240D"           "Merc 230"
[10] "Merc 280"            "Merc 280C"           "Merc 450SE"
[13] "Merc 450SL"          "Merc 450SLC"         "Cadillac Fleetwood"
[16] "Lincoln Continental" "Chrysler Imperial"   "Fiat 128"
[19] "Honda Civic"         "Toyota Corolla"      "Toyota Corona"
[22] "Dodge Challenger"    "AMC Javelin"         "Camaro Z28"
[25] "Pontiac Firebird"    "Fiat X1-9"           "Porsche 914-2"
[28] "Lotus Europa"        "Ford Pantera L"      "Ferrari Dino"
[31] "Maserati Bora"       "Volvo 142E"
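Row names are also useful for looking up a specific car. The sketch below shows one way to do this with bracket indexing and grep(); the variable names fiat, fiat_mpg and merc_names are only illustrative.

```r
Data_Cars <- mtcars

# Select one row by its row name (all 11 variables for that car)
fiat <- Data_Cars["Fiat 128", ]
print(fiat)

# Select a single value by row name and variable name
fiat_mpg <- Data_Cars["Fiat 128", "mpg"]
print(fiat_mpg)

# Find the cars whose name starts with "Merc"
merc_names <- grep("^Merc", rownames(Data_Cars), value = TRUE)
print(merc_names)
```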

From the examples above, we have found that the data set
has 32 observations (Mazda RX4, Mazda RX4 Wag, Datsun 710, etc.)
and 11 variables (mpg, cyl, disp, etc.).

A variable is defined as something that can be measured or counted.

Here is a brief explanation of the variables from the mtcars data set:

Variable Name   Description
mpg             Miles/(US) gallon
cyl             Number of cylinders
disp            Displacement (cu.in.)
hp              Gross horsepower
drat            Rear axle ratio
wt              Weight (1000 lbs)
qsec            1/4 mile time
vs              Engine (0 = V-shaped, 1 = straight)
am              Transmission (0 = automatic, 1 = manual)
gear            Number of forward gears
carb            Number of carburetors
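Variables such as am are stored as 0/1 codes, which are hard to read in output. As a minimal sketch, the codes can be turned into labels with factor() and then counted with table(). The names transmission and transmission_counts are illustrative, and the labels follow the variable descriptions given for the data set.

```r
Data_Cars <- mtcars

# Turn the 0/1 code of am into a labelled factor
transmission <- factor(Data_Cars$am, levels = c(0, 1),
                       labels = c("automatic", "manual"))

# Count the cars per transmission type
transmission_counts <- table(transmission)
print(transmission_counts)
```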

Print Variable Values

If you want to print all values that belong to a variable, use the $ operator on the data
frame, followed by the name of the variable, for example cyl (cylinders):
Example
Data_Cars <- mtcars

Data_Cars$cyl

Result:

[1] 6 6 4 6 8 6 8 4 4 6 6 8 8 8 8 8 8 4 4 4 4 8 8 8 8 4 4 4 8 6 8 4
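The $ operator is not the only way to reach a variable. The following sketch compares it with two bracket forms from base R that return the same vector; the variable names are only illustrative.

```r
Data_Cars <- mtcars

cyl_dollar <- Data_Cars$cyl        # $ operator
cyl_double <- Data_Cars[["cyl"]]   # double brackets with the name as a string
cyl_matrix <- Data_Cars[, "cyl"]   # matrix-style indexing: all rows, column "cyl"

# All three return the same numeric vector of 32 values
print(identical(cyl_dollar, cyl_double) && identical(cyl_dollar, cyl_matrix))

# Single brackets with only a name keep the data frame structure instead
cyl_frame <- Data_Cars["cyl"]
print(class(cyl_frame))
```

The double-bracket form is handy when the variable name is itself stored in another variable, which the $ operator cannot handle.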

Sort Variable Values

To sort the values, use the sort() function:

Example
Data_Cars <- mtcars

sort(Data_Cars$cyl)

Result:

[1] 4 4 4 4 4 4 4 4 4 4 4 6 6 6 6 6 6 6 8 8 8 8 8 8 8 8 8 8 8 8 8 8
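sort() reorders a single vector only. As a sketch of two common variations, the code below sorts in decreasing order and uses order() to reorder whole rows of the data frame by one variable. The names cyl_desc and by_mpg are illustrative.

```r
Data_Cars <- mtcars

# Sort from largest to smallest
cyl_desc <- sort(Data_Cars$cyl, decreasing = TRUE)
print(cyl_desc)

# order() returns the row positions in sorted order,
# which can be used to reorder the complete data frame.
by_mpg <- Data_Cars[order(Data_Cars$mpg), ]
print(head(by_mpg[, c("mpg", "cyl")], n = 3))
```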

From the examples above, we see that most cars have either 4 or 8 cylinders.
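Rather than reading the counts off the sorted output, they can be computed directly. This is a minimal sketch using table() and prop.table() from base R; cyl_counts and cyl_share are illustrative names.

```r
Data_Cars <- mtcars

# Count how many cars have each number of cylinders
cyl_counts <- table(Data_Cars$cyl)
print(cyl_counts)

# The same counts as shares of all 32 cars
cyl_share <- prop.table(cyl_counts)
print(round(cyl_share, 2))
```

For this data set the counts are 11 cars with 4 cylinders, 7 with 6 and 14 with 8.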

Analyzing the Data

Now that we have some information about the data set, we can start to analyze it
with some statistical numbers.

For example, we can use the summary() function to get a statistical summary of the
data:

Example
Data_Cars <- mtcars

summary(Data_Cars)

The summary() function returns six statistical numbers for each variable:
• Min
• First quartile (25th percentile)
• Median
• Mean
• Third quartile (75th percentile)
• Max

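The six numbers that summary() reports can also be computed for a single variable with individual base R functions. The sketch below does this for mpg; the variable names (mpg_summary, mpg_q1 and so on) are only illustrative.

```r
Data_Cars <- mtcars

# Summary of one variable: the six statistics with their labels
mpg_summary <- summary(Data_Cars$mpg)
print(mpg_summary)

# The same numbers from individual functions
mpg_min    <- min(Data_Cars$mpg)
mpg_q1     <- quantile(Data_Cars$mpg, 0.25)   # first quartile
mpg_median <- median(Data_Cars$mpg)
mpg_mean   <- mean(Data_Cars$mpg)
mpg_q3     <- quantile(Data_Cars$mpg, 0.75)   # third quartile
mpg_max    <- max(Data_Cars$mpg)

print(c(min = mpg_min, median = mpg_median, mean = mpg_mean, max = mpg_max))
```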
