0% found this document useful (0 votes)

48 views3 pages

R Programming: Descriptive Stats Guide

Uploaded by

RAHUL SHARMA

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

48 views3 pages

R Programming: Descriptive Stats Guide

Uploaded by

RAHUL SHARMA

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Faculty of : FCE Program: B.Tech Class/Section: Sem V, Sec.

A,B,C(AIDS) Date:

Name of Faculty: Seema Kaloria Name of Course: R Programming Code: BADCCE5104

Descriptive Stastics:
Descriptive statistics in R provides a way to summarize and describe the main features of a dataset. These
statistics give insight into the central tendency, dispersion, and shape of a dataset’s distribution, which are
essential for understanding and interpreting data.

R provides several built-in functions and libraries for calculating descriptive statistics, including measures like
mean, median, standard deviation, variance, and more.
1. Basic Descriptive Statistics Functions in R

Here are some common functions for basic descriptive statistics in R:

a) Mean (mean())

The mean is the average of the data.

data <- c(1, 2, 3, 4, 5, 6, 7, 8, 9)

mean(data)
b) Median (median())

The median is the middle value of the dataset.

median(data)
c) Mode

R does not have a built-in function for mode, but it can be calculated as follows:

mode <- function(x) {

uniq <- unique(x)
uniq[which.max(tabulate(match(x, uniq)))]
}
mode(data)
d) Standard Deviation (sd())

The standard deviation measures how spread out the numbers in the dataset are.

sd(data)
e) Variance (var())

Variance is the square of the standard deviation and measures how much the data points deviate from the
mean.

var(data)

Session 2024-25
f) Range (range())

The range returns the minimum and maximum values of the dataset.

range(data)
g) Minimum and Maximum (min() and max())

To find the minimum and maximum values individually:

min(data)
max(data)
h) Quantiles (quantile())

Quantiles help you understand the distribution of the data. You can specify which quantiles to calculate.

quantile(data, probs = c(0.25, 0.5, 0.75)) # 25th, 50th (median), and 75th percentiles
i) Interquartile Range (IQR())

The interquartile range is the difference between the 75th and 25th percentiles and gives the spread of the
middle 50% of the data.

IQR(data)
j) Summary Statistics (summary())

R provides a built-in summary() function that gives a quick overview of the dataset.

summary(data)

This function provides:

 Minimum
 1st Quartile (25%)
 Median (50%)
 Mean
 3rd Quartile (75%)
 Maximum

2. Descriptive Statistics for Data Frames

If you are working with a data frame, you can apply the above functions to each column or use some of R's
packages to obtain more detailed descriptive statistics.

# Example data frame

df <- data.frame(
age = c(23, 45, 31, 35, 28),
weight = c(70, 80, 60, 75, 68),
height = c(165, 170, 158, 172, 160)
)

# Summary of each column in the data frame

summary(df)

3. Using Libraries for More Detailed Descriptive Statistics

Session 2024-25
a) psych package

The psych package provides a variety of functions for descriptive statistics. To install and load it:

install.packages("psych")
library(psych)

# Descriptive statistics for each column in a data frame

describe(df)

This function provides:

 Mean
 Standard deviation
 Median
 Minimum
 Maximum
 Skewness
 Kurtosis

b) Hmisc package

The Hmisc package also provides functions for more detailed statistical summaries.

install.packages("Hmisc")
library(Hmisc)

# Summary statistics
describe(df)
c) summarytools package

For more flexible and comprehensive summaries, the summarytools package is useful.

install.packages("summarytools")
library(summarytools)

# Descriptive statistics for a data frame

dfSummary(df)
4. Visualizing Descriptive Statistics:To complement descriptive statistics, visualizing the data can help
identify patterns or outliers. Here are some commonly used plots:
a) Histogram (hist())

A histogram is useful to visualize the distribution of data.

hist(df$age, main = "Age Distribution", xlab = "Age", col = "lightblue")

b) Boxplot (boxplot())

A boxplot shows the distribution of the data along with the median, quartiles, and potential outliers.

boxplot(df$weight, main = "Boxplot of Weight", ylab = "Weight")

Session 2024-25

Descriptive Analysis in R Programming - GeeksforGeeks-1-12
No ratings yet
Descriptive Analysis in R Programming - GeeksforGeeks-1-12
12 pages
Unit3 R
No ratings yet
Unit3 R
19 pages
Unit3 R
No ratings yet
Unit3 R
30 pages
Lab Manual
No ratings yet
Lab Manual
46 pages
R Programming for BCA Students
No ratings yet
R Programming for BCA Students
40 pages
Weather Derivatives in India
No ratings yet
Weather Derivatives in India
25 pages
India Launches Exchange-Traded Weather Derivatives
No ratings yet
India Launches Exchange-Traded Weather Derivatives
5 pages
R Lang-Unit-01
100% (1)
R Lang-Unit-01
50 pages
R Language
No ratings yet
R Language
59 pages
R Programming Lab
No ratings yet
R Programming Lab
26 pages
R Programming Lab Manual
No ratings yet
R Programming Lab Manual
73 pages
R Data Visualization Techniques
No ratings yet
R Data Visualization Techniques
21 pages
R Programming Unit 1
No ratings yet
R Programming Unit 1
83 pages
R Programming
No ratings yet
R Programming
28 pages
Financial Analytics & Time Series
No ratings yet
Financial Analytics & Time Series
17 pages
R Programming Lab
100% (1)
R Programming Lab
46 pages
Modelling in R
No ratings yet
Modelling in R
47 pages
Lecture Notes
100% (1)
Lecture Notes
82 pages
R Programming
No ratings yet
R Programming
11 pages
Unit 4
No ratings yet
Unit 4
105 pages
BDA Unit 2
No ratings yet
BDA Unit 2
31 pages
R Workshop
No ratings yet
R Workshop
47 pages
R Programming Unit 2
No ratings yet
R Programming Unit 2
46 pages
Statistical Computing and R Programming
No ratings yet
Statistical Computing and R Programming
2 pages
Bca-Iv Sem Dar Imp Questions
100% (1)
Bca-Iv Sem Dar Imp Questions
1 page
Pranav R Programming Lab File
No ratings yet
Pranav R Programming Lab File
41 pages
TDS 2025 Jan GA1 - Development Tools
No ratings yet
TDS 2025 Jan GA1 - Development Tools
28 pages
Assignment I Data Analytics
No ratings yet
Assignment I Data Analytics
3 pages
SAS Presentation
No ratings yet
SAS Presentation
49 pages
Data Analytics Using R
No ratings yet
Data Analytics Using R
37 pages
Unit - 1 Notes R Programming
No ratings yet
Unit - 1 Notes R Programming
52 pages
Final-BCA V and VI Sem Syllabus
No ratings yet
Final-BCA V and VI Sem Syllabus
25 pages
R Programming Lab Manual
No ratings yet
R Programming Lab Manual
35 pages
Unit-1-Ppt Ada
No ratings yet
Unit-1-Ppt Ada
78 pages
R Lnaguager
No ratings yet
R Lnaguager
38 pages
R Programming Course Notes
No ratings yet
R Programming Course Notes
28 pages
KMBN It01 - Unit 4
No ratings yet
KMBN It01 - Unit 4
19 pages
Data Analytics Lab Manual Using R Programming
No ratings yet
Data Analytics Lab Manual Using R Programming
27 pages
An R Tutorial Starting Out
No ratings yet
An R Tutorial Starting Out
9 pages
R Module 2
No ratings yet
R Module 2
30 pages
All Unit R - Programming Notes PDF
No ratings yet
All Unit R - Programming Notes PDF
736 pages
Measure of Central Tendency Practical
No ratings yet
Measure of Central Tendency Practical
7 pages
Stats With R
No ratings yet
Stats With R
103 pages
DSRS BR
No ratings yet
DSRS BR
25 pages
Chp4 Advance Analytics-KMeans
No ratings yet
Chp4 Advance Analytics-KMeans
40 pages
CPL Practical 1
No ratings yet
CPL Practical 1
14 pages
Data Visualization Using Matplotlib in Python
No ratings yet
Data Visualization Using Matplotlib in Python
15 pages
R22-Ids-Question Bank
No ratings yet
R22-Ids-Question Bank
4 pages
Introduction To R Programming
No ratings yet
Introduction To R Programming
14 pages
R Programming 1-5
No ratings yet
R Programming 1-5
13 pages
R Programming in Data Science
No ratings yet
R Programming in Data Science
23 pages
R Vectors and Lists Guide
No ratings yet
R Vectors and Lists Guide
12 pages
1 3 ST-explore
No ratings yet
1 3 ST-explore
55 pages
Unit 4
No ratings yet
Unit 4
35 pages
TEB2043 Introduction To Data Science: Descriptive Analytics & Visualization DR Shuhaida Mohamed Shuhidan JAN 2025
No ratings yet
TEB2043 Introduction To Data Science: Descriptive Analytics & Visualization DR Shuhaida Mohamed Shuhidan JAN 2025
29 pages
MAT125 Note Packet F23
No ratings yet
MAT125 Note Packet F23
66 pages
Unit 3
No ratings yet
Unit 3
11 pages
Basic Descriptive Statistics Using R
No ratings yet
Basic Descriptive Statistics Using R
4 pages
Module 5-6
No ratings yet
Module 5-6
12 pages
Nummerical Summaries
No ratings yet
Nummerical Summaries
11 pages
Amazon Interview Questions - Zero To Mastery Academy
No ratings yet
Amazon Interview Questions - Zero To Mastery Academy
1 page
Rahul Sharma: High Impact Presentations
No ratings yet
Rahul Sharma: High Impact Presentations
1 page
Rahul Sharma: Openai Generative Pre-Trained Transformer 3 (Gpt-3) For Developers
No ratings yet
Rahul Sharma: Openai Generative Pre-Trained Transformer 3 (Gpt-3) For Developers
1 page
Research Paper by Rahul Sharma
No ratings yet
Research Paper by Rahul Sharma
15 pages
The Education of A Value Investor PDF
No ratings yet
The Education of A Value Investor PDF
160 pages
Jul-Oct 24 SoH NPTEL Week 7 Assignment
100% (1)
Jul-Oct 24 SoH NPTEL Week 7 Assignment
6 pages
EP325 BrainfluencePodcastTranscript
No ratings yet
EP325 BrainfluencePodcastTranscript
24 pages
Jul-Oct 24 SoH NPTEL Week 6 Assignment
100% (1)
Jul-Oct 24 SoH NPTEL Week 6 Assignment
6 pages
Top 40 DAA Interview Questions and Answers
100% (1)
Top 40 DAA Interview Questions and Answers
2 pages
Top 40 DAA Interview Questions (2024) - Javatpoint
No ratings yet
Top 40 DAA Interview Questions (2024) - Javatpoint
2 pages
Data Visualization in Excel-2
No ratings yet
Data Visualization in Excel-2
1 page
Contigency R
No ratings yet
Contigency R
7 pages
Aman Sharma CV
No ratings yet
Aman Sharma CV
3 pages
Data Warehousing and Data Mining Techniques For Cyber Security 1st Edition by Anoop Singhal 0387476539 9780387476537
No ratings yet
Data Warehousing and Data Mining Techniques For Cyber Security 1st Edition by Anoop Singhal 0387476539 9780387476537
47 pages
Its132-Sa1 1
No ratings yet
Its132-Sa1 1
3 pages
DMDW-Unit II
No ratings yet
DMDW-Unit II
19 pages
Big Data Analytics Midterm Q&A
No ratings yet
Big Data Analytics Midterm Q&A
15 pages
Java AWT and Swing Overview
No ratings yet
Java AWT and Swing Overview
98 pages
Fyp Progress-1-1
No ratings yet
Fyp Progress-1-1
29 pages
Intermediate Java for Developers
No ratings yet
Intermediate Java for Developers
10 pages
RDL Reports Nuts N Bolts
No ratings yet
RDL Reports Nuts N Bolts
8 pages
Database Concurrency Techniques
No ratings yet
Database Concurrency Techniques
31 pages
Building Machine Learning Pipelines Automating Model Life Cycles With Tensorflow 1St Edition Hannes Hapke - Downloadable PDF 2025
No ratings yet
Building Machine Learning Pipelines Automating Model Life Cycles With Tensorflow 1St Edition Hannes Hapke - Downloadable PDF 2025
52 pages
Computer Science PRACTICAL FILE 2024 25 Initial Pages
No ratings yet
Computer Science PRACTICAL FILE 2024 25 Initial Pages
4 pages
Apache Hadoop and Spark:: and Use Cases For Data Analysis
No ratings yet
Apache Hadoop and Spark:: and Use Cases For Data Analysis
48 pages
Apex Triggers in Salesforce
No ratings yet
Apex Triggers in Salesforce
3 pages
BRISS Study Analysis
No ratings yet
BRISS Study Analysis
6 pages
SQLCS Order
No ratings yet
SQLCS Order
1 page
Cambridge IGCSE ™: Computer Science 0478/23
No ratings yet
Cambridge IGCSE ™: Computer Science 0478/23
14 pages
Guc 437 59 30794 2023-03-27T09 24 46
No ratings yet
Guc 437 59 30794 2023-03-27T09 24 46
54 pages
Ga7 8 0 Upgrade Guide
No ratings yet
Ga7 8 0 Upgrade Guide
46 pages
Probability and Statistic
No ratings yet
Probability and Statistic
24 pages
Imadmirali 944@
No ratings yet
Imadmirali 944@
3 pages
PG Program in Web Development
No ratings yet
PG Program in Web Development
11 pages
Number: 000-000 Passing Score: 800 Time Limit: 120 Min File Version: 1.0
0% (1)
Number: 000-000 Passing Score: 800 Time Limit: 120 Min File Version: 1.0
20 pages
iTNC 530 Programming Manual
No ratings yet
iTNC 530 Programming Manual
648 pages
Hierarchical Alv
No ratings yet
Hierarchical Alv
11 pages
HBD361 SupportingMaterial
No ratings yet
HBD361 SupportingMaterial
63 pages
CS8492 - Database Management Systems: II Year / IV Semester
No ratings yet
CS8492 - Database Management Systems: II Year / IV Semester
76 pages
How To Write A Literature Review Using Artificial Intelligence (AI) Tools
No ratings yet
How To Write A Literature Review Using Artificial Intelligence (AI) Tools
8 pages
Stats Tests for Thesis Students
No ratings yet
Stats Tests for Thesis Students
1 page
WWW Simplilearn Com Tutorials SQL Tutorial What Is Normalization in SQL
No ratings yet
WWW Simplilearn Com Tutorials SQL Tutorial What Is Normalization in SQL
9 pages

R Programming: Descriptive Stats Guide

Uploaded by

R Programming: Descriptive Stats Guide

Uploaded by

Faculty of : FCE Program: B.Tech Class/Section: Sem V, Sec.

Name of Faculty: Seema Kaloria Name of Course: R Programming Code: BADCCE5104

Here are some common functions for basic descriptive statistics in R:

The mean is the average of the data.

data <- c(1, 2, 3, 4, 5, 6, 7, 8, 9)

The median is the middle value of the dataset.

mode <- function(x) {

To find the minimum and maximum values individually:

This function provides:

2. Descriptive Statistics for Data Frames

# Example data frame

# Summary of each column in the data frame

3. Using Libraries for More Detailed Descriptive Statistics

# Descriptive statistics for each column in a data frame

This function provides:

# Descriptive statistics for a data frame

A histogram is useful to visualize the distribution of data.

hist(df$age, main = "Age Distribution", xlab = "Age", col = "lightblue")

boxplot(df$weight, main = "Boxplot of Weight", ylab = "Weight")

You might also like