R Guru Cheat Sheet

Uploaded by

biheg28817

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views2 pages

R Guru Cheat Sheet

Uploaded by

biheg28817

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

R-Guru.

com Cheat Sheet for Practical R Tasks df2 <- df1[df1$vr1 == 'male', ] # row, column reference, subset, all vars
print(df1[df1$vr1 == 'male', c('vr1', 'vr2')]) # print vr1 & vr2 for males
mydata$vr2 <- ifelse(vr1 >= 4, 1, 0) # derive variable by condition
This guide contains basic best practice examples for creating and updating
tibble data frames from vectors of the same length or number of records.
Piping and multiple conditional processing to derive variables
Examples show common R tasks for importing data, creating data frames,
df2 <- df1 %>% # copy df1 data frame to df2
direct variable referencing, piping, conditional and group processing, sql
mutate(vr2 = case_when( # derive vr2 based on condition
components, character and date operations, variable type conversions,
vr1 > 20 ~ "label 1", # ifelse() is two option alternative
transposing data frames, joining data frames, appending data frames,
vr1 > 10 ~"label 2", # ifelse(vr1 < 51, "50 or Below", "Above 50")
deriving summary variables, and creating graphs and output files. When
TRUE ~ "label 3" ) ) # otherwise last case for label 3
possible, base R supplied sample data frames are used in examples.
SQL components (DPLYR) for selecting, filtering, mutating, and arranging
Mutate() has five features: case_when(), simple expression, summary
myquery <- ToothGrowth %>% # 1. source df
functions, rowwise(), and group_by()/ungroup() with summary functions.
select(len, supp, dose) %>% # 2. select variables, - drop
Data utility functions describe and view data frames: View(df), str(df),
filter(tolower(supp) == 'vc') %>% # 3. subset records, & (and), | (or), ! (not)
summary(df), table(vr), print(df, n=), head(df), tail(df), row_number(), nrow(),
# filter(year %in% c(2010, 2011)) to subset multiple values
ncol() and ls(). Tidyverse, DPLYR, STRINGR, READXL, HAVEN, LUBRIDATE and
mutate(dose2 = (dose*2)) %>% # 4. derive variables w/ simple expressions
GGPLOT2 packages are required. df#-data frame names, vr# – variable names.
arrange(supp, dose) # 5. sort records, desc()
Character or numeric variables depend on the function and values.
Character String Operations to combine, remove, subset, and substring
Import data into data frames: Data frames, CSV, Excel and SAS Datasets
vr3 <- str_c(vr1, sep="-", vr2) # combine two variables as vr1-vr2
install.packages('tidyverse') # install package
vr2 <- str_replace_all(vr1 , "Street" , "St" ) # replace ‘Street’ with ‘St’
library(tidyverse) # load popular data management package
vr2 <- str_trim(vr1, side='both') # remove blanks from left and right sides
readRDS("df.RDS") # read R data frame
vr2 <- str_extract(vr1 , "\\d*" ) # from char vr1, extract all digitss
read.csv("C:/mydata/my_csv.csv") # read csv, forward ‘/’
filter(str_detect( vr1 , "Health")) # subset records by finding text
read_excel("C:/mydata/my_excel.xlsx") # read excel, missing ‘NA’
vr2 <- str_sub(vr1 , 3 , 6 ) # substring vr1 text from 3rd to 6th position
sdtm <- “c:/my_sdtm” # create full path reference
read_sas(file.path(sdtm, “adsl.sas7bdat”)) # read dataset, missing ., ‘’
Variable Type Conversion to switch between Numeric & Character Variables
vr2 <- as.character(vr1) # number in numeric variable to character variable
Environmental Setup and Workspace
vr2 <- as.numeric(vr1) # number in character variable to numeric variable
ls() # list all objects
remove(list=ls()) # remove all objects
Date Operations: Assignment, Periods, Durations, Intervals, and Formats
# names(adsl)= tolower(names(adsl)) # lower case all variable names
Durations - # of seconds, Periods - # of days, weeks, months and years,
Intervals - duration between start and end points
Create data frames by combining variables
vr1 <- as.Date("2021-01-25") # assign date in yyyy-mm-dd format
df <- data.frame(vr1, vr2, vr3) # variable order
format(date, format="%m/%d/%y") # format as mm/dd/yy
+ ddays(1), + dweeks(1), + dmonths(1), + dyears(1) # add 1 dy, wk, mth or yr
Derive numeric and character constants to data frame
interval(dtvr1, dtvr2) %/% months(1) # of months between dates
df2 <- cbind(df1, vr1=1, vr2='Drug A') # to df1, add vr1 and vr2
# dates are stored as # of days since 1970
Direct variable reference to select variables and filter records
Transpose data frames to switch between long (records) & wide (variables)
df2 <- df1[c('vr1', 'vr2')] # combine selected variables by name
Long (records) to wide (variables) structure mutate(vr3 = mean(vr2, .1)) %>% # 4. derive mean vr2 with rounding
df2 <- df1 %>% # vr1 contains new variable name values ungroup() # 5. ungroup to add back all original variables with vr3
pivot_wider(names_from = vr1, values_from = vr2) # vr2 contains numbers
Left join data frames to add derived variables
Wide (variables) to long (records) structure df3 <- left_join(df1, df2, by='vr1') # join by the same by variables
df2 <- df1 %>% # all other variables in df1 are group by variables df3 <- left_join(df1, df2, by= c('vr1' = 'vr2')) # join by different by variables
pivot_longer(c("vr1", "vr2")) # list all variables to be transposed # other joins: right_join(), inner_join(), full_join()
df3 <- crossing(df1, df2) # many-to-many join without by variables
Group processing to derive summary variables
• Summary variables Subquery condition in df1 to filter df2 records
• First and Last Group By variables df3 <- df2 %>% # 5. final df3 data frame
• Descriptive Statistics filter(vr1 %in% ( # 4. filter vr1 values in df2 data frame
df1 %>% # 1. lookup df1 data frame
Derive summary variables filter(vr2 == 'male') %>% # 2. filter vr2=males in df1
mtcars_cyl_summary <- mtcars %>% # final and source data frames pull(vr1) %>% unique)) # 3. unique vr1 values for df2 filter
group_by(cyl) %>% # without is overall, ignore NA
summarize(mean_mpg = mean(mpg, na.rm = TRUE)) Append data frame records to end of first data frame records
df3 <- bind_rows(df1, df2) # append data frames with uneven variables
Derive First and Last group by variables df4 <- rbind(df1, df2, df3) # append two or more even variables data frames
first_mpg <- mtcars %>%
group_by( mpg ) %>% # group by mpg Graphs: Scatterplots, Lines, Boxplots, Bars and Histograms
slice(1) # flag first group by records, distinct() for unique records ggplot(df # data frame name
slice(n()) # flag last group by records , aes(x = vr1, y = vr2 # vr1 for x and vr2 for y axis variables
lead(), lag() # next and previous record values , fill= , color= , col=, size=) # valid options with valid values
# one or more required options below, defaults unless options are specified
Derive Descriptive Statistics Variable, Left Join to Add to Data Frame + geom_point() # scatterplot two quantitative variables
- Across one variable + geom_line # trend lines over time
vr1=c(2, 4, 6) # combine 3 values into one variable + geom_boxplot() # boxplot one continuous and one categorical variable
vr2 <- min(vr1) # one value, max(), sum() + geom_bar() # bar of variables, options: stat=
+ geom_histogram() # histogram of x-axis for counts
- Across variables using rowMins(), rowMax(), rowMeans() + geom_smooth(method=’lm’, formula=y~x, se=F) # smooth option
df2 <- subset(df1, select=c(vr1, vr2) ) # select variables vr1 and vr2 # one or more format options below, defaults unless options are specified
df3$vr1 <- rowMeans(df2, na.rm=TRUE ) # derive mean of all df2 vars + theme(), + ggtitle(), + xlab(), + ylab()

- Across variables using rowwise() with min(), max() Output files: Data frames, Text, Excel and SAS Datasets
df2 <- df1 %>% # derive min, max variables of vr1 and vr2 setwd("C:/mydataframes") # change default working folder
rowwise() %>% mutate(vr3= min(vr1, vr2), vr4= max(vr1, vr2)) getwd() # confirm working folder
saveRDS(df, file = "df.RDS") # save as permanent data frame
- Across records using mutate(), min(), max(), mean(), sum(), percent() write.table(df,"C:/myoutput/mydm.txt", sep="\t") # save as text file
df2 <- df1 %>% # 1. source df write_sas(df, "df.sas7bdat") # save as SAS dataset
filter(vr1 != '.') %>% # 2. subset non-missing records Created by Sunil Gupta, Gupta Programming, Copyright © 2023, Practical R Programming, R-Guru.com
group_by(vr1) %>% # 3. group by vr1, else by overall

R File Code
No ratings yet
R File Code
16 pages
RSTUDIO
No ratings yet
RSTUDIO
44 pages
R Topicscovered
No ratings yet
R Topicscovered
22 pages
Matrix, Dataframes, List
No ratings yet
Matrix, Dataframes, List
8 pages
FDP Indoglobal Group of Colleges: 27 April To 1 May R Programming Language Assignment Submission
No ratings yet
FDP Indoglobal Group of Colleges: 27 April To 1 May R Programming Language Assignment Submission
12 pages
MBA Sem 1 Unit 3 Fundamentals of R
No ratings yet
MBA Sem 1 Unit 3 Fundamentals of R
41 pages
Essential R Commands Guide
No ratings yet
Essential R Commands Guide
11 pages
R Functions
No ratings yet
R Functions
8 pages
Analysis Using Statistical: Introduction & Data Exploration
No ratings yet
Analysis Using Statistical: Introduction & Data Exploration
23 pages
R Programming Cont..
No ratings yet
R Programming Cont..
24 pages
Working with Data Frames in R
No ratings yet
Working with Data Frames in R
8 pages
DSCI 100 Cheat Sheet
No ratings yet
DSCI 100 Cheat Sheet
3 pages
Basic R Dplyr Session 4 Demonstration
No ratings yet
Basic R Dplyr Session 4 Demonstration
18 pages
All Codes
No ratings yet
All Codes
10 pages
R Study Material I
No ratings yet
R Study Material I
8 pages
Data Science Practical Completion Report
No ratings yet
Data Science Practical Completion Report
31 pages
Lab 02 - Compound Data Structures
No ratings yet
Lab 02 - Compound Data Structures
12 pages
DMPA Codes
No ratings yet
DMPA Codes
16 pages
R Lecture 2-1
No ratings yet
R Lecture 2-1
28 pages
R Programming Cheat Sheet
No ratings yet
R Programming Cheat Sheet
7 pages
R Code
No ratings yet
R Code
9 pages
8 R Basics 3
No ratings yet
8 R Basics 3
27 pages
R-Basics Knit
No ratings yet
R-Basics Knit
13 pages
R Program
No ratings yet
R Program
22 pages
X - 15 x-1 2. Print ('Hello Word!') ## (1) "Hello Word!" 3. X - 4 y - 5 Z - X+y Print (Z) 4. X - 4 y - 5 Cat ('The Sum of X and y Is', X+y)
No ratings yet
X - 15 x-1 2. Print ('Hello Word!') ## (1) "Hello Word!" 3. X - 4 y - 5 Z - X+y Print (Z) 4. X - 4 y - 5 Cat ('The Sum of X and y Is', X+y)
15 pages
Basics: TH TH TH TH TH TH TH
No ratings yet
Basics: TH TH TH TH TH TH TH
3 pages
MDPN460 Lecture05
No ratings yet
MDPN460 Lecture05
32 pages
Tidy Data Techniques in R
No ratings yet
Tidy Data Techniques in R
17 pages
R Basic and Advanced
No ratings yet
R Basic and Advanced
9 pages
Data Tidying With Tidyr::: Cheat Sheet
No ratings yet
Data Tidying With Tidyr::: Cheat Sheet
2 pages
Introduction to R for Statistics
No ratings yet
Introduction to R for Statistics
56 pages
R-Lab p-4,2,1
No ratings yet
R-Lab p-4,2,1
12 pages
Experiment 5
No ratings yet
Experiment 5
13 pages
RStudio Tips and Common Functions Guide
No ratings yet
RStudio Tips and Common Functions Guide
7 pages
Ma 3
No ratings yet
Ma 3
32 pages
Advanced R Programming Tidyverse Packages Notes
No ratings yet
Advanced R Programming Tidyverse Packages Notes
12 pages
R Reference Card
No ratings yet
R Reference Card
1 page
Lecture 5 (Managing and Understanding Data)
No ratings yet
Lecture 5 (Managing and Understanding Data)
9 pages
Lab Week2-3
No ratings yet
Lab Week2-3
26 pages
R Programming Cheat Sheet
No ratings yet
R Programming Cheat Sheet
15 pages
Advanced R Data Analysis Training PDF
No ratings yet
Advanced R Data Analysis Training PDF
72 pages
Data Analytic R
No ratings yet
Data Analytic R
28 pages
Fda SSIGNMENT 02
No ratings yet
Fda SSIGNMENT 02
13 pages
Importing The Files
No ratings yet
Importing The Files
14 pages
Exploratory Data Analysis and Visualization
No ratings yet
Exploratory Data Analysis and Visualization
10 pages
DP Unit1 Notes
No ratings yet
DP Unit1 Notes
18 pages
02-Data Gathering and Preparation
No ratings yet
02-Data Gathering and Preparation
54 pages
Lab1 411 Eman Yahya 7773225
No ratings yet
Lab1 411 Eman Yahya 7773225
16 pages
R Cheatsheet Base R
No ratings yet
R Cheatsheet Base R
2 pages
MIT 302 - Statistical Computing II - Tutorial 02
No ratings yet
MIT 302 - Statistical Computing II - Tutorial 02
5 pages
A Short List of The Most Useful R Commands
No ratings yet
A Short List of The Most Useful R Commands
8 pages
R Data Analysis and Manipulation Tasks
No ratings yet
R Data Analysis and Manipulation Tasks
21 pages
DR - Pierpaolo-Delser - Introduction R
No ratings yet
DR - Pierpaolo-Delser - Introduction R
83 pages
R Programming Materials
No ratings yet
R Programming Materials
51 pages
R Examples
No ratings yet
R Examples
56 pages
A Short List of Some Useful R Commands: Input and Display
No ratings yet
A Short List of Some Useful R Commands: Input and Display
2 pages
UL2
No ratings yet
UL2
2 pages
R Commands
No ratings yet
R Commands
18 pages
150pro User Manual
No ratings yet
150pro User Manual
20 pages
Practical Application of AI
No ratings yet
Practical Application of AI
4 pages
Grade - 9 (Edexcel) Mathematics
No ratings yet
Grade - 9 (Edexcel) Mathematics
12 pages
Monte Carlo Risk Analysis Model
No ratings yet
Monte Carlo Risk Analysis Model
3 pages
Ansi-Asme B1.20.3-1976 - NPTF
No ratings yet
Ansi-Asme B1.20.3-1976 - NPTF
30 pages
Project Management: Dr. S. Vijaya Bhaskar
No ratings yet
Project Management: Dr. S. Vijaya Bhaskar
41 pages
CV#7 SIFT Scale Invariant Feature Transform
No ratings yet
CV#7 SIFT Scale Invariant Feature Transform
70 pages
Coding Decoding Questions & Answers
No ratings yet
Coding Decoding Questions & Answers
2 pages
Lagrange Interpolation Guide
No ratings yet
Lagrange Interpolation Guide
7 pages
Silo - Tips - NDMP Configuration Guide For Ibm Tivoli Storage Manager
No ratings yet
Silo - Tips - NDMP Configuration Guide For Ibm Tivoli Storage Manager
23 pages
Evolution of Computers Overview
No ratings yet
Evolution of Computers Overview
20 pages
Ethnotech - Data Science With Python
No ratings yet
Ethnotech - Data Science With Python
480 pages
Mobile Application Lab Question Paper
No ratings yet
Mobile Application Lab Question Paper
3 pages
openSAP Sac3 Week 1 Exercise1
No ratings yet
openSAP Sac3 Week 1 Exercise1
30 pages
Exam Prep Data Communications and Networking With TCPIP Protocol Suite 6th Edition Behrouz A Forouzan HQ File Comprehensive
No ratings yet
Exam Prep Data Communications and Networking With TCPIP Protocol Suite 6th Edition Behrouz A Forouzan HQ File Comprehensive
326 pages
ECE 545: Digital Design with VHDL
No ratings yet
ECE 545: Digital Design with VHDL
77 pages
Overview and Characteristics of Java
No ratings yet
Overview and Characteristics of Java
4 pages
DT8032 - 8 Channel 500 V - 10 Ma Desktop Power Supply Module (USB - Ethernet - Touchscreen) - CAEN - Tools For Discovery
No ratings yet
DT8032 - 8 Channel 500 V - 10 Ma Desktop Power Supply Module (USB - Ethernet - Touchscreen) - CAEN - Tools For Discovery
3 pages
Online Art Gallery Project Report
43% (14)
Online Art Gallery Project Report
25 pages
Lastexception 63728404490
No ratings yet
Lastexception 63728404490
22 pages
0130 Ce
No ratings yet
0130 Ce
7 pages
Simplified STEEL DESIGN Besavilla PDF
100% (1)
Simplified STEEL DESIGN Besavilla PDF
473 pages
42U Racks Specification
No ratings yet
42U Racks Specification
3 pages
S3900 Series Switches IGMP-SNOOPING Configuration
No ratings yet
S3900 Series Switches IGMP-SNOOPING Configuration
16 pages
Literature Review On Online Ticket Booking System
100% (1)
Literature Review On Online Ticket Booking System
8 pages
Video Surveillance and Access Control System: XXXX-XXXXXX-XXX
No ratings yet
Video Surveillance and Access Control System: XXXX-XXXXXX-XXX
44 pages
Advanced Computer Networks Exam
No ratings yet
Advanced Computer Networks Exam
4 pages
Case Study 2
No ratings yet
Case Study 2
6 pages
BeyondInsight and Password Safe API Guide
No ratings yet
BeyondInsight and Password Safe API Guide
140 pages
DMTA 10010 01EN - Rev - A 38DL - PLUS Getting - Started - EN
No ratings yet
DMTA 10010 01EN - Rev - A 38DL - PLUS Getting - Started - EN
2 pages

R Guru Cheat Sheet

Uploaded by

R Guru Cheat Sheet

Uploaded by

R-Guru.

You might also like