0% found this document useful (0 votes)
40 views35 pages

Pplpresentation 211012192639

Uploaded by

Harish Garg
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
40 views35 pages

Pplpresentation 211012192639

Uploaded by

Harish Garg
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 35

Programming Language

History and Introduction

 R is a programming language and free software environment for statistical


computing and graphics supported by the R Foundation for Statistical Computing.

 R is widely used by statisticians, data analysts and researchers for developing


statistical software and data analysis.

 It compiles and runs on a wide variety of UNIX platforms, Windows and Mac OS.

 The copyright for the primary source code for R is held by the R Foundation and is
published under the GNU General Public License version 2.0.

2
History and Introduction
 R was created by Ross Ihaka and Robert Gentleman at the University of Auckland, New
Zealand.

 Currently R is developed & maintained by the R Development Core Team.

 The Applications of R programming language includes :


1.Statical Computing
2.Machine Learning
3.Data Science

 R can be downloaded and installed from CRAN(Comprehensive R Archive Network) website.

 R language is cross platform interoperable and fully portable which means R program that
you write on one platform can be carried out to other platform and run there.(Platform
independent)
3
History and Introduction
Top Tier companies using R – companies all over the world use R language for statical analysis.
These are some of top tier companies that uses R.

Company Name Applications

Facebook For behavior analysis related to status updates and profile pictures.

Google For advertising effectiveness and economic forecasting.

Twitter For data visualization and semantic clustering

Acquired Revolution R company and use it for a variety of


Microsoft purposes.

Uber For statistical analysis

Airbnb Scale data science. 4


Evolution of R
R is a dialect of S language…
 It means that R is an implementation of the S programming language combined
with lexical scoping semantics & inspired by Scheme.
 S language was created by John Chambers in 1976 at Bell Labs.

 A commercial version of S was offered as S-PLUS starting in 1988.

S version1
S version 2
S version 3
S version4

5
Features of R
These are the some of important features of R -
 R is a simple, effective and well-developed, programming language which
includes conditionals, loops, user defined & recursive functions and input &
output facilities.

 R provides a large, coherent and integrated collection of tools for data


analysis.

 R has an effective data handling and storage facility.

 R provides a suite of operators for calculations on arrays, lists, vectors and


matrices.

 R provides graphical facilities for data analysis and display either directly at
the computer or printing at the papers. 6
Features of R

 Fast Calculation - R can be used to perform complex mathematical


and statistical calculations on data objects of a wide variety. Fast Calculation
 Extreme Compatibility - R is an interpreted language which means
that it does not need a compiler to make a program from the code. Extremely Compatible
 Open Source - R is an open-source software environment. You can
make improvements and add packages for additional functionalities
 Cross Platform Support - R is machine-independent. It supports Open Source
the cross-platform operation. Therefore, it can be used on many
different operating systems. Cross Platform Support
 Wide Packages - CRAN houses more
than 10,000 different packages and extensions that help solve all
sorts of problems in data science. Wide Packages
 Large Standard Library - R can produce static graphics with
production quality visualizations and has extended libraries Large Standard Library
providing interactive graphic capabilities.
7
Syntax of R

Once we have R environment setup, then it’s easy to start our R command prompt by just
typing R in command prompt.

Hello World Program – The [ ] in the output of R can be used


>myString <- “Hello world !” to reference data frame columns
>print(myString)

Output :
[1] “Hello World !”

In the Syntax of R we will discuss –


 Data Types
 Variables
 Keywords
 Operators
 Data Structures 8
Data Types
In R there are basically 6 data types –
Integer Logical
Data Type Examples

Integer 2L,5L,8L

Numeric 6,2,1,9 raw


Data Numeric
Logical true,false,0,1 Types
raw Raw Bytes

complex Z=3+7i Character Complex

Character ‘A’ , ”Aditya” , ”AB12”

9
Variables

Rules For Naming Variables in R –


1. In R variable name must be a combination of letters, digits, period(.) and
underscores.
2. It must start with a letter or period(.) and if it starts with period then it period
should not be followed by number.
3. Reserved words in R cannot be used in variable name.
Valid variables Invalid Variables

 myValue
 .1nikku
 .my.value.one
 TRUE
 my_value_one
 vik@sh
 Data4
 _temp

10
Keywords
Reserved Keywords in R – Reserved words are set of words that have special meaning
and cannot be used as names of identifiers.

If Else Repeat While Function

For In Next Break TRUE

FALSE NULL inf NaN -

Reserved Keywords in R
11
Operators
In any programming language, an operator is a symbol which is used to represent an
action. R has several operators to perform tasks including arithmetic, logical and bitwise
operations.

Operators in R can mainly be classified into the following categories –

1.Arithmetic Operators = {+ , - , * , / , %% , %/%}

2.Logical Operators = { ! , & , && , | , ||}

3.Assignment operators = { <- , <<- , = , -> , ->>}

4.Relational Operators = { < , > , <= , >= , != , ==}

12
Functions
Functions are used to incorporate sets of instructions that you want to use repeatedly. There are two types of functions.

Function

Built In User Defined

13
Built - In
Built-in functions are those functions which are provided by R so that we can use directly
within the language and its standard libraries.

In R there are so many built-in functions which make our programming fast and easy.

For Example :
1.The sum(a,b) function will return (a+b)

>print(sum(10,20))
[1] 30

2.The seq(a,b) function is used to get sequence from a to b.

>print(seq(5,15))
[1] 5 6 7 8 9 10 11 12 13 14 15

14
User Defined
User defined functions are those functions which we define in our code and use them
repeatedly. These functions can be defined with two types.
1.Without Arguments 2.With Arguments

Without Arguments With Arguments

myFunction <- function() myFunction <- function(a,b)


{ {
#This will be printed on calling this funcition #This function will print sum of passed args
print(“Without Arguments”) print(a+b)
} }
Conditional Statements

 Conditional Statements in R programming are used to make decisions based on the conditions.

 Conditional statements execute sequentially when there is no condition around the statements.

In R language we’ll discuss 3 types of Conditional Statements –

1.If - else statements

2.If – else if – else statements

3.Switch statements

16
If-else

Syntax – Start
If(condition) {
expression 1
}
Else { yes no
expression 2 Condition
True?
}

Example –
If(a>b) { Execute Execute
print(“a is greater than b”) If Block Else block
}
Else {
print(“ a is less than b”)
} End
17
Switch statement

 In switch() function we pass two types of arguments one is value and others is list of
items.

 The expression is evaluated based on the value and corresponding item is returned.

 If the value evaluated from the expression matches with more than one item of the
list then switch() function returns the item which was matched first.

Examples:

> switch(2,”Delhi”,”Jaipur”,”Mumbai”) > a=3


>[1] “Jaipur” > switch(a,”red”,”blue”,”green”,”yellow”)
> [1] “green”

18
Loops

Loops are used When we need to


execute particular code
repeatedly.
In R Language there are 3 types
of Loops –
1.For Loop Loops
2.While Loop In
R
3.Repeat Loop

19
For Loop
A for loop is used to iterate over a vector in R programming.
Example to count the number of even numbers in a
For each item
vector. in Sequence
Program -
x <- c(2,5,3,9,8,11,6)
count <- 0 Yes
Last item
Reached??
for (i in x) {
if(i %% 2 == 0) {
count=count+1
} No
}
print(count)
Body of
Output - For Loop
[1] 3
Exit Loop
20
While Loop
In R programming, while loops are used to loop until a specific condition is met.

Program –
Start
i <- 1
while(i<5) {
print(i)
i=i+1
} Execute code Yes Condition
of while block True??
Output –
[1] 1
[1] 2 No
[1] 3
[1] 4
Execute code
outside while block
21
Repeat Loop
 A repeat loop is used to iterate over a block of code multiple number of times.
 There is no condition check in repeat loop to exit the loop.
 We must ourselves put a condition explicitly inside the body of the loop and use the break statement to
exit the loop. Failing to do so will result into an infinite loop.
Enter Loop
Example –
x <- 1 Body of
repeat { Loop
print(x)
x = x+1
if (x == 4) {
break Break Yes
?
}
}
Output – No
[1] 1
[1] 2 Remaining
[1] 3 body of loop Exit
22
Data Structures

A data structure is a particular way of


organizing data in a computer so that it can
be used effectively. The idea is to reduce
the space and time complexities of different Arrays
tasks. Data structures in R programming are Matrices
tools for holding multiple values.

Data
Vectors Structures Lists
The most essential data structures used in R include :

 Vectors
 Arrays Data
 Factors Factors Frames
 Lists
 Matrices
 Data Frames
23
Vector
 Vector is the one of basic data structure of R which supports integer,
double, Character, logical, complex and raw data types.
 The elements in a vector are known as components of a vector.

Vector Creation
Vector can be created using these two methods :-
1.By Using Colon(:) Operator –
a <- 2:8
print(a) #2345678

2.By Using seq() function–


a <- seq(2,10,by=2)
print(a) # 2 4 6 8 10

24
Vector
Vector Operations

1.Combining Vectors 2.Arithmetic Operations


a <- c(4,3,5) a <- c(1,2,3)
b <- c(‘x’,’y’,’z’) b <- c(4,5,6)
c <- c(a,b) d <- a+b
print(c) o/p= 4 3 5 x y z print(d) o/p = 5 7 9

3.Numeric Indexing 4.Duplicate Indexing


a <- c(4,3,5) a <- c(4,3,5)
Print(a[2]) op = 5 print(a[1,2,2,3,3]) o/p=4 3 3 5 5

5.Logical Indexing 6.Range Indexing


a <- c(4,3,5) a <- c(1,2,3,4,5,6,7)
print(a[true,false,true]) o/p = 4 5 print(a[2:6]) o/p = 2 3 4 5 6
25
Array
Arrays allow us to store data in multi - dimensions and use in efficient way.
array Creation
Syntax -
Array_Name <- array(data, dim=(row_size,column_size,matrices), dim_names)

array Operations
1.Accessing Array Elements –
Accessing array in R is similar to other programming languages like c,c++ and java.
Eg. Print(Arr[2,2])

2.Arithmetic Operations –
Eg. Arr3 <- Arr2 + Arr1
Or
Arr3 = Arr1 – Arr2 26
Data Frame
Data Frame is a table or a two dimensional Array type structure.
Important Considerations
 The Column names should be non-empty.
 The row names should be unique.
 The Data stored in Data Frames can be only Numeric, Factor or Character Type.
 Each column should contain same number of data types.

Data Frame Creation


products <-data.frame(
product_number = seq(1:4)
product_name = c(“Apple”,”Samsung”,”Redmi”,”Oppo”))
print(products)
Product_number Product_name
1 Apple
2 Samsung
3 Redmi
4 Oppo 27
Lists
 List is a data structure which have components of Mixed data types.

 So a vector having elements of different data types is called a list.

 List can be created using list() function.


Eg. –
x <- list( a=“amba”,b=9.23,c=TRUE) #list storing 3 different data types

Accessing List Elements

print(x[‘b’]) o/p = 9.23


print(x[‘a’]) o/p = “amba”

Manipulating List Elements


x[‘a’] <- “nitin”
print([‘a’]) o/p = “nitin”
28
Matrices
 In R two dimensional rectangular data set is known as Matrix.

 A Matrix is created with the help of the input vector to the matrix() function.

 We can Perform addition, subtraction, multiplication and division operations on matrices.

Creating matrix -
Matrix1 <- matrix (2:7,nrow=2,ncol=3)
print(Matrix1)

o/p = 2 4 6
3 5 7
Accessing Elements –
Matrix1[2,3] #7

Assigning Value –
Matrix1[2,3]=1
29
Matrices

Operations On Matrices
1.Addition :
Matrix3=Matrix1+Matrix2

2.Subtraction :
Matrix3 = Matrix2 – Matrix1

3.Multiply by a Constant :
Ex : 7*Matrix1

4.Identity Matrix :
Ex – diag(5)

5.Transposition
Ex – t(Matrix1)

30
Factors
Factors are data objects which are used to categorise the data and store it as levels.

For example: a data field such as marital status may contain only values from single, married,
separated, divorced, or widowed.

>x
[1] single married married single
Levels : married single

Here, we can see that factor x has four elements and two levels. We can check if a variable is a
factor or not using class() function.

>class(x)
[1] “factor”

>levels(x)
[1] married single 31
R - Studio
Interacting with R Studio –
 R-Studio is a free and open-source integrated development
environment (IDE) for R, a programming language for statistical
computing and graphics.

 R-Studio was founded by JJ Allaire,creator of the programming


language ColdFusion.

 There are 4 main sections in R-Studio IDE…


1.Code Editor
2.Workspace and History
3.R console
4.Plots and Files
R - Studio

 RStudio is available in two editions:

1.RStudio Desktop, where the program is run


locally as a regular desktop application.

2.RStudio Server, Prepackaged distributions of


RStudio Desktop are available for Windows, OS
X, and Linux.

 RStudio is written in the C++ programming


language and uses the Qt framework for its
graphical user interface.

33
Why learn R?

34

You might also like