0% found this document useful (0 votes)

117 views

Bigdata Programs&Solutions

1. The document describes several R programs for analyzing big data. It includes programs to create vectors and data frames, perform operations on datasets like mtcars, and write data to CSV files. 2. The solutions section provides code snippets to implement the programs, such as creating vectors with repeating values, turning strings into factors, plotting variables, and reading/writing to CSV files. 3. Examples analyze the mtcars dataset, creating vectors from repeated characters, joining data frames, and selecting rows based on column conditions. Functions like rep(), factor(), write.csv() and read.csv() are used.

Uploaded by

Anitha Mc

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

117 views

Bigdata Programs&Solutions

Uploaded by

Anitha Mc

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

Big Data Programs

Part A

(List of R programs)

1. Create vector for the following

a. (4, 6, 3, 4, 6, 3, . . . , 4, 6, 3) where there are 10 occurrences of 4.
b. Use the function paste to create the following character vectors of length 30
("fn1", "fn2", ..., "fn30").In this case, there is no space between fn and the number

2. a. Turn the vector of character items "Control", "Control", "Control", "Ear Removal", "Ear Removal",
"Ear Removal", "Ear Removal", "Fake Ear Removal", "Fake Ear Removal", "Fake Ear Removal", "Fake
Ear Removal" into a Factor variable and create a table from it to show the number of entries in each
treatment.
b. Create a vector of character variables that contains 25 ”a”, 15 ”b”, and 58 ”c” instances. What is
the length of this vector? Create a table from the entries.
3. a. Create three different variables, one that is numeric type and other two are vector of characters.
Use these to create data frame of student.(USN,Name,Marks)
b. Add a new numeric data column to the existing data frame (Age). Provide summary of the data
c. Display the list of student whose Age is less than 20 and Marks greater than 25

4. Write a program to create the csv file for storing Employee data. Containing the data
(EmpID, EmpName , DOJ, EmpCode, Dept, Desig.)
a. Read the suitable number of employee details from the user.
b. Create a dataframe of Employee
c. Store the dataframe in the csv file
d. Check the difference between csv and csv2 file
e. Read the data from csv and Display the contents
f. Append a new row into the csv file

5. Dataset example
a. List the data set available in your system using suitable command
b. Select “mtcars” data set, find and display the number of rows and columns in that data
set
c. Find are there more automatic (0) or manual (1) transmission-type cars in the
dataset? Hint: 9th column indicate the transmission type
d. Get a scatter plot of ‘hp’ vs ‘weight’.
e. Change ‘am’, ‘cyl’ and ‘vs’ to integer and store the new dataset as ‘newmtc’.
f. Extract the cases where cylinder is less than 5

6. Consider “Airquality” dataset

a. Display the dimension of the dataset
b. Display the class of each fields in the data set
c. Test the missing values
d. Recode the missing values, as mean of the column values
e. Exclude the missing values

Solutions

1. Create vector for the following

a. tmp <- c(4,6,3) # Create the vector

rep(tmp,10) #Repeat the vector 10 times
b. paste("fn",1:30,sep="") # paste 1st and 2nd argument

a. # Create the vector of strings

x<-c("Control", "Control", "Control", "Ear Removal", "Ear Removal", "Ear
Removal", "Ear Removal", "Fake Ear Removal", "Fake Ear Removal", "Fake Ear
Removal", "Fake Ear Removal")

# display the vector

> x
[1] "Control" "Control" "Control" "Ear Removal"
[5] "Ear Removal" "Ear Removal" "Ear Removal" "Fake Ear
Removal"
[9] "Fake Ear Removal" "Fake Ear Removal" "Fake Ear Removal"

#construct factor from the vector

> xfact<- factor(x)

#Display the vector

> xfact
[1] Control Control Control Ear Removal
[5] Ear Removal Ear Removal Ear Removal Fake Ear Removal
[9] Fake Ear Removal Fake Ear Removal Fake Ear Removal
Levels: Control Ear Removal Fake Ear Removal
> nlevels(xfact)
[1] 3

2b.

#Create the vector

> x<-c(rep("a",25),rep("b",15),rep("c",58))
> x
[1] "a" "a" "a" "a" "a" "a" "a" "a" "a" "a" "a" "a" "a" "a" "a" "a" "a" "a"
"a" "a"
[21] "a" "a" "a" "a" "a" "b" "b" "b" "b" "b" "b" "b" "b" "b" "b" "b" "b" "b"
"b" "b"
[41] "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c"
"c" "c"
[61] "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c"
"c" "c"
[81] "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c" "c"
> x<-c(rep("a",25),rep("b",15),rep("c",58))
> x
# Find the length of the vector

[1] 98
> table1<- data.frame(x) # Construct table from the vector
> table1
x
1 a
2 a
3 a
4 a
5 a
6 a
|
|
|
|
|
93 c
94 c
95 c
96 c
97 c
98 c

3. a. Create three different variables, one that is numeric type and other two are vector of characters.
Use these to create data frame of student.(USN,Name,Marks)
b. Add a new numeric data column to the existing data frame (Age). Provide summary of the data
c. Display the list of student whose Age is less than 20 and Marks greater than 25

n <- as.integer(readline(prompt = "Enter no of students")) # Read No of

students
# Declare the vector of character of length n
USN <- vector(mode="character", length= n)
Name <- vector(mode="character", length= n)
Marks <- vector(mode="numeric", length= n)

#Read the elements of the vector

print("Enter USN")
for (i in 1:n)
USN[i] <- as.character(readline())
print("Enter Name")
for (i in 1:n)
Name [i] <- readline()
print("Enter Marks" )
for (i in 1:n)
Marks[i] <- as.integer(readline())

#Construct the data frame from the vectors

student <- data.frame(USN,Name,Marks)
print("The Student detials are as follows")
print(student) # Display data frame

print("Enter Age") # Read the vector of Age

Age <- vector(mode="integer", length=n)
for (i in 1:n)
Age [i] <- readline()
student <- cbind(student,Age) # Append the vector to the data frame

print(student)

for(i in 1:n) # Print student age > 20 , marks > 25

if ( student[i,][3] > 25 )
if (student[i,][4] > 20)
print(student[i,])

4. Write a program to create the csv file for storing Employee data. Containing the data
(EmpID, EmpName , DOJ, EmpCode, Dept, Desig.)
a. Read the suitable number of employee details from the user.
b. Create a dataframe of Employee
c. Store the dataframe in the csv file
d. Read the data from csv and Display the contents
e. Append a new row into the csv file

a. n <- as.integer(readline(prompt = "Enter no of Employee"))

EmpId <- vector(mode="character", length= n)
EmpName <- vector(mode="character", length= n)
DOJ <- vector(mode="character", length= n)
EmpCode <- vector(mode="numeric",length = n)
Desig <- vector(mode="character",length = n)
Dept <- vector(mode="character",length = n)

print("Enter EmpId")
for (i in 1:n)
EmpId[i] <- as.character(readline())
print("Enter EmployeeName")
for (i in 1:n)
EmpName [i] <- readline()
print("Enter DOJ" )
for (i in 1:n)
DOJ[i] <- (readline())
print("Enter EmployeeCode" )
for (i in 1:n)
EmpCode[i] <- as.integer(readline())
print("Enter Designation" )
for (i in 1:n)
Desig[i] <- (readline())
print("Enter Dept" )
for (i in 1:n)
Dept[i] <- (readline())

b.
Emp <- data.frame(EmpId,EmpName,EmpCode,Desig,Dept,DOJ)

print("The Employee detials are as follows")

print(Emp)

c.
write.csv(Emp,"C:/Users/ARCHANA/Documents/Empfile.csv")

d.
readStudent=read.csv("C:/Users/ARCHANA/Documents/file.csv")

e.
print("Enter a new row")
u<- readline(prompt = "EmpId")
n<- readline(prompt = "EmpName")
m<- readline(prompt = "EmpCode")
A<- readline(prompt = "Desig")
s<- readline(prompt = "Dept")
t<- readline(prompt = "DOJ")

x<- data.frame(u,n,m,A,s,t)

write.table(x,"C:/Users/ARCHANA/Documents/Empfile.csv",col.names = FALSE, append = T,row.names

= T, quote= FALSE, sep = ",")

c. x<- data.frame(mtcars)
automatic <-0
manual <-0
for (i in 1:rownum)
ifelse( x[i,9] == 1, automatic <- automatic + 1, manual <- manual +1)
ifelse (automatic > manual,
print("There are more automatic transmission type"),
print("There are more manual transmission type") )

d. //The scatter plot

HorsePower <- x[,4]
Weight <- x[,6]
scatter.smooth(HorsePower,Weight, span=2/3, degree = 1, family =c("symmetric","gaussian"))

// Plot histogram of Miles/gallon

Mpg <- x[,1]
hist(Mpg, breaks = 12, col ="lightblue", border = "pink")

e. // Solution for e
x[,2]<- as.integer(x[,2])
x[,8]<- as.integer(x[,8])
x[,9]<- as.integer(x[,9])
x[,2] <= 5

f. mtcars[mtcars$cyl <=5 ]

6. Consider “Airquality” dataset

a. df <- airquality
dim(df)
b. sapply(df,class)
c. #Printing the missing values
print("The Missing values are as follows")
xcolNames <- colnames(df)
x<- colSums(is.na(df))
print(x)
d. which(is.na(df))
sum(is.na(df))
df1<- as.data.frame(df)
e. #Recoding the missing values
for(i in 1:4)
df1[,i]<- ifelse ( is.na(df[,i]), mean(df[,i], na.rm = TRUE), df[,i])
# Excluding the missing values

df2<-na.omit(df)

Exercises For R
No ratings yet
Exercises For R
40 pages
Lesson 4: Educational Technology and Innovative Teaching
No ratings yet
Lesson 4: Educational Technology and Innovative Teaching
4 pages
arunav da prac
No ratings yet
arunav da prac
55 pages
R Assignment
No ratings yet
R Assignment
9 pages
R-Programming Record - Odd Sem 21-22
No ratings yet
R-Programming Record - Odd Sem 21-22
35 pages
Lec 13
No ratings yet
Lec 13
46 pages
Experiment 5
No ratings yet
Experiment 5
13 pages
Practical 2 Kunal
No ratings yet
Practical 2 Kunal
6 pages
DSDA MANUAL
No ratings yet
DSDA MANUAL
64 pages
RemoveWatermark pdf24 Merged+
No ratings yet
RemoveWatermark pdf24 Merged+
76 pages
DAV_LAB3.pdf_20250306_141450_0000
No ratings yet
DAV_LAB3.pdf_20250306_141450_0000
57 pages
DS Tutorial-2 Dinesh Dodeja 52119
No ratings yet
DS Tutorial-2 Dinesh Dodeja 52119
5 pages
R Assignment 10
No ratings yet
R Assignment 10
12 pages
DAV_LAB
No ratings yet
DAV_LAB
55 pages
Midterm Session II #0000000224 - On March 25, 2016 14 13: Processing
No ratings yet
Midterm Session II #0000000224 - On March 25, 2016 14 13: Processing
11 pages
Learn R by Intensive Practice - Coding Test
No ratings yet
Learn R by Intensive Practice - Coding Test
22 pages
R Programing Bhagu
No ratings yet
R Programing Bhagu
40 pages
Da Lab It
No ratings yet
Da Lab It
20 pages
R Programming
No ratings yet
R Programming
50 pages
R File New
No ratings yet
R File New
22 pages
Lecture_5_(Managing_and_Understanding_Data)
No ratings yet
Lecture_5_(Managing_and_Understanding_Data)
9 pages
R Lab
No ratings yet
R Lab
15 pages
Grade11 Datascience
No ratings yet
Grade11 Datascience
4 pages
Introduction to r Chap 2
No ratings yet
Introduction to r Chap 2
30 pages
DA Lab Manual
No ratings yet
DA Lab Manual
42 pages
Part a r Programming
No ratings yet
Part a r Programming
10 pages
Simple Tutorial in R
No ratings yet
Simple Tutorial in R
15 pages
Experiment - 6 DATE: 28.2.2020 Data Analytics Lab: Seq (1, 3, by 0.2)
No ratings yet
Experiment - 6 DATE: 28.2.2020 Data Analytics Lab: Seq (1, 3, by 0.2)
3 pages
R
No ratings yet
R
15 pages
Data Manipulation: Ionut Bebu
No ratings yet
Data Manipulation: Ionut Bebu
19 pages
Da Lab File
No ratings yet
Da Lab File
33 pages
Big Data Lab R Code with output.docx
No ratings yet
Big Data Lab R Code with output.docx
13 pages
R Programmimg Practical Journal All-1
No ratings yet
R Programmimg Practical Journal All-1
25 pages
Character Vectors: Letters Letters
No ratings yet
Character Vectors: Letters Letters
5 pages
Introduction To R
No ratings yet
Introduction To R
21 pages
R Lab Programs-1
No ratings yet
R Lab Programs-1
26 pages
Big Data File in R
No ratings yet
Big Data File in R
23 pages
R123
No ratings yet
R123
6 pages
Workshop Activity: X Seq y Length
No ratings yet
Workshop Activity: X Seq y Length
3 pages
A Short List of Some Useful R Commands: Input and Display
No ratings yet
A Short List of Some Useful R Commands: Input and Display
2 pages
Analysis Using Statistical: Introduction & Data Exploration
No ratings yet
Analysis Using Statistical: Introduction & Data Exploration
23 pages
materi 4
No ratings yet
materi 4
30 pages
R16
No ratings yet
R16
55 pages
R Programming Basics Guide
No ratings yet
R Programming Basics Guide
30 pages
R Programming: © 2016 SMART Training Resources Pvt. LTD
No ratings yet
R Programming: © 2016 SMART Training Resources Pvt. LTD
28 pages
R Prgms
No ratings yet
R Prgms
12 pages
An Introduction To R Language
No ratings yet
An Introduction To R Language
11 pages
18 3 24 Upto Week 6 A B Latest 1
No ratings yet
18 3 24 Upto Week 6 A B Latest 1
25 pages
R-Programming Lab Mannual
No ratings yet
R-Programming Lab Mannual
33 pages
BIO259 Note
No ratings yet
BIO259 Note
55 pages
RStudio
No ratings yet
RStudio
31 pages
R For Machine Learning Lab Practical Work: Master of Business Administration in Business Analytics
0% (1)
R For Machine Learning Lab Practical Work: Master of Business Administration in Business Analytics
9 pages
Statistics With R
No ratings yet
Statistics With R
5 pages
Lab Manual Record: St. Josephs PG College
No ratings yet
Lab Manual Record: St. Josephs PG College
14 pages
DA Practical File
No ratings yet
DA Practical File
36 pages
R Commands
No ratings yet
R Commands
18 pages
R Syntax Examples 1
No ratings yet
R Syntax Examples 1
6 pages
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Computer Engineering Laboratory Solution Primer
From Everand
Computer Engineering Laboratory Solution Primer
Karan Bhandari
No ratings yet
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
IBPS RRB Officer Scale Prelim 1 AIT
No ratings yet
IBPS RRB Officer Scale Prelim 1 AIT
12 pages
Ad Hoc Wireless Networking Course Notes: March 2002
No ratings yet
Ad Hoc Wireless Networking Course Notes: March 2002
59 pages
Cse I SEM Q.BANK 2016-17 PDF
No ratings yet
Cse I SEM Q.BANK 2016-17 PDF
127 pages
Taxonomy For Wireless Sensor Networks Services Characterisation and Classification
No ratings yet
Taxonomy For Wireless Sensor Networks Services Characterisation and Classification
4 pages
Wireless Communication July 2017 (2010 Scheme)
No ratings yet
Wireless Communication July 2017 (2010 Scheme)
1 page
Design of Horizontal Curve: Formula
No ratings yet
Design of Horizontal Curve: Formula
3 pages
Information and Network Security Lab: Lab Mannual FOR VII SEMESTER Computer Science & Engineering
No ratings yet
Information and Network Security Lab: Lab Mannual FOR VII SEMESTER Computer Science & Engineering
30 pages
Tech Spec - Lifts 20.12.08
No ratings yet
Tech Spec - Lifts 20.12.08
16 pages
Install ABAP Development Tools
No ratings yet
Install ABAP Development Tools
20 pages
TC3 Installation EN
No ratings yet
TC3 Installation EN
79 pages
Dual-Tone Multi-Frequency (DTMF) : DAS Project Report
No ratings yet
Dual-Tone Multi-Frequency (DTMF) : DAS Project Report
23 pages
HAC-GWW1 User Manual Quick Start Guide
No ratings yet
HAC-GWW1 User Manual Quick Start Guide
18 pages
How To Set Web Server in Cisco Packet Tracer
No ratings yet
How To Set Web Server in Cisco Packet Tracer
8 pages
1 Data Sheet Nokia 7368 ISAM ONT G-240G-C For POL
No ratings yet
1 Data Sheet Nokia 7368 ISAM ONT G-240G-C For POL
3 pages
Yamaha Speakers
No ratings yet
Yamaha Speakers
4 pages
Samsung Digital Camera SCB-2000
No ratings yet
Samsung Digital Camera SCB-2000
17 pages
VSP211L
No ratings yet
VSP211L
4 pages
"Software Piracy Protection Using Machine Learning": Bachelor of Engineering
No ratings yet
"Software Piracy Protection Using Machine Learning": Bachelor of Engineering
19 pages
RivCross Operating Manual
No ratings yet
RivCross Operating Manual
223 pages
Gmax 50 PDF
No ratings yet
Gmax 50 PDF
275 pages
Electric FD Fan Actuator For Aalborg Boilers
No ratings yet
Electric FD Fan Actuator For Aalborg Boilers
2 pages
TVB Reasons of Success
No ratings yet
TVB Reasons of Success
4 pages
ConvertXtoDVD v4 Manual
No ratings yet
ConvertXtoDVD v4 Manual
277 pages
Recent Developments and Applications of Simultaneous Localization and Mapping in Agriculture
No ratings yet
Recent Developments and Applications of Simultaneous Localization and Mapping in Agriculture
28 pages
How To Setup An Md380 For Use With A DMR Repeater
No ratings yet
How To Setup An Md380 For Use With A DMR Repeater
7 pages
Gwhs Annex C
No ratings yet
Gwhs Annex C
48 pages
IQ-400 Install F-1
No ratings yet
IQ-400 Install F-1
120 pages
Manual Monitoreo Presion LLantas Vag
100% (1)
Manual Monitoreo Presion LLantas Vag
36 pages
DLL EMPowerment TECHnologies
100% (2)
DLL EMPowerment TECHnologies
23 pages
Error Facturacion
No ratings yet
Error Facturacion
101 pages
DC System - Mar 2023
No ratings yet
DC System - Mar 2023
23 pages
Module-1 Answer - Sherlyn C. Taboso
No ratings yet
Module-1 Answer - Sherlyn C. Taboso
8 pages
Lesson 4 The Human Person Flourishing in Terms of Science and Technology
No ratings yet
Lesson 4 The Human Person Flourishing in Terms of Science and Technology
4 pages
GK & Current Affairs: Career Options
No ratings yet
GK & Current Affairs: Career Options
2 pages
Oracle Forms and Reports Training Course Highlights
No ratings yet
Oracle Forms and Reports Training Course Highlights
4 pages
Spec of FYI Work
No ratings yet
Spec of FYI Work
36 pages

Bigdata Programs&Solutions

Uploaded by

Bigdata Programs&Solutions

Uploaded by

Big Data Programs

1. Create vector for the following

6. Consider “Airquality” dataset

1. Create vector for the following

a. tmp <- c(4,6,3) # Create the vector

a. # Create the vector of strings

# display the vector

#construct factor from the vector

#Display the vector

#Create the vector

n <- as.integer(readline(prompt = "Enter no of students")) # Read No of

#Read the elements of the vector

#Construct the data frame from the vectors

print("Enter Age") # Read the vector of Age

for(i in 1:n) # Print student age > 20 , marks > 25

a. n <- as.integer(readline(prompt = "Enter no of Employee"))

print("The Employee detials are as follows")

write.table(x,"C:/Users/ARCHANA/Documents/Empfile.csv",col.names = FALSE, append = T,row.names

d. //The scatter plot

// Plot histogram of Miles/gallon

6. Consider “Airquality” dataset

You might also like