0% found this document useful (0 votes)

23 views6 pages

Assignment 1

Uploaded by

annupoonia152005

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

23 views6 pages

Assignment 1

Uploaded by

annupoonia152005

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Section A

Assume the following libraries have been imported:

import numpy as np
import pandas as pd

1. (i) A teacher wants to store the marks of four students in two subjects where
marks are random numbers between 40 and 60 (both inclusive). Write Python
code using an appropriate data structure to store marks. Also display the
average marks of the four students.

(ii) Consider a DataFrame df as shown below:

Using df determine the output of the following code snippet:

print('Shape of dataFrame: ', [Link])
df_filtered = [Link](thresh=2, axis =0)
print('New Frame:\n', df_filtered)
print('Shape of New Frame:', df_filtered.shape)

(iii) Determine the output of the following code snippet:

arr1 = [Link]([[1, 2, 3],[4, 5, 6]])
arr2 = [Link]([10, 20, 30])
arr3 = arr1 + arr2
print(arr3)
arr4 = [Link](1,0)
print(arr4)

(iv) Differentiate between simple random sampling and stratified random

sampling. Give one example for each type of sampling.

(v) Consider the following two variables:

data = {‘Student_Name’: [‘S1’, ‘S2’, ‘S3’, ‘S4’,

‘S5’, ‘S6’, ‘S7’, ‘S8’, ‘S9’, ‘S10’],
‘Score’: [85, 72, 92, 65, 78, 88, 45, 60, 70, 98]}

grades = { ‘score_ranges’:[‘0-60’, ‘60-70’,

‘70-80’, ‘80-90’, ‘90-100’]
‘letter_grades’:[‘F’, ‘D’, ‘C’, ‘B’, ‘A’]}

Write Python code to do the following:

Page Number 1 of 6
a) Create a DataFrame Student containing students’ exam scores using the
above dictionary data.
b) Add a column Grade_Obt to the DataFrame Student.
c) Categorize each students’ scores into letter grades (‘A’, ‘B’, ‘C’, ‘D’, ‘F’)
based on the given score ranges in grades above.
d) Display names of students getting grade ‘A’.

(vi) Write Python code to load the titanic dataset from seaborn library into
a data frame and replace the missing values in each column by the mean of
that column.

3 (i) Distinguish between unimodal, bimodal and multimodal distribution. Use a

diagram to illustrate your answer.

(ii) Determine the output of the following code snippet using the DataFrames df1
and df2:

df1 df2

merged_df1 = [Link](df1, df2, how='outer')

print("Merged DataFrame 1:")
print(merged_df1)
merged_df2 = [Link](df1, df2, how='outer', on=
[“A”])
print("Merged DataFrame 2:")
print(merged_df2)

(iii) Given the list: my_list = list(range(0,6))

a) Create a numpy array One_D using my_list. Convert this one-
dimensional array into a two-dimensional array Two_D with 3 rows.
b) Create another numpy array Tran_Two_D that is the transpose of the
array Two_D.
c) Replace all odd numbers in Two_D with -1.
d) Print the sum of the arrays Two_D and Tran_Two_D.

4(i) Write Python code for the following:

a) Create a numpy array ArrayNum with 5 rows and 4 columns to store

random numbers from 0 to 1.
b) Compute the mean and standard deviation of each row in the array
ArrayNum.

Page Number 2 of 6
c) Convert the ArrayNum into a DataFrame df and name the columns as
‘A’, ‘B’, ‘C’ and ‘D’ respectively. Name the rows as ‘One’,
‘Two’, ‘Three’, ‘Four’ and ‘Five’ respectively.
d) Find the correlation between columns A and C of df.

(ii) Consider the following data frame df_Sales containing sales data for
multiple products across different regions for four quarters of a year:

df_Sales
Write Python code for the following:

a) Create a boxplot for the column ‘Sales_in_INRLakhs’ in

df_Sales. Give an appropriate title to the plot and save the file on disk
b) Create a hierarchical index for df_Sales such that data is arranged
region-wise and quarter-wise within each region.
c) Using df_Sales, find total sales in the northern region in the 2nd quarter.
d) Update df_Sales such that the Sales_in_INRLakhs for product
‘A’ is increased by 25%.

5(i) Consider a DataFrame df as shown below:

Determine the output of the following code snippet using the given DataFrame
df:

Page Number 3 of 6
filled_df1 = [Link](100)
filled_df2 = [Link]()
filled_df3 = [Link]()

print("filled df1:")
print(filled_df1)

print("filled df2:")
print(filled_df2)

print("filled df3:")
print(filled_df3)
print(filled_df3.value_counts())

(ii) Consider the following Series having details of four products:

sales = [Link]({‘Product A’: 5000, ‘Product B’:
8000, ‘Product C’: 3000, ‘Product D’: 6000})

Write Python code to do the following:

a) Find the product name whose sales is maximum.
b) Determine the total quantity sold for all products taken together.
c) Calculate the percentage contribution of each product to the total sales.
d) Find products whose sale is lesser than the average sale.
e) Update the sales figure for the product with the lowest sales to 9999.

6 Consider the following DataFrame Income_Data:

Income_Data

Write Python code for the following:

i. Use an appropriate plot to visualize the distribution of age in

Income_Data. Give an appropriate title to the chart, the x-axis and the
y-axis.
ii. Find the minimum income for each level of education in Income_Data.

Page Number 4 of 6
iii. Determine the Education_Level with the highest average income.
iv. Find the average age of individuals having Income more than 60000.
v. For each Education_Level, find the total number of individuals
studying at that level.

7 Assume that the following data about rubies is saved in an excel file
[Link]).

Cut_Type Cost X Y
Ideal 53940 2.2 1.1
Premium 38450 2.9 1.4
Ideal 64730 2.3 1.2
Good 8493 1.8 0.9
Premium 29480 2.8 1.3
Good 9838 1.7 0.8

Write Python statements to do the following (Mention the libraries used

explicitly):

i. Read data from the given excel file [Link] into a DataFrame df.
ii. Find the total Cost of rubies.
iii. Display the unique values of column Cut_Type.
iv. Find the statistical summary of all numeric columns in the DataFrame df.
v. Arrange details of rubies by Cost in descending order.
vi. Rename the column X as Length and Y as Width.
vii. Create a heatmap of the correlations between the numeric features of df.
Give plot title as “Correlation Matrix”. Save the plotted figure to
a file named “[Link]”.

8 Given a comma separated file [Link] consisting of the

following details of automobiles:

autodf
Cyl: cylinders, HP: horse power

Write Python statement(s) to do the following (Make use of appropriate

libraries):

Page Number 5 of 6
i. Read from the given CSV file [Link] and store this data in
a DataFrame autodf.
ii. Find the number of missing values in each column.
iii. Replace missing values in the HP column with its mean value.
iv. Print Year_of_Make and Model_Name of cars from origin “USA”.
v. Calculate the average Miles for each year.
vi. Plot a pie chart on Country. Give title of the plot as “Pie Chart -
Country”.
vii. Choose a suitable plot to compare frequency of distinct values of the
cylinders. Give appropriate labels to the axes and add a title to the chart.

Page Number 6 of 6

Manishadav
No ratings yet
Manishadav
27 pages
GE Practical Sem 2
No ratings yet
GE Practical Sem 2
28 pages
Python 1
No ratings yet
Python 1
16 pages
2023 Data Analysis and Visualization Using Python
100% (2)
2023 Data Analysis and Visualization Using Python
9 pages
Class XII Informatics Practices
No ratings yet
Class XII Informatics Practices
5 pages
Ge Sem II Dav Upc 2344001201 Sl. No. Qp. 2012 July 2023
No ratings yet
Ge Sem II Dav Upc 2344001201 Sl. No. Qp. 2012 July 2023
16 pages
Information Practices: Section A
No ratings yet
Information Practices: Section A
8 pages
Work Sheet-1 Class 12 IPR
No ratings yet
Work Sheet-1 Class 12 IPR
5 pages
Data Analysis and Visualization Course
No ratings yet
Data Analysis and Visualization Course
4 pages
Ip Questions
No ratings yet
Ip Questions
5 pages
Data Analysis with Python
No ratings yet
Data Analysis with Python
6 pages
Ge - Computer Science Data Analysis
No ratings yet
Ge - Computer Science Data Analysis
16 pages
DXE 24gksmknvj
No ratings yet
DXE 24gksmknvj
16 pages
Class 12 IP Pre-Board Exam 2019-20
No ratings yet
Class 12 IP Pre-Board Exam 2019-20
11 pages
Sanyam Data Science
No ratings yet
Sanyam Data Science
33 pages
XII IP Practical List 2023-24
No ratings yet
XII IP Practical List 2023-24
4 pages
Class XII Informatics Practices Sample Paper
No ratings yet
Class XII Informatics Practices Sample Paper
15 pages
Class XII Informatics Practices
No ratings yet
Class XII Informatics Practices
8 pages
PYQ Data Analysis and Visualisation Using Python GE May 2024
No ratings yet
PYQ Data Analysis and Visualisation Using Python GE May 2024
6 pages
23HCS4142 PDF
No ratings yet
23HCS4142 PDF
24 pages
Ip pb1 QP Ms Agra Set A
No ratings yet
Ip pb1 QP Ms Agra Set A
17 pages
Int375 Etp Paper
No ratings yet
Int375 Etp Paper
11 pages
2020-21 XIIInfo - Pract.S.E.155
No ratings yet
2020-21 XIIInfo - Pract.S.E.155
11 pages
CBSE Class XII IP Practical Guide
No ratings yet
CBSE Class XII IP Practical Guide
21 pages
Ip Worksheet 2 - Q'S
No ratings yet
Ip Worksheet 2 - Q'S
7 pages
GE - Computer Scien 4ogygeb
No ratings yet
GE - Computer Scien 4ogygeb
8 pages
Revision Worksheet (2024-2025)
No ratings yet
Revision Worksheet (2024-2025)
9 pages
Python Pandas DataFrame Tasks
No ratings yet
Python Pandas DataFrame Tasks
9 pages
Fods Programs 25 August 25
No ratings yet
Fods Programs 25 August 25
6 pages
Pandas Worksheet
No ratings yet
Pandas Worksheet
19 pages
GE Python Visualization 2023
No ratings yet
GE Python Visualization 2023
16 pages
Data Analysis and Visualization Exam Guide
No ratings yet
Data Analysis and Visualization Exam Guide
12 pages
Minimum Level Pandas Skill Based Questions
No ratings yet
Minimum Level Pandas Skill Based Questions
8 pages
IP MODEL 1 QST Set 2
No ratings yet
IP MODEL 1 QST Set 2
4 pages
Holidays Homework - Ip
No ratings yet
Holidays Homework - Ip
5 pages
Pandas1 Q&ans
No ratings yet
Pandas1 Q&ans
14 pages
Data Analysis Exam for CS Majors
No ratings yet
Data Analysis Exam for CS Majors
12 pages
Questions Practical File
No ratings yet
Questions Practical File
13 pages
DataFrame Assignment2024
No ratings yet
DataFrame Assignment2024
10 pages
GE - Computer Scien EaQvs42
No ratings yet
GE - Computer Scien EaQvs42
6 pages
HY Exam Revision (11/9/2024)
No ratings yet
HY Exam Revision (11/9/2024)
15 pages
Pragya Exam Question Paper 2023-24
No ratings yet
Pragya Exam Question Paper 2023-24
8 pages
Ipqppt1 24-25kvamc
No ratings yet
Ipqppt1 24-25kvamc
3 pages
Question Bank CIA 2
No ratings yet
Question Bank CIA 2
3 pages
Httppython Mykvs inuploadsfilesXIIInfo Pract S E 150 PDF
No ratings yet
Httppython Mykvs inuploadsfilesXIIInfo Pract S E 150 PDF
15 pages
Python Practical Questions@Subas
No ratings yet
Python Practical Questions@Subas
7 pages
DAV Practical File 234003
No ratings yet
DAV Practical File 234003
14 pages
Model Practical Examination 2024-25 Python Pandas QP
No ratings yet
Model Practical Examination 2024-25 Python Pandas QP
3 pages
Ip Sample Paper 1
No ratings yet
Ip Sample Paper 1
4 pages
DataFrame QP
No ratings yet
DataFrame QP
17 pages
Informatics Practices Practical Record
No ratings yet
Informatics Practices Practical Record
50 pages
Informatic Practices HHW
No ratings yet
Informatic Practices HHW
59 pages
Assignment 1
No ratings yet
Assignment 1
2 pages
Informatics Practices Exam Question Paper
No ratings yet
Informatics Practices Exam Question Paper
12 pages
Data Analysis and Visualization Exam Paper
No ratings yet
Data Analysis and Visualization Exam Paper
12 pages
Class XII Informatics Test
No ratings yet
Class XII Informatics Test
6 pages
Informatic Practices HHW
No ratings yet
Informatic Practices HHW
21 pages
DC/DC Converter: Features
No ratings yet
DC/DC Converter: Features
3 pages
Human Physiology - Prilohy A Rejstrik PDF
No ratings yet
Human Physiology - Prilohy A Rejstrik PDF
83 pages
TG 036
No ratings yet
TG 036
31 pages
Monochromatic Light Sources Guide
No ratings yet
Monochromatic Light Sources Guide
4 pages
WS 5 Number Sequences - Docx - 20240918 - 090130 - 0000
No ratings yet
WS 5 Number Sequences - Docx - 20240918 - 090130 - 0000
4 pages
F-35 Active Stick & Throttle Overview
100% (2)
F-35 Active Stick & Throttle Overview
20 pages
Automatic Plant Irrigation System Using Soil Moisture Sensor
No ratings yet
Automatic Plant Irrigation System Using Soil Moisture Sensor
3 pages
A320 Electrical System Overview
No ratings yet
A320 Electrical System Overview
14 pages
CH 23 - RELATIONSHIPS and GRAPHS
No ratings yet
CH 23 - RELATIONSHIPS and GRAPHS
6 pages
1.) Make A Research About Solar System.: Source
No ratings yet
1.) Make A Research About Solar System.: Source
5 pages
JV Institute
No ratings yet
JV Institute
3 pages
Memory Hierarchy, Cache Memory, Direct Memory Access
No ratings yet
Memory Hierarchy, Cache Memory, Direct Memory Access
6 pages
Statistical Signal Processing Course Overview
No ratings yet
Statistical Signal Processing Course Overview
5 pages
Machine Learning Techniques Overview
No ratings yet
Machine Learning Techniques Overview
159 pages
Molecular Orbital Diagrams for HF and H3+
No ratings yet
Molecular Orbital Diagrams for HF and H3+
17 pages
Roundness Cylindricity Coaxiality Concentricity Runout and Total Runout
No ratings yet
Roundness Cylindricity Coaxiality Concentricity Runout and Total Runout
2 pages
Latihan Slot 2 - Tutormu 4.0
No ratings yet
Latihan Slot 2 - Tutormu 4.0
10 pages
Yaesu Ft-767gx User Manual - Tabascan
No ratings yet
Yaesu Ft-767gx User Manual - Tabascan
40 pages
Understanding Pythagorean Triples
No ratings yet
Understanding Pythagorean Triples
104 pages
Phenol Oxidation: Mechanisms & Products
No ratings yet
Phenol Oxidation: Mechanisms & Products
13 pages
SINAMICS V20 Data Sheet Overview
No ratings yet
SINAMICS V20 Data Sheet Overview
1 page
Systems Engineering for DoD
100% (31)
Systems Engineering for DoD
222 pages
15 16 Program 1
No ratings yet
15 16 Program 1
2 pages
A Good Problem Solver
No ratings yet
A Good Problem Solver
6 pages
Parallel Feeder
No ratings yet
Parallel Feeder
3 pages
Tiempo de Preparación de Pizzas
No ratings yet
Tiempo de Preparación de Pizzas
29 pages
Volleyball Drills and Games for Kids
No ratings yet
Volleyball Drills and Games for Kids
8 pages
Digital Clock Circuit Guide
100% (1)
Digital Clock Circuit Guide
6 pages
Simso2024 - Math - Primary 1
No ratings yet
Simso2024 - Math - Primary 1
5 pages
Title: Measuring G Using A Simple Pendulum: (The Date of Your Lab Experiment Here)
No ratings yet
Title: Measuring G Using A Simple Pendulum: (The Date of Your Lab Experiment Here)
2 pages

Assignment 1

Uploaded by

Assignment 1

Uploaded by

Section A

Assume the following libraries have been imported:

(ii) Consider a DataFrame df as shown below:

Using df determine the output of the following code snippet:

(iii) Determine the output of the following code snippet:

(iv) Differentiate between simple random sampling and stratified random

(v) Consider the following two variables:

data = {‘Student_Name’: [‘S1’, ‘S2’, ‘S3’, ‘S4’,

grades = { ‘score_ranges’:[‘0-60’, ‘60-70’,

Write Python code to do the following:

3 (i) Distinguish between unimodal, bimodal and multimodal distribution. Use a

merged_df1 = [Link](df1, df2, how='outer')

(iii) Given the list: my_list = list(range(0,6))

4(i) Write Python code for the following:

a) Create a numpy array ArrayNum with 5 rows and 4 columns to store

a) Create a boxplot for the column ‘Sales_in_INRLakhs’ in

5(i) Consider a DataFrame df as shown below:

(ii) Consider the following Series having details of four products:

Write Python code to do the following:

6 Consider the following DataFrame Income_Data:

Write Python code for the following:

i. Use an appropriate plot to visualize the distribution of age in

Write Python statements to do the following (Mention the libraries used

8 Given a comma separated file [Link] consisting of the

Write Python statement(s) to do the following (Make use of appropriate

You might also like