0% found this document useful (0 votes)

15 views3 pages

TP1 Python Numpy

This document outlines the guidelines and tasks for Lab 1 of the X/HEC Data Science for Business course, focusing on Python, Numpy, and Pandas. It includes instructions for submission, coding practices, and a series of exercises designed to familiarize students with Python programming and data manipulation using Numpy. Key topics include string manipulation, array operations, and understanding value versus reference types in Python.

Uploaded by

ryanbalech1

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views3 pages

TP1 Python Numpy

Uploaded by

ryanbalech1

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

X/HEC Datascience for Business

2024/2025 Mathurin Massias

Lab no 1 : Introduction: Python, Numpy, Pandas

- Evaluation -

The Lab is done by pairs. There is no exception. On Moodle you have a section called “Lab
1 submission” for the first class. Each student in the pair must upload their notebook, with the
filename constructed as fistname1_LASTNAME1_firstname2_LASTNAME2.ipynb, where you have sub-
stituted your respective first and last names. Ex : mathurin_MASSIAS_sylvain_COMBETTES.ipynb.
There is no evaluation if you don’t respect this. If code is shared between groups, both groups
get 0. Read the intro slides on what’s expected of you in the labs.

Important preliminary remarks

We use jupyter notebook or jupyter lab, potentially with Visual Studio Code, for all practical
sessions. Some important points to remember :

- Loading -

import sklearn # import a package

import numpy as np # import a package under an alias
import matplotlib.pyplot as plt # import a submodule with an alias
from sklearn import linear_model # import a submodule
from os import mkdir # import a peculiar function

- Using standard help -

When facing a difficulty, you are strongly encouraged to refer to the online documentation of pandas,
numpy, etc. It should become a reflex to look for the answer in the doc or on stackoverflow.
linear_model.LinearRegression? # to get some help on the LinearRegression object

- Package versions -

print(np.version) # to get a package version

Strings
1) From a string containing all the alphabet letters, generate the string cfilorux using slicing (notice
the pattern : this is the 3rd letter, then the 6th, then the 9th, etc). Do the same for the strings
vxz and zxvt (again, notice the patterns). Note : don’t type the whole alphabet yourself, use the
string module.
2) Declare a string variable " XHEC DataScience for Business ". Make it all lowercase. Remove
spaces at the beginning and at the end, but not between words. Replace all e’s with E’s.

page 1
3) Display the number π with 9 decimal digits (np.pi). Don’t cast a number to string, and don’t use
round : use Python’s string formatting instead (either the format method, either the % operator,
either an f-string – the latter is considered more modern).
4) Count the number of occurrences of each character in the string s = "HelLo WorLd!!" (in real
life, you should use a collections.Counter ; here, you are asked to code the method yourself).
Output a dictionary that to each character associates the number of occurrences in this string. In
this question, we consider that lower and upper case characters are the same (e.g. your dictionary
should not have both a L and a l entry).

Fast computations with numpy ; basic plots.

Hint : useful functions : np.arange, np.allclose, np.all, np.linalg.norm for instance. The
whole numpy.linalg module contains interesting Linear Algebra functions. numpy.random contains func-
tions to generate arrays of (pseudo-) random numbers.
In all this section, unless asked explicitly, you cannot use for/while loops (they are slow in native
python).
5) Compute 0.1 + 100 - 100. Using ==, check if it is equal to 0.1. Comment. Compare the two
floating point numbers again with an appropriate numpy function.
6) Create a list (resp. a numpy array) containing all square numbers from 1, 4, ... to 121, using a for
loop (resp. only numpy). Why should you use arrays instead of loops whenever possible ?
7) Create an array containing integers from 2 to 14 by step of 3 (2, 5, 8, ...). Create an array with 15
equispaced valued from 0 to 1 included. Use numpy built-in functions.
ś8 2
8) Compute 2 k“1 4k4k2 ´1 (approximate 8 by a large number n) using a for loop. Propose a ver-
sion without loop, using only numpy (see numpy.prod). Measure the time taken by both versions
using time.time(). You should display the results with a relevant number of significant digits, e.g.
not 0.002487976589749873 seconds. Use the ipython magic %timeit again to measure time of one
version. Why is it better than time.time ?
9) (row and column vectors, aka numpy only knows 1D arrays) Compute the dot product (aka scalar
product) of np.arange(5) and np.ones(5). What is the shape of np.arange(5) ? How many di-
mensions does this array have ? What is the shape of its transpose (use .T) ? What does transposing
1D arrays do ?
10) What does reshape do in M = np.arange(12).reshape(2, 6) ? What does M[:, ::3] do ? What
happens when you do np.arange(3) * np.arange(4)[:, np.newaxis] (this powerful tool is
known as broadcasting)
11) Create a random matrix M P R5ˆ6 with coefficients taken uniformly (and independently) in r´1, 1s.
Substract to each even column of M (say M[:, 0] is even), twice the value of the following (uneven)
column.
12) Replace the negative values in M by 0. Compute the mean of each row of M . Substract to each
row of M , its mean.
13) Create a random matrix M P R5ˆ10 with coefficients taken uniformly (and independently) in r´1, 1s.
Test whether G “ M J M is symmetric semi-definite positive, and that its eigenvalues are strictly
positive. Compute the rank of G. Compute the Euclidean norm of G. Compute the operator norm
of G (aka spectral norm, aka Schatten 2-norm). Compute the standard deviation of each column of
G.
14) Plot the functions x ÞÑ xd on the interval r´1, 2s for d P t2, 3, 4u with a decent resolution. Put a
xlabel, a ylabel, legend the 3 curves with d “ 2, d “ 3, d “ 4 respectively. Put a title.

Numpy advanced behavior

Numpy broadcasting
In this exercise, you can use neither lists nor for loops. You should use only numpy’s operations,
which are fast. An introduction to broadcasting is available here : https://numpy.org/doc/stable/
user/basics.broadcasting.html.

page 2
15) Create an array with integers 1, 3, ..., 19. Subtract its mean to it. Observe than you can thus
subtract a number to an array, even though they do not have the same shape. Create an array with
3 lines and 4 columns, such that arr[i, j] = 4 * i + j (it thus contains integers from 0 to 11).
reshape will help.
16) Now, we subtract vectors to 2D arrays, using broadcasting. Take the previous (3, 4) array, and
subtract its column wise mean to it (easy). Subtract its row wise mean to it (technically more
challenging the first time, you can add an axis to a 1D array with arr[:, None]).
17) Using broadcasting and np.arange, create an array of shape (3, 5) such that arr[i, j] = i * j.

Value and reference types

18) Create a variable a equal to 1000. Check the address in memory of the variable with the builtin id
function. Create a second variable b equal to 1000. What is the address in memory of b ? Create a
third variable c equal to a, check its address in memory. Do a += 1. How does it affect the values
of the three variables ? Why ?
19) Do the same but this time using a = np.array([0, 1]), b = np.array([0, 1]), c = a. What is
going on ?
20) (passing by value/passing by reference) Define a function f as follows : def f(a): a += 1. Call it
on a = 1, then on a = np.ones(10). For both cases, check the value of a after calling f on it.
What’s the reason for this behavior ?
21) For two arrays a = np.zeros(10), b = np.ones(10), what’s the difference between doing a = b
and a[:] = b ? What’s the difference between a = a + 1 and a += 1 ?

page 3

Lab1 ML Eac22050
No ratings yet
Lab1 ML Eac22050
17 pages
Data Toolkit Assignment
No ratings yet
Data Toolkit Assignment
30 pages
Ibm Ai
No ratings yet
Ibm Ai
10 pages
Ch11a Numpy
No ratings yet
Ch11a Numpy
8 pages
Numpy
No ratings yet
Numpy
52 pages
Numpy Revision Exercise
No ratings yet
Numpy Revision Exercise
2 pages
Assignment 2
No ratings yet
Assignment 2
5 pages
Cycle 1 Programs
No ratings yet
Cycle 1 Programs
20 pages
Questionnaire
No ratings yet
Questionnaire
3 pages
NumPy Basics and Operations Guide
No ratings yet
NumPy Basics and Operations Guide
53 pages
Lab 1-4
No ratings yet
Lab 1-4
20 pages
Emerging Technologies
No ratings yet
Emerging Technologies
16 pages
Data Science Lab Manual: Python Guide
No ratings yet
Data Science Lab Manual: Python Guide
72 pages
Assignment 1 All Answers
No ratings yet
Assignment 1 All Answers
20 pages
CKCS 149 Lab 5 Completed
No ratings yet
CKCS 149 Lab 5 Completed
8 pages
Questions
No ratings yet
Questions
25 pages
N Umpy Pandas Tutorial
No ratings yet
N Umpy Pandas Tutorial
65 pages
Lecture 7
No ratings yet
Lecture 7
35 pages
Python Numpy
No ratings yet
Python Numpy
20 pages
2.4. NumPy Operations
No ratings yet
2.4. NumPy Operations
49 pages
Test II NumPy LinAlg Python IntMScS2 Mar 2025
No ratings yet
Test II NumPy LinAlg Python IntMScS2 Mar 2025
2 pages
Session 2 Assessment - Google Forms
No ratings yet
Session 2 Assessment - Google Forms
11 pages
Applied Python Programming (Cycle-1) - 1
No ratings yet
Applied Python Programming (Cycle-1) - 1
26 pages
Python Numpy Library Basics
No ratings yet
Python Numpy Library Basics
69 pages
100 Numpy Exercises 100 Numpy Exercises: NP NP
No ratings yet
100 Numpy Exercises 100 Numpy Exercises: NP NP
18 pages
Practical 1 ML
No ratings yet
Practical 1 ML
11 pages
Short Long
No ratings yet
Short Long
4 pages
NumPy Element-Wise Operations Guide
No ratings yet
NumPy Element-Wise Operations Guide
25 pages
Vid Ids File + 7
No ratings yet
Vid Ids File + 7
30 pages
Numpy Notes
No ratings yet
Numpy Notes
8 pages
Numpy 2
No ratings yet
Numpy 2
20 pages
Numpy Coding Question
No ratings yet
Numpy Coding Question
11 pages
Python Numpy Programming: Eliot Feibush
No ratings yet
Python Numpy Programming: Eliot Feibush
66 pages
13 - NumPy
No ratings yet
13 - NumPy
46 pages
Numpy Tutorial in Python Programming Language.
No ratings yet
Numpy Tutorial in Python Programming Language.
11 pages
DAV
No ratings yet
DAV
80 pages
Aiml - Kaushik - Gogoi
No ratings yet
Aiml - Kaushik - Gogoi
13 pages
Numpy and Pandas Essential Functions
No ratings yet
Numpy and Pandas Essential Functions
46 pages
Python Numpy
100% (1)
Python Numpy
31 pages
Numpy Basics for Scientific Computing
No ratings yet
Numpy Basics for Scientific Computing
23 pages
Workshop Notes-2 Handling Array With NumPy
No ratings yet
Workshop Notes-2 Handling Array With NumPy
13 pages
Lab Sheet 05 - Numpy and Matplotlib
No ratings yet
Lab Sheet 05 - Numpy and Matplotlib
12 pages
UNIT 5 Python Aktu
No ratings yet
UNIT 5 Python Aktu
49 pages
Matrix Python
No ratings yet
Matrix Python
13 pages
Numpy
No ratings yet
Numpy
22 pages
ECE 102 Quiz #2 (W2021) : Your Name (Type Into Box)
No ratings yet
ECE 102 Quiz #2 (W2021) : Your Name (Type Into Box)
4 pages
Python Numpy
No ratings yet
Python Numpy
15 pages
Numpy
No ratings yet
Numpy
19 pages
NumPy Array Operations and Functions
No ratings yet
NumPy Array Operations and Functions
3 pages
IP - NumPy
No ratings yet
IP - NumPy
5 pages
Basic Math: 1.1 Scipy Constants (Scipy - Constants)
No ratings yet
Basic Math: 1.1 Scipy Constants (Scipy - Constants)
32 pages
Python Basics and NumPy Guide
No ratings yet
Python Basics and NumPy Guide
20 pages
Numpy Lab 1-5
No ratings yet
Numpy Lab 1-5
9 pages
Python Libraries and NumPy Guide
No ratings yet
Python Libraries and NumPy Guide
90 pages
Numpy Handbook
No ratings yet
Numpy Handbook
16 pages
Numpy Week 2 Notes New
No ratings yet
Numpy Week 2 Notes New
14 pages
Exp 12 Pyt
No ratings yet
Exp 12 Pyt
7 pages
Uniform Acceleration Exam Problems
No ratings yet
Uniform Acceleration Exam Problems
184 pages
Ai 3,4,5 Vtu nOTES
No ratings yet
Ai 3,4,5 Vtu nOTES
22 pages
PSS/E Oscillation Issue PSLF Initialization Issue
No ratings yet
PSS/E Oscillation Issue PSLF Initialization Issue
6 pages
Python Programs Loops Conditionals Strings
No ratings yet
Python Programs Loops Conditionals Strings
13 pages
Topic 03: Elementary Analytic Functions: MA201 Mathematics III
No ratings yet
Topic 03: Elementary Analytic Functions: MA201 Mathematics III
35 pages
Cluster Analysis and K-Means Guide
No ratings yet
Cluster Analysis and K-Means Guide
20 pages
Business 3550 Financial Management - Assignment 1
No ratings yet
Business 3550 Financial Management - Assignment 1
8 pages
CBSE Class 12 Question Paper 2016 Chemistry Set 2
No ratings yet
CBSE Class 12 Question Paper 2016 Chemistry Set 2
16 pages
Tad1241ge PDF
No ratings yet
Tad1241ge PDF
14 pages
Predicted Question Paper 22627 Summer2025
No ratings yet
Predicted Question Paper 22627 Summer2025
2 pages
E Commerce
No ratings yet
E Commerce
13 pages
RTV Exporter
No ratings yet
RTV Exporter
11 pages
Program Peka v1.5 5g 2011
No ratings yet
Program Peka v1.5 5g 2011
118 pages
The Purpose of Earthing
100% (1)
The Purpose of Earthing
28 pages
Beethoven's Two Movement Piano Sonatas
67% (3)
Beethoven's Two Movement Piano Sonatas
135 pages
DSP Da-01 23bec0056 Yashmehta
No ratings yet
DSP Da-01 23bec0056 Yashmehta
15 pages
Panasonic Video Coding Proposal Overview
No ratings yet
Panasonic Video Coding Proposal Overview
30 pages
Computer Literacy Among Employees of Government and Non-Government Agencies Basis For The Formulation of Computer Training Program
No ratings yet
Computer Literacy Among Employees of Government and Non-Government Agencies Basis For The Formulation of Computer Training Program
9 pages
Mand Maj Connec 3rd Yr
No ratings yet
Mand Maj Connec 3rd Yr
45 pages
ADS R Man en 4 21
73% (11)
ADS R Man en 4 21
20 pages
DLP June 11 - Monomial Factoring
67% (6)
DLP June 11 - Monomial Factoring
5 pages
Hose Quotation List: Description Quantity Specification
No ratings yet
Hose Quotation List: Description Quantity Specification
1 page
Bechtel Electrical Design Guide E34E
100% (1)
Bechtel Electrical Design Guide E34E
23 pages
Choosing The Right Surfboard For You
No ratings yet
Choosing The Right Surfboard For You
10 pages
Data Link Control
No ratings yet
Data Link Control
106 pages
Reduction of Vibrations G.B. Warburton, J. Wiley & Sons, Chichester, 1992, 91 Pages, 17.50 - 1993
No ratings yet
Reduction of Vibrations G.B. Warburton, J. Wiley & Sons, Chichester, 1992, 91 Pages, 17.50 - 1993
2 pages
Electromagnetic Brake Overview
No ratings yet
Electromagnetic Brake Overview
16 pages
Major and Trace Minerals in Nutrition
No ratings yet
Major and Trace Minerals in Nutrition
9 pages
1983 Wood Gust Factor
No ratings yet
1983 Wood Gust Factor
3 pages
Understanding Biomolecules in Science 10
No ratings yet
Understanding Biomolecules in Science 10
9 pages

TP1 Python Numpy

Uploaded by

TP1 Python Numpy

Uploaded by

X/HEC Datascience for Business

2024/2025 Mathurin Massias

Lab no 1 : Introduction: Python, Numpy, Pandas

Important preliminary remarks

import sklearn # import a package

- Using standard help -

print(np.__version__) # to get a package version

Fast computations with numpy ; basic plots.

Numpy advanced behavior

Value and reference types

You might also like

print(np.version) # to get a package version