Python For Data Science
Python For Data Science
Objectives
1. To gain hands-on experience and practice using Python to solve real data science challenges.
2. To familiarize the students to practice Python programming and coding for modeling,
statistics, and storytelling.
3. To utilize popular libraries such as Pandas, numPy, matplotlib, and SKLearn.
4. To enable the students to get hands-on experience creating analytics models and apply those
models to real-world problems.
Course Outcomes
Identify the need for data science and solve basic problems using Python built-in
CO 1
data types and their methods.
CO4 Apply the data transformation and data manipulation operations using “pandas”.
CO5 Analyze nature of data with help of different tools and visualization.
Introduction to Data Science - Why Python? - Essential Python libraries - Python Introduction-
Features, Identifiers, Reserved words, Indentation, Comments, Built-in Data types and their
Methods: Strings, List, Tuples, Dictionary, Set - Type Conversion- Operators. Decision Making-
Looping- Loop Control statement- Math and Random number functions. User defined functions -
function arguments & its types.
User defined Modules and Packages in Python- Files: File manipulations, File and Directory
related methods - Python Exception Handling. OOPs Concepts -Class and Objects, Constructors
– Data hiding- Data Abstraction- Inheritance.
Introducing Pandas Objects – series, data frames, index, Processing CSV, JSON, XLS data,
Operations on Pandas Objects – indexing and selection, universal functions, missing data,
hierarchical indexing, Combining Dataset – concat and append, merge and join. Aggregation and
grouping, Pivot tables, Vectorized string operations, Working with time series, High
performance Pandas – eval(), query().
General MatplotLib, Simple Line Plots, Simple Scatter Plots, Density and Contour Plots,
Histograms, Binnin, and Density, Customizing Plot Legends, Customizing Colorbars, Text and
Annotation, Three-Dimensional Plotting in Matplotlib, Geographic Data with Basemap,
Visualization with Seaborn.
1. Perform Creation, indexing, slicing, concatenation and repetition operations on Python built-
in data types: Strings, List, Tuples, Dictionary, Set
2. Apply Python built-in data types: Strings, List, Tuples, Dictionary, Set and their methods to
solve any given problem.
3. Handle numerical operations using math and random number functions.
4. Create user-defined functions with different types of function arguments.
5. Perform File manipulations- open, close, read, write, append and copy from one file to
another.
6. Write a program to implement OOP concepts like Data hiding and Data Abstraction.
7. Create NumPy arrays from Python Data Structures, Intrinsic NumPy objects and Random
Functions.
8. Manipulation of NumPy arrays- Indexing, Slicing, Reshaping, Joining and Splitting.
9. Computation on NumPy arrays using Universal Functions and Mathematical methods.
10. Load an image file and do crop and flip operation using NumPy Indexing.
11. Create Pandas Series and Data Frame from various inputs.
12. Import any CSV file to Pandas Data Frame and perform the following:
(a) Visualize the first and last 10 records
(b) Get the shape, index and column details
(c) Select/Delete the records (rows)/columns based on conditions.
(d) Perform ranking and sorting operations.
(e) Do required statistical operations on the given columns.
(f) Find the count and uniqueness of the given categorical values.
(g) Rename single/multiple columns
13. Import any CSV file to Pandas Data Frame and perform the following:
(a) Handle missing data by detecting and dropping/ filling missing values.
(b) Transform data using apply () and map() method.
(c) Detect and filter outliers.
(d) Perform Vectorized String operations on Pandas Series.
14. Visualize data using Line Plots, Bar Plots, Histograms, Density Plots and Scatter Plots
using Matplotlib.
15. Statistical Analysis using Python.
References
Resource Persons
1. Dr. Neeba E A
Associate Professor & HoD
Department of Information Technology
Email: [email protected]
Fee Structure
Eligibility Criteria
This certificate course is mainly for the students pursuing B.Tech. in Computer Science,
Information Technology, Electronics and Communication, Applied Electronics and
Instrumentation, B.Sc. Computer Science & Electronics. Those who are completed plus two are
also eligible for this course.
Evaluation scheme
The exam will be conducted after the completion of the entire course.
Those students who acquire a minimum of 60 marks from both the assignments and the
exam will be eligible to get the certificate
Project
The interested inhouse students will get a chance to carry out a project after the
successful completion of the course.