0% found this document useful (0 votes)
21 views

Data Analysis With Python, SQL &

Uploaded by

ronitgupta547
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
21 views

Data Analysis With Python, SQL &

Uploaded by

ronitgupta547
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 18

Industrial Training Presentation

on
Data Analysis with Python, SQL & Power BI
CONTENT

 Introduction
 Literature Review
 Project Objectives
 Result and discussion
 Conclusion
 Reference
INTRODUCTION
Data Analaysis:
Data Analysis is the process of systematically applying
statistical and/or logical techniques to describe and
illustrate, condense and recap, and evaluate data.

PYTHON:
Python is a programming language that is widely used
in
web applications, software development, data science,
and
machine learning (ML).
Libraries of Python:

Pandas:

Pandas is a Python library used for working with data sets. It has
functions for analyzing, cleaning, exploring, and manipulating data.
The name "Pandas" has a reference to both "Panel Data", and
"Python
Data Analysis" and was created by Wes McKinney in 2008.

Numpy:
NumPy is a Python library used for working with arrays. It also has
functions for working in domain of linear algebra, fourier transform, and
matrices. NumPy was created in 2005 by Travis Oliphant. It is an open
source project and you can use it freely. NumPy stands for Numerical
Python.
Matplotlib:
Matplotlib is a comprehensive library for creating
static, animated, and interactive visualizations in
Python. Matplotlib makes easy things easy and hard
things possible. Create publication quality plots.
Make interactive figures that can zoom, pan, update.
Customize visual style and layout.

SQL:
Structured query language (SQL) is a programming
language
for storing and processing information in a relational
Database.
Literature Review
Topic Research Author(s) & Key findings/
Paper Time Period Contribution

Jupyter Notebooks F1000Research Jupyter, et al. Discusses how


for Data Science 2015 Jupyter notebooks
and Scientific (often used with
Computing Python) can be
applied to
interactive data
analysis.

Kusumoto, Y., et al. Kusumoto, Y., et al. Analyzes the


Combining SQL and 2019 integration of
Python for Real- Python and SQL in
Time Data real-time data
Analytics in IoT analysis for IoT
Systems applications.
Literature Review
Topic Research Author(s) & Key findings/
Paper Time Period Contribution

SQL and Python Proceedings of the Li, X., et al. Focuses on using
Integration for International 2021 SQL for data
Data Cleaning in Conference on extraction and
Relational Database Systems Python for
Databases cleaning and
preprocessing
data in relational
databases.
Data Visualization Jones, E.,
Data Visualization Best Practices McAllister, P., & Explores Power
Best Practices Using Power BI Carter, M. BI's role in
Using Power BI 2022 creating
interactive
dashboards and
reports. Shows
how to integrate
Python scripts for
advanced
analytics within
Power BI reports.
Literature Review
Topic Research Author(s) & Key findings/
Paper Time Period Contribution

Integrating Integrating Morris, K., & Highlights the role


Predictive Predictive Green, D. of predictive
Analytics into Analytics into 2020 analytics in
Power BI Power BI. business
intelligence,
specifically within
Power BI.
Discusses the
challenges and
opportunities in
embedding Python
Johnson, L. models.
Python in Machine The Role of Python 2021
Learning and Data in Machine Explores Python’s
Science Learning and Data integration in
Science machine learning
frameworks and
its role in
predictive
modeling.
PROJECT OBJECTIVE
 Project dashboards serve as a quick reference point
for project managers, making it easy for them to
keep a close eye on tasks that may be falling
behind or projects that are underperforming so they
can take corrective action
 Dashboards are useful for monitoring, measuring,
and analyzing relevant data in key areas. They take
raw data from many sources and clearly present it
in a way that's highly tailored to the viewer's needs
—whether you're a business leader, line of business
analyst, sales representative, marketer, and more.
 Develop interactive and dynamic dashboards that
allow users to visualize and explore data in real-
time using Python tools like Dash or Streamlit
Project Objective
 Build a dashboard that can reflect real-time or
periodic updates from the underlying SQL
database.
 Develop a clean, intuitive, and user-friendly
interface with easy navigation and clearly defined
sections for various metrics and data
visualizations.
 Improve query performance to ensure fast load
times and minimize delays when refreshing
dashboard data.
 Design the dashboard to be scalable, allowing it to
handle larger datasets and additional features in
the future
 Identify and fix any bottlenecks in data
processing, visualization rendering, or database
queries.
Data:
Result and Discussion

Output:
Result and Discussion
1. Install and Open Power BI Desktop
 Download Power BI Desktop: If you don't have it installed, download it
from the Microsoft website or the Microsoft Store (for Windows).
 Open Power BI Desktop: Once installed, open the Power BI Desktop
application.
2. Connect to Your Data Source
 Click on "Get Data": This is the first step in importing your data into Power
BI.
 Select Data Source: Power BI supports a variety of data sources such as:
a) Excel
b) SQL Server
c) CSV
d) SharePoint
e) Web Data (APIs)
f) Google Analytics
and many others.
 Load the Data: After selecting your data source, Power BI will show a
preview of your data. Select the relevant tables or queries, and click Load to
import the data into Power BI.
Result and Discussion
3. Transform Data Using Power Query (Optional)
 Power Query Editor: If your data needs cleaning or reshaping (e.g.,
removing duplicates, changing data types, or combining tables), use the
Power Query Editor.
 To access it, click on Transform Data after loading your data.
 Here, you can perform operations like:
a) Removing unnecessary columns
b) Filtering data
c) Changing data types
d) Merging or appending queries
 After transforming your data, click Close & Apply to load the changes.
4. Create Relationships Between Tables (Optional)
 If you're working with multiple tables, you may need to establish
relationships between them.
 Go to the "Model" view: Here, you can drag and drop fields to create
relationships between different tables.
 Ensure that the relationships are set correctly (e.g., one-to-many, many-
to-one).
Result and Discussion
5. Design Your Report Page (Create Visualizations)
 Select a Report View: Power BI provides a drag-and-drop interface to create
your visualizations. You'll see a blank canvas where you can create charts, graphs,
tables, etc.
 Choose a Visualization: On the right side, under the Visualizations pane,
choose from a variety of charts like:
 Bar Chart
 Line Chart
 Pie Chart
 Table
 Matrix
 Gauge
 Card
 Map
 Drag Fields: Drag the fields from the Fields pane on the right to the visualization
pane. Depending on the visualization type, you will drag different fields to the
Values, Axis, Legend, or Filters section.
 Customize Visualizations: Once a visualization is added to the report, you can
customize it by adjusting the formatting options such as colors, labels, titles, etc.
 Add More Visualizations: Add multiple visualizations to the same report by
dragging and dropping them onto the report canvas. You can also resize and
rearrange the visualizations as needed.
Conclusion:

This project involved the development of an interactive


job
openings dashboard that provides real-time insights
into job
postings across multiple companies. By integrating
Python, SQL,
and Power BI, we created a comprehensive data analysis
and
visualization solution that allows stakeholders to monitor
job
market trends, track company hiring patterns, and make
data-
driven decisions.
Conclusion:

 Python for Data Cleaning and Advanced Analysis.


 SQL for Data Management and Querying.
 Power BI for Visualization and Interactive
Dashboard Creation.
 Data Integration and Synergy Between Python,
SQL, and Power BI.
 Future Enhancements and Recommendations.
Reference:

You might also like