Data Analysis With Python, SQL &
Data Analysis With Python, SQL &
on
Data Analysis with Python, SQL & Power BI
CONTENT
Introduction
Literature Review
Project Objectives
Result and discussion
Conclusion
Reference
INTRODUCTION
Data Analaysis:
Data Analysis is the process of systematically applying
statistical and/or logical techniques to describe and
illustrate, condense and recap, and evaluate data.
PYTHON:
Python is a programming language that is widely used
in
web applications, software development, data science,
and
machine learning (ML).
Libraries of Python:
Pandas:
Pandas is a Python library used for working with data sets. It has
functions for analyzing, cleaning, exploring, and manipulating data.
The name "Pandas" has a reference to both "Panel Data", and
"Python
Data Analysis" and was created by Wes McKinney in 2008.
Numpy:
NumPy is a Python library used for working with arrays. It also has
functions for working in domain of linear algebra, fourier transform, and
matrices. NumPy was created in 2005 by Travis Oliphant. It is an open
source project and you can use it freely. NumPy stands for Numerical
Python.
Matplotlib:
Matplotlib is a comprehensive library for creating
static, animated, and interactive visualizations in
Python. Matplotlib makes easy things easy and hard
things possible. Create publication quality plots.
Make interactive figures that can zoom, pan, update.
Customize visual style and layout.
SQL:
Structured query language (SQL) is a programming
language
for storing and processing information in a relational
Database.
Literature Review
Topic Research Author(s) & Key findings/
Paper Time Period Contribution
SQL and Python Proceedings of the Li, X., et al. Focuses on using
Integration for International 2021 SQL for data
Data Cleaning in Conference on extraction and
Relational Database Systems Python for
Databases cleaning and
preprocessing
data in relational
databases.
Data Visualization Jones, E.,
Data Visualization Best Practices McAllister, P., & Explores Power
Best Practices Using Power BI Carter, M. BI's role in
Using Power BI 2022 creating
interactive
dashboards and
reports. Shows
how to integrate
Python scripts for
advanced
analytics within
Power BI reports.
Literature Review
Topic Research Author(s) & Key findings/
Paper Time Period Contribution
Output:
Result and Discussion
1. Install and Open Power BI Desktop
Download Power BI Desktop: If you don't have it installed, download it
from the Microsoft website or the Microsoft Store (for Windows).
Open Power BI Desktop: Once installed, open the Power BI Desktop
application.
2. Connect to Your Data Source
Click on "Get Data": This is the first step in importing your data into Power
BI.
Select Data Source: Power BI supports a variety of data sources such as:
a) Excel
b) SQL Server
c) CSV
d) SharePoint
e) Web Data (APIs)
f) Google Analytics
and many others.
Load the Data: After selecting your data source, Power BI will show a
preview of your data. Select the relevant tables or queries, and click Load to
import the data into Power BI.
Result and Discussion
3. Transform Data Using Power Query (Optional)
Power Query Editor: If your data needs cleaning or reshaping (e.g.,
removing duplicates, changing data types, or combining tables), use the
Power Query Editor.
To access it, click on Transform Data after loading your data.
Here, you can perform operations like:
a) Removing unnecessary columns
b) Filtering data
c) Changing data types
d) Merging or appending queries
After transforming your data, click Close & Apply to load the changes.
4. Create Relationships Between Tables (Optional)
If you're working with multiple tables, you may need to establish
relationships between them.
Go to the "Model" view: Here, you can drag and drop fields to create
relationships between different tables.
Ensure that the relationships are set correctly (e.g., one-to-many, many-
to-one).
Result and Discussion
5. Design Your Report Page (Create Visualizations)
Select a Report View: Power BI provides a drag-and-drop interface to create
your visualizations. You'll see a blank canvas where you can create charts, graphs,
tables, etc.
Choose a Visualization: On the right side, under the Visualizations pane,
choose from a variety of charts like:
Bar Chart
Line Chart
Pie Chart
Table
Matrix
Gauge
Card
Map
Drag Fields: Drag the fields from the Fields pane on the right to the visualization
pane. Depending on the visualization type, you will drag different fields to the
Values, Axis, Legend, or Filters section.
Customize Visualizations: Once a visualization is added to the report, you can
customize it by adjusting the formatting options such as colors, labels, titles, etc.
Add More Visualizations: Add multiple visualizations to the same report by
dragging and dropping them onto the report canvas. You can also resize and
rearrange the visualizations as needed.
Conclusion: