Portofolio - Intensive Bootcamp
Microsoft
Excel
Owner: Burhanuddin Yusuf Robbani
Build your skill and portfolio via myskill.id/bootcamp
List of Contents
Part 1: Data Pre-Processing with Basic Excel
Basic Formula, Function, Data Formatting, Data Cleansing, & Conditional If Else
Part 2: Data Validation and Data Visualization
Data Validation, Data Visualizzation, Vlookup, & Index Match
Part 3: Data Pre-Processing and Data Analysis
Pivot Table, Power Pivot, Macro VBA
#RintisKarirImpian
Intensive Bootcamp
Excel
Part 1: Data
Pre-Processing with
Basic Excel
(Basic Formula, Function, Data Formatting, Data Cleansing, &
Conditional If Else)
EXCEL PORTOFOLIO
Link Excel
Course Summary
A) Difference Between Formula and Function
Formula: Function that calculate a value from the cell reference (You
need to click each cell that needed for calculation)
Example : Profit =D2+D3+D4+D5
Function: Function that has specific formula to simplified calculation
(You don’t have to click each cell that needed for calculation)
Example : Profit =SUM(D2:D5)
#RintisKarirImpian
Course Summary
B) Cell Reference - Relative Reference
Relative reference is the basic cell references that adjust and change
when copied or when using AutoFill
#RintisKarirImpian
Course Summary
B) Cell Reference - Absolut Reference
Absolut reference is the opposite of the relative reference which the
cell won’t adjust and change when copied or autofill
#RintisKarirImpian
Course Summary
B) Cell Reference - Mix Reference
Absolut Row reference is only lock the rows reference
Absolut Column reference is only lock the columns reference
#RintisKarirImpian
Course Summary
C) Basic Aggregation
COUNTA is calculating the numbers of data numerical or text
COUNT is calculating the numbers of data numerical only
SUM is calculating the total data
AVERAGE is calculating average of the data
MAX is finding the maximum number of the data
MIN is finding the minimum number of the data
#RintisKarirImpian
Course Summary
D) Error Type
#DIV/0! will happen when we divide with 0 or blank cell
#NAME? will happen when the function is not recognized by Excel
#REF! will happen when the reference is not valid or blank
#### will happen when the width of the column is too tight
#VALUE! will happen when the data type is not recognized by the used function
#N/A will happen when there is no data in excel
#RintisKarirImpian
Course Summary
E) Data Type
Data Data Type
99 Whole Number
99,4 Decimal Number
9.9 String
MySkill String
07 September 2023 Date
TRUE Boolean
#RintisKarirImpian
Course Summary
F) Data Extraction - Date
#RintisKarirImpian
Course Summary
F) Data Extraction - Text
Function Result Note
=LEFT(B2,3) FUR Get the character from the left
=RIGHT(B2,3) 798 Get the character from the right
=CONCAT(A2;”-”;C2) 1-Furniture Merging the data
=FIND(“-”;B2;1) 4 Finding the character’s index
=MID(B2,2,3) UR- Get the character from the middle
#RintisKarirImpian
Course Summary
G) IF - Basic
Function : :
=IF(logical_test;[value_if_true];[value_if_false])
Example : =IF(I4>0;”Profit”;”Not Profit”)
IF = “Condition”
it will give the value of True when the data is meet the condition, if not, it will give the value
of False, you can input the value with any data types.
#RintisKarirImpian
Course Summary
G) IF - Advanced - IFS
Function : : =IFS(logical_test1; value_if_true1;
[logical_test2; value_if_true2];…)
IFS = “Multi Condition”
with this function, you can input many condition as many as you want, however, there is no
false result, so make sure you put all the needed conditions to prevent the error result.
#RintisKarirImpian
Course Summary
H) AND & OR
Function : : =AND(logical_test1; logical_test2)
=OR(logical_test1;logical_test2)
AND : when 2 conditions are meet then it will
give the True result, if not then False result
OR : when atleast 1 condition is meet then it
will give the True result, if not then False result
#RintisKarirImpian
Case Study
We have a sales data transaction that contain records of the regional,
item, and the name of salesperson in specific date. As we can see, the
data itself is not in good condition for analysis, therefore we need to do
some data formatting, data cleansing, and data transformation to answer
some of the questions from the data.
#RintisKarirImpian
Case Study
After we did some data pre-processing, the sales manager wants us to
answer all of these questions to give some insight that can be applied for
the selling strategy in the future, therefore, we need to calculate these
data with accurate function so the result is not wrong or biased.
#RintisKarirImpian
Problem Solving
A) Tidying up the width columns and the height rows
By using Alt+H+O+I & Alt+H+O+A we can simply autofit
the width of the column and the height of rows
#RintisKarirImpian
Problem Solving
B) Data Formatting - Date (part 1)
As you can see, the data in the date column are not
consistent because some of them are containing text
value and also some of them have different date
format, therefore, we need to change the text type first
into the number type.
#RintisKarirImpian
Problem Solving
B) Data Formatting - Date (part 2)
First, we need to prepare 2 columns on the right side to separate the
data into days, months, and years. After that, choose the date column
and select the data type to number to see which data is text or not, if
the data is not converted to number then the data is having text type.
Next, choose text to columns from the data menu.
Next, choose delimited and click next button.
#RintisKarirImpian
Problem Solving
B) Data Formatting - Date (part 3)
Choose the specific delimiters (in this case, we use “/”)
then click next button.
Choose the column data format (in this case, we use
Date format) into all columns, then click finish button.
#RintisKarirImpian
Problem Solving
B) Data Formatting - Date (part 4)
The result would be like this, we need to change the
column F and G into the number type.
As we can see all the columns now containing number
types corresponding to the date format
(month/day/year).
#RintisKarirImpian
Problem Solving
B) Data Formatting and Extraction - Date (part 5)
Prepare one column again into the right side to joining
3 columns into one text using concat function then
convert them into date format using datevalue function.
Now our date data have same type
and can be used for analysis,
however, in this case we only need
the year for calculation, therefore,
we can use text function to extract
the year from date.
#RintisKarirImpian
Problem Solving
C) Data Formatting - Accounting
Convert the data into accounting format.
#RintisKarirImpian
Problem Solving
D) Data Extraction - SalePrice
Create a new column to calculate the
SalePrice using the formula
price - (price*discount %).
#RintisKarirImpian
Problem Solving
E) Data Extraction - OrderCode
Create an order code using the
combination of function right, left, and
mid.
#RintisKarirImpian
Problem Solving
F) Data Aggregation - Sum of Sales
Before answering all of the questions from the sales
manager, we need to aggregate the specific values first,
for example, in this case we need to calculate the total of
sales by some regions and each year using SUMIFS
function.
Function : : =SUMIFS(sum_range;criteria_range1;criteria1;...)
● sum_range is the column reference for calculating the total data, in this
case, we use all data from SalePrice that we already calculated before.
● criteria_range means which column reference that we use for criteria
selecion, after that we can choose the exact criteria itself.
Note: Be careful for using this function, we may need some absolute and mix reference so that the
data we selected would not adjusting by itself, as you can see I use absolut reference for range
selection, rows reference for the region, and columns reference for the year.
#RintisKarirImpian
Problem Solving
G) Data Aggregation - Average of Sales
Same as sum of sales, this time, we calculate the average
of sales for some regions and each year, we can use the
function of AVERAGEIFS.
Function : : =AVERAGEIFS(Average_range;criteria_range1;criteria1;...)
● average_range is the column reference for calculating the average data,
in this case, we use all data from SalePrice that we already calculated
before.
● criteria_range means which column reference that we use for criteria
selecion, after that we can choose the exact criteria itself.
Note: Be careful for using this function, we may need some absolute and mix reference so that the
data we selected would not adjusting by itself, as you can see I use absolut reference for range
selection, rows reference for the region, and columns reference for the year.
#RintisKarirImpian
Problem Solving
H) Data Aggregation - Number of Sales
Same as sum of sales, this time, we calculate the number
of sales for some regions and each year, we can use the
function of COUNTIFS.
Function : : =COUNTIFS(criteria_range1;criteria1;...)
● criteria_range means which column reference that we use for criteria
selecion, after that we can choose the exact criteria itself.
Note: Be careful for using this function, we may need some absolute and mix reference so that the
data we selected would not adjusting by itself, as you can see I use absolut reference for range
selection, rows reference for the region, and columns reference for the year.
#RintisKarirImpian
Problem Solving
I) Data Aggregation - Calculate Performance of SalesPerson (part 1)
After doing some aggregations, now we also need to
calculate the performance from each sales person with the
parameter from how many they making sales and how
many they generate the revenue from the sales.
However, we need to copy the data of Salesperson first
and also watch out from the duplicate, we could use a
feature from excel called remove duplicates from the data
menu.
#RintisKarirImpian
Problem Solving
I) Data Aggregation - Calculate Performance of SalesPerson (part 2)
If we click OK, and found this warning in excel, it means the duplicate values
already been removed.
Now we don’t have duplicate values
anymore, but we also need to sort the name
alphabetically using a feature of sort from
the home menu.
#RintisKarirImpian
Problem Solving
I) Data Aggregation - Calculate Performance of SalesPerson (part 3)
Now the data has been sorted by alphabetically and now its time to calculate
the salescount and total sales.
To calculate sales count we can use a function called COUNTIF.
Function : : =COUNTIF(range;criteria)
To calculate total sales we can use a function called SUMIF.
Function : : =SUMIF(range;criteria;[sum_range])
#RintisKarirImpian
Problem Solving
I) Data Aggregation - Calculate Performance of SalesPerson (part 4)
Now we can calculate the performance per
salesperson and label them by 3 categories
(Bad, Good, Excellent). Each category has
specific criteria and we can calculate them
using IF with multiple criteria by combine them
with the function of AND & OR.
Category Criteria (Sales Count) Criteria (Total Sales)
Bad <3 < $ 1.000
Good 3-5 $ 1.000 - $ 3.500
Excellent >= 6 > $ 3.500
Function : : =IF(OR(logical1;logical2);”value_if_true”;if(and(logical1;logical2);”value_if_true”;”value_if_false”))
#RintisKarirImpian
Problem Solving
I) Data Aggregation - Calculate Performance of SalesPerson (part 5)
Now we have the data of the performance from
each sales person. After this, we can answers
all questions from the sales manager.
First question is, what is the best year
of sales?.
We can answer this by making the
aggregation of the total sales per year
and find the maximum number of the
total sales.
#RintisKarirImpian
Problem Solving
J) Question - Number 1 (Best Year of Sales)
First, we calculate the total sales per year by
using a function called SUMIF.
Then we can find which year has the most
amount of sales by using a function called
INDEXMATCH.
Note: INDEX function return the value of the data while MATCH function return the index of the
rows, notice that I use MAX function to return the highest number of sales so that it will return
which row index is having the highest number. After that the INDEX function will find an exact
value corresponding to the row index that we called.
#RintisKarirImpian
Problem Solving
K) Question - Number 2 (Number of “Good” Performing SalesPerson)
We could use a function called COUNTIF to find the number of good
performance.
#RintisKarirImpian
Problem Solving
L) Question - Number 3 (What Year has the Lowest Number of Sales)
We can answer this by making the aggregation of the number of sales per
year and find the lowest number of sales.
First, we calculate the number of sales per year
by using a function called COUNTIF.
Then we can find which year has the least
amount of number of sales by using a function
called INDEXMATCH.
#RintisKarirImpian
Problem Solving
M) Question - Number 4 (Who Generates Revenue the Most)
We can answer this by using a function called INDEXMATCH.
#RintisKarirImpian
Problem Solving
N) Question - Number 5 (What is the Best Selling Item) (Part 1)
Before that, we need to calculate the revenue from each item by using a
function called SUMIF.
#RintisKarirImpian
Problem Solving
N) Question - Number 5 (What is the Best Selling Item) (Part 2)
After that, we can find which item who has the best selling by using a function
called INDEXMATCH.
#RintisKarirImpian
Problem Solving
O) Question - Number 6 (What is the Most Number of Sales by SalesPerson)
We can find the maximum number of sales from salesperson by just using a
function called MAX.
#RintisKarirImpian
Final Result
BEFORE AFTER
ALL ANSWERS
#RintisKarirImpian