Supply Chain Analysis for Cosmetics

The document outlines a study utilizing a dataset from Kaggle to analyze supply chain inefficiencies in a cosmetic start-up in India, focusing on their impact on pricing. It details the dataset's structure, cleaning processes, analytical methodologies, and statistical analyses employed to quantify relationships between operational inefficiencies and product pricing. The study aims to provide data-driven recommendations for optimizing costs and refining pricing strategies.

Uploaded by

Paulina Lopez Sanchez

3. Methodology

3.1 Data

3.1.1 Origin, Timeframe, and Relevance

The dataset was obtained from Kaggle (2025) and represents a cross-sectional
snapshot of the supply chain for a cosmetic start-up in India. It contains 100 records
(rows 2–101) and 24 variables (columns A–X), covering operations in Mumbai,
Kolkata, Delhi, Bangalore, and Chennai.

Since no explicit date fields are provided, the dataset is treated as a single
observation captured on March 1, 2025. Lead time (in days) is the only temporal
dimension, capturing order duration without introducing seasonal bias (Wooldridge,
2016).

The dataset enables the quantification of operational inefficiencies (extended lead
times, elevated shipping and manufacturing costs, and higher defect rates) and their
impact on pricing and profitability across haircare, skincare, and cosmetic portfolios.

3.1.2 Scope and Objectives

This study aims to:

● Quantify the relationship between internal supply-chain inefficiencies and
selling prices.

● Identify which inefficiency factors exert the strongest influence on pricing
structures.

● Provide data-driven recommendations to optimize costs and refine pricing
strategies.

3.1.3 Structure and Definition of Variables

The dataset comprises 24 variables describing operational, logistical, and
commercial aspects of the supply chain. Each variable is classified as
categorical or numerical and is presented in Table 1 with its description and
unit of measurement or, for categorical variables, its set of values.

Table 1. Definition of variables in the dataset

| Field | Type | Description | Unit / Values |
|---|---|---|---|
| Product type | Categorical | Product category | Haircare, Skincare, Cosmetics |
| SKU | Categorical | Unique alphanumeric identifier | n/a |
| Price | Numeric | Sales price | USD |
| Availability | Numeric | Units available in inventory | units |
| Number of products sold | Numeric | Units sold | units |
| Revenue generated | Numeric | Total revenue from sales | USD |
| Customer demographics | Categorical | Customer gender segment | Male, Female, Non-binary, Unknown |
| Stock levels | Numeric | Current inventory level | units |
| Lead times | Numeric | Time from order placement to dispatch | days |
| Order quantities | Numeric | Quantity ordered per purchase | units |
| Shipping times | Numeric | Duration of transportation | days |
| Shipping carriers | Categorical | Shipping company used | Carrier A, Carrier B, Carrier C |
| Shipping costs | Numeric | Cost to ship | USD |
| Supplier name | Categorical | Name of the supplier | n/a |
| Location | Categorical | Supplier city | Mumbai, Kolkata, Delhi, Bangalore, Chennai |
| Lead time | Numeric | Fulfillment lead time | days |
| Production volumes | Numeric | Volume produced during the period | units |
| Manufacturing lead time | Numeric | Duration of the manufacturing process | days |
| Manufacturing costs | Numeric | Cost of manufacturing | USD |
| Inspection results | Categorical | Outcome of quality inspection | Pass, Fail, Pending |
| Defect rates | Numeric | Proportion of defective units | fraction (0–1) |
| Transportation modes | Categorical | Mode of transport | Road, Rail, Air, Sea |
| Routes | Categorical | Transportation route used | Route A, Route B, Route C |
| Costs | Numeric | Total cost associated with the route | USD |

The variables encompass five main dimensions:

● Product and Sales – Product type, SKU, price, availability, number of products
sold, revenue generated, and customer demographics.

● Inventory and Suppliers – Stock levels, lead times, and order quantities.

● Shipping – Shipping times, carriers, and costs.

● Production and Quality – Supplier name, location, production volumes,
manufacturing lead time, manufacturing costs, inspection results, and defect
rates.

● Transportation – Transportation modes and routes, including associated
costs.

3.1.4 Data Cleaning and Preparation

To ensure data quality and consistency, the following steps were applied (Little &
Rubin, 2002; Tukey, 1977):

● Duplicates – Verified SKU uniqueness; no records were removed.

● Missing Values – Imputed using mode for categorical variables and median by
product type for numerical variables.

● Outliers – Detected via boxplots for Price, Shipping Costs, and Lead Times;
records above the 99th percentile were excluded when they distorted
aggregated measures.

● Data Types – Converted cost and price fields to numeric and durations to
integer values. Standardized labels for suppliers, carriers, and routes.

● Derived Fields –

○ Total Cost per Unit = Shipping Costs + Manufacturing Costs

○ Defect Rate (%) = Defect Rate × 100

○ Gross Margin (%) = (Price – Total Cost per Unit) / Price × 100
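
The imputation rule and the derived fields above can be sketched in Python. This is a minimal illustration under stated assumptions, not the workflow used in the study (which relied on Excel and Tableau): the function names `impute_median_by_group` and `derive_fields`, the dictionary keys, and all sample values are hypothetical.

```python
from statistics import median


def impute_median_by_group(rows, group_key, value_key):
    """Fill missing (None) numeric values with the median of the same group.

    Assumes every group has at least one observed value.
    """
    observed = {}
    for row in rows:
        if row[value_key] is not None:
            observed.setdefault(row[group_key], []).append(row[value_key])
    medians = {group: median(values) for group, values in observed.items()}
    return [
        {**row, value_key: medians[row[group_key]]} if row[value_key] is None else dict(row)
        for row in rows
    ]


def derive_fields(record):
    """Return a copy of `record` with the three derived fields added.

    Expects "price", "shipping_costs", "manufacturing_costs" (USD) and
    "defect_rate" (fraction between 0 and 1). Key names are hypothetical.
    """
    total_cost = record["shipping_costs"] + record["manufacturing_costs"]
    out = dict(record)
    out["total_cost_per_unit"] = total_cost
    out["defect_rate_pct"] = record["defect_rate"] * 100
    out["gross_margin_pct"] = (record["price"] - total_cost) / record["price"] * 100
    return out


# Invented sample values, not taken from the dataset.
sample_rows = [
    {"product_type": "skincare", "price": 10.0},
    {"product_type": "skincare", "price": 30.0},
    {"product_type": "skincare", "price": None},
    {"product_type": "haircare", "price": 8.0},
]
imputed = impute_median_by_group(sample_rows, "product_type", "price")

example = derive_fields(
    {"price": 50.0, "shipping_costs": 5.0, "manufacturing_costs": 20.0, "defect_rate": 0.02}
)
print(imputed[2]["price"], example["total_cost_per_unit"], example["gross_margin_pct"])
```

In this invented example, the missing skincare price is filled with the median of the two observed skincare prices, and a product priced at 50 USD with 25 USD of combined shipping and manufacturing cost has a gross margin of 50%.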
3.2 Analytical Methodology

3.2.1 Tools

Tableau Public was used for interactive visualizations and spatial analysis (Tableau
Software, 2024). Excel (Office 365) supported initial data profiling and calculation
verification.

3.2.2 Data Connection and Modeling

The dataset was imported into Tableau using the “Text File” connector. The Data
Interpreter tool was applied to clean headers and remove nested tables. Field names
were standardized, data types verified, and units annotated within the metadata.

3.2.3 Calculations and Parameters

Key calculated fields include:

● Total Cost = [Shipping Costs] + [Manufacturing Costs]

● Defect Rate (%) = [Defect Rate] × 100

● Gross Margin (%) = ([Price] – [Total Cost]) / [Price] × 100

A parameter named Metric was created to toggle views across Total Cost, Lead
Time, and Defect Rate, allowing flexible analysis of cost and process drivers.

3.2.4 Analytical Techniques and Visualizations

Visualization techniques were chosen to reveal cost-price relationships, assess
supplier performance, and identify operational inefficiencies (Kirk, 2016; Few, 2009).

● Cost–Price Relationship – Scatter plots with trend lines to explore price
elasticity and cost impact (Chambers et al., 1983).

● Lead Time and Defect Impact – Bar charts and heat maps comparing lead
times by supplier and location, and scatter plots linking defect rates with
manufacturing costs (Kraak & Ormeling, 2010).

● Route and Carrier Optimization – Boxplots of shipping costs and shipping
times by transportation mode and route to identify cost-saving opportunities.

● Customer Insights and Pricing – Comparative bar charts and heat maps to
analyze revenue, sales volume, and product preferences by customer
demographics.

Where appropriate, exploratory data analysis (Tukey, 1977) informed visualization
design and identification of key trends.

Dashboards & Storytelling

● Interactive dashboards with global filters (Product Type, Customer
Demographics, Location).

● A Tableau Story that guides the reader from high-level KPIs to specific
insights and operational-improvement recommendations (Dean, 2021).

Design and Interaction Notes

All worksheets include global filters (Product Type, Location, Customer
Demographics) to enable dynamic, segmented comparisons (Tableau Software,
2024).

Visual perception principles were applied: diverging color scales for defect rates,
sequential palettes for costs and lead-time metrics, and brand-consistent hues (Few,
2009).

Content is organized into a five-step storytelling dashboard that mirrors the analytical
blocks and culminates in strategic recommendations (Dean, 2021).

3.2.5 Statistical Analysis (Regression Model)

Objective Alignment

In Chapter 1, we stated our purpose “to quantify the relationship between supply
chain inefficiencies and product pricing” and “to identify which inefficiency factors
most significantly affect pricing” (Chapter 1.2). To deliver on that, we estimate
multivariate OLS regressions that link key operational drivers to Price.

Model Specifications:

• Linear Levels
– Dependent variable: Price (USD)
– Independent variables: Lead time (days), Manufacturing lead time (days),
Manufacturing costs (USD), Order quantities (units)
– Estimation: OLS with HC3 robust standard errors (Wooldridge, 2016)

• Log–Log
– Same variables in natural logs, so coefficients are interpreted as elasticities (Gould et al., 2013)
– Also estimated with HC3 robust standard errors
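
The two specifications can be sketched with NumPy as follows. This is an illustrative implementation of OLS with the standard HC3 sandwich estimator, not the procedure the study ran; the function name `ols_hc3`, the synthetic data, and the coefficient values used to generate it are assumptions made for the example.

```python
import numpy as np


def ols_hc3(X, y):
    """OLS coefficients and HC3 robust standard errors.

    X: (n, p) predictors WITHOUT a constant column; y: (n,) response.
    Returns (beta, se), each of length p + 1 with the intercept first.
    """
    y = np.asarray(y, dtype=float)
    X = np.column_stack([np.ones(len(y)), np.asarray(X, dtype=float)])
    xtx_inv = np.linalg.inv(X.T @ X)
    beta = xtx_inv @ X.T @ y
    resid = y - X @ beta
    # Leverage h_ii: diagonal of the hat matrix X (X'X)^-1 X'.
    leverage = np.einsum("ij,jk,ik->i", X, xtx_inv, X)
    # HC3 scales each squared residual by 1 / (1 - h_ii)^2.
    weights = (resid / (1.0 - leverage)) ** 2
    meat = X.T @ (X * weights[:, None])
    cov = xtx_inv @ meat @ xtx_inv
    return beta, np.sqrt(np.diag(cov))


# Synthetic stand-in for the four predictors and Price (values are invented).
rng = np.random.default_rng(0)
X_demo = rng.uniform(1.0, 30.0, size=(100, 4))
y_demo = 60.0 + X_demo @ np.array([0.5, -1.2, -0.2, 0.2]) + rng.normal(0.0, 2.0, size=100)

# Linear-levels specification.
beta_lin, se_lin = ols_hc3(X_demo, y_demo)

# Log-log specification: slopes are read as elasticities.
beta_log, se_log = ols_hc3(np.log(X_demo), np.log(y_demo))
```

The log–log call requires strictly positive values in every column and in the response, which holds for the synthetic data here and would need to be checked on real data.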

Key Results
Table 3.2.5 summarizes the main coefficients, significance, and fit statistics.

Table 3.2.5 Regression Results (N = 100, HC3 SEs)

| Model | Predictor | Coef. | p-value | 95% CI | Adj. R² |
|---|---|---|---|---|---|
| Linear | Constant | 61.9174 | <.001 | [39.14, 84.70] | 0.1492 |
| | Lead time | 0.5191 | 0.1224 | [-0.14, 1.18] | |
| | Manufacturing lead time | -1.2444 | <.001 | [-1.88, -0.61] | |
| | Manufacturing costs | -0.2351 | 0.0209 | [-0.43, -0.04] | |
| | Order quantities | 0.1660 | 0.1704 | [-0.07, 0.40] | |
| Log–Log | Constant | 3.1844 | <.001 | [1.62, 4.75] | 0.1036 |
| | ln Lead time | 0.2143 | 0.0536 | [-0.00, 0.43] | |
| | ln Manufacturing lead time | -0.3096 | 0.0027 | [-0.51, -0.11] | |
| | ln Manufacturing costs | -0.1107 | 0.4045 | [-0.37, 0.15] | |
| | ln Order quantities | 0.2605 | 0.0713 | [-0.02, 0.54] | |

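Model fit comparison: the linear OLS model has the higher adjusted R² (0.1492, AIC = 960.38) compared with the
log–log specification (adjusted R² = 0.1036, AIC = 283.51), so we adopt the level model for final inference.
Because the dependent variables differ (Price versus ln Price), the two AIC values are on different scales
and are not directly comparable, and the adjusted R² comparison should be read as indicative only.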

Execution in Excel

1. Enabled the Data Analysis ToolPak (File → Options → Add-Ins → Analysis
ToolPak).
2. Launched Data Analysis → Regression.
3. Defined the Y Input Range as Price (or LN(Price)) and the X Input Range as
the set of inefficiency variables.
4. Checked “Labels,” “Residuals,” and “Residual Plots.”
5. Generated output—including coefficients, standard errors, t-stats, p-values,
R², and ANOVA—and exported it to a new sheet.
6. Calculated AIC and BIC manually:

$$\mathrm{AIC} = n \ln(\mathrm{SSE}/n) + 2k, \qquad \mathrm{BIC} = n \ln(\mathrm{SSE}/n) + k \ln(n),$$

where $n$ = number of observations and $k$ = number of parameters.
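
As a worked illustration of these two formulas, the sketch below computes both criteria from a residual sum of squares. The function name `aic_bic` and the input values are hypothetical and chosen only to make the arithmetic easy to follow.

```python
import math


def aic_bic(sse, n, k):
    """AIC and BIC from the residual sum of squares (SSE).

    n: number of observations; k: number of estimated parameters.
    """
    base = n * math.log(sse / n)
    return base + 2 * k, base + k * math.log(n)


# Invented inputs: SSE / n = 1, so the log term vanishes.
aic, bic = aic_bic(sse=100.0, n=100, k=5)
print(round(aic, 2), round(bic, 2))  # 10.0 and 23.03
```

With SSE/n = 1 the first term is zero, so AIC reduces to 2k = 10 and BIC to k·ln(n) = 5·ln(100) ≈ 23.03, which shows that BIC penalizes each additional parameter more heavily than AIC whenever ln(n) > 2.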

Diagnostic Tests

To ensure valid inference, the following checks were performed in Excel:

Multicollinearity

● Ran auxiliary regressions of each predictor on the remaining X’s.

● Computed $\mathrm{VIF}_j = 1 / (1 - R_j^2)$, where $R_j^2$ is the R² of the auxiliary regression for predictor $j$.

● Criterion: VIF < 5 indicates acceptable collinearity (Wooldridge, 2016).
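
The auxiliary-regression procedure can be sketched with NumPy as below. This is an illustration rather than the Excel workflow the study used; the function name `vif` and the synthetic columns are assumptions.

```python
import numpy as np


def vif(X):
    """Variance inflation factor for each column of X (n x p, no constant)."""
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    factors = []
    for j in range(p):
        target = X[:, j]
        # Regress predictor j on a constant plus all other predictors.
        others = np.column_stack([np.ones(n), np.delete(X, j, axis=1)])
        coef, *_ = np.linalg.lstsq(others, target, rcond=None)
        resid = target - others @ coef
        r2 = 1.0 - resid @ resid / np.sum((target - target.mean()) ** 2)
        factors.append(1.0 / (1.0 - r2))
    return np.array(factors)


# Synthetic columns: x3 is almost a copy of x1, x2 is unrelated.
rng = np.random.default_rng(1)
x1 = rng.normal(size=200)
x2 = rng.normal(size=200)
x3 = x1 + 0.1 * rng.normal(size=200)
vif_values = vif(np.column_stack([x1, x2, x3]))
flagged = vif_values >= 5  # columns that fail the VIF < 5 criterion
```

In this synthetic case the near-duplicate pair (x1, x3) is flagged while the unrelated column x2 is not.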

Heteroskedasticity (Breusch–Pagan)

● Saved the residuals $\hat{\varepsilon}_i$, squared them, and regressed $\hat{\varepsilon}_i^2$ on the original X’s.

● A significant p-value (< .05) triggers the use of robust standard errors.
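
A minimal sketch of this check follows, using the common n·R² (Lagrange multiplier) form of the Breusch–Pagan test. Whether the study used this form or the F-test of the auxiliary regression is not stated, so that choice, the function name `breusch_pagan_lm`, and the synthetic data are assumptions.

```python
import numpy as np


def breusch_pagan_lm(X, y):
    """Return (LM, df) for the n * R^2 form of the Breusch-Pagan test.

    X: (n, p) predictors without a constant column; y: (n,) response.
    Under homoskedasticity, LM is asymptotically chi-square with df = p.
    """
    y = np.asarray(y, dtype=float)
    Z = np.column_stack([np.ones(len(y)), np.asarray(X, dtype=float)])
    beta, *_ = np.linalg.lstsq(Z, y, rcond=None)
    squared_resid = (y - Z @ beta) ** 2
    # Auxiliary regression of the squared residuals on the same regressors.
    gamma, *_ = np.linalg.lstsq(Z, squared_resid, rcond=None)
    aux_resid = squared_resid - Z @ gamma
    total = np.sum((squared_resid - squared_resid.mean()) ** 2)
    r2 = 1.0 - aux_resid @ aux_resid / total
    return len(y) * r2, Z.shape[1] - 1


# Synthetic data whose error spread grows with x (heteroskedastic by design).
rng = np.random.default_rng(2)
x = rng.uniform(1.0, 10.0, size=500)
y = 2.0 + x + x * rng.normal(size=500)
lm_stat, lm_df = breusch_pagan_lm(x.reshape(-1, 1), y)
```

The statistic is compared with a chi-square critical value at the chosen level: 3.84 for one regressor at the 5% level, or 9.49 for the four regressors in the models above.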

Normality of Residuals

● Created Q-Q plots via Excel chart tools.

● Performed the Jarque–Bera test using skewness and kurtosis formulas.
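
The Jarque–Bera statistic combines sample skewness S and kurtosis K as JB = (n/6)·(S² + (K − 3)²/4) and is asymptotically chi-square with 2 degrees of freedom, for which the p-value has the closed form exp(−JB/2). The sketch below applies these formulas; the function name `jarque_bera` and the sample residuals are hypothetical.

```python
import math


def jarque_bera(residuals):
    """Return (JB, p_value) using population (divide-by-n) moments."""
    n = len(residuals)
    mean = sum(residuals) / n
    m2 = sum((r - mean) ** 2 for r in residuals) / n
    m3 = sum((r - mean) ** 3 for r in residuals) / n
    m4 = sum((r - mean) ** 4 for r in residuals) / n
    skewness = m3 / m2 ** 1.5
    kurtosis = m4 / m2 ** 2
    jb = n / 6.0 * (skewness ** 2 + (kurtosis - 3.0) ** 2 / 4.0)
    # Survival function of a chi-square with 2 degrees of freedom.
    return jb, math.exp(-jb / 2.0)


# Invented residuals: symmetric, so the skewness term is zero.
jb_stat, jb_p = jarque_bera([-2.0, -1.0, 0.0, 1.0, 2.0])
```

The chi-square approximation is asymptotic, so with N = 100 the p-value is a rough guide and is best read alongside the Q-Q plot.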

Model Specification (RESET)

● Added powers of the fitted values to an auxiliary regression.

● Inspected joint significance of additional terms.

These results directly address our Chapter 1 objectives by quantifying which internal
inefficiencies most affect pricing and therefore should be prioritized in cost‐control
and pricing‐strategy initiatives.
REFERENCES

Chambers, J. M., Cleveland, W. S., Kleiner, B., & Tukey, P. A. (1983). Graphical
methods for data analysis. Wadsworth.

Dean, J. (2021). Storytelling with data: A data visualization guide for business
professionals. Wiley.

Gould, W. W., Pitblado, J. R., & Poi, B. P. (2013). Maximum Likelihood Estimation
with Stata (4th ed.). Stata Press.

Kaggle. (2025). Supply Chain Data for Cosmetic Startup in India. Retrieved from
[Link]

Kraak, M.-J., & Ormeling, F. (2010). Cartography: Visualization of spatial data (3rd
ed.). Guilford Press.

Little, R. J. A., & Rubin, D. B. (2002). Statistical analysis with missing data (2nd ed.).
Wiley.

Tableau Software. (2024). Tableau Public user guide [Software manual]. Tableau
Software.

Tukey, J. W. (1977). Exploratory data analysis. Addison-Wesley.

Wooldridge, J. M. (2016). Introductory econometrics: A modern approach (6th ed.).
Cengage Learning.

Van der Aalst, W. (2016). Process mining: Data science in action (2nd ed.). Springer.
