SJIF Impact Factor (2023): 8.574 | ISI I.F. Value: 1.241 | Journal DOI: 10.36713/epra2016 | ISSN: 2455-7838 (Online)

EPRA International Journal of Research and Development (IJRD)
Volume: 8 | Issue: 11 | November 2023 - Peer Reviewed Journal

SUPERMARKET SALES PREDICTION USING MACHINE LEARNING

Chavali Saathvika Durga Abhinaya¹, Bellamkonda Lahari², Chinta Devika Priya³, Devarapalli Anjali⁴, Bathula Sri Navya⁵, B. Sai Jyothi⁶

¹,²,³,⁴,⁵ B. Tech Students, Department of Information Technology, Vasireddy Venkatadri Institute of Technology, Guntur
⁶ Professor, Department of Information Technology, Vasireddy Venkatadri Institute of Technology, Guntur

Article DOI: https://doi.org/10.36713/epra14814

DOI No: 10.36713/epra14814
ABSTRACT
Large supermarkets are increasingly data-driven in today's retail world. These businesses painstakingly analyze sales data for each individual item they stock in order to optimize inventory management and forecast demand. Machine learning techniques are used to detect anomalies and patterns in the stored data.
This data is used to forecast future sales volume, which is critical for merchants such as supermarkets. We present a prediction model for retailers such as supermarkets that uses the XGBoost algorithm to forecast a company's sales. Our findings show that the proposed model outperforms the Lasso and Ridge regression baselines in predictive accuracy, illustrating the value of more sophisticated machine learning approaches in optimizing retail operations. This study provides useful information for improving sales forecasting and inventory management.
KEY WORDS: Regression, Sales, Prediction, Data Exploration, Supermarkets, XGBoost.

1. INTRODUCTION
The management of a supermarket, a large grocery chain with locations across the country, has challenged data scientists to help develop a model that can accurately forecast sales per product for each store. The sales data, obtained from Kaggle, cover a variety of items across numerous outlets in several cities. The company expects that this information will make it possible to identify the products and outlets that are essential to its sales, and to use that knowledge to take appropriate actions toward its business objective: turning a profit at every supermarket by selling more products and maintaining a high turnover rate.

Jupyter Notebook is used as the tool and Python as the programming language. The work is framed as a supervised learning task, specifically regression, with the main aim of forecasting a company's future retail sales. The techniques used include data collection, feature engineering, data preprocessing, model building, and evaluation.

Supervised learning helps in understanding the data flow, sale pricing, and related patterns. Regression analysis uses a variety of techniques to forecast retail sales, and involves tasks such as data cleansing, data transformation, and visualization. The XGBoost algorithm is employed.

In this study, we used the XGBoost approach to create a prediction model and tested it on the Supermarket dataset to predict the sales of a product at a particular outlet.

OBJECTIVES OF OUR WORK

1. Examining the items' prior sales data
2. Recognizing the factors that influence a product's sales
3. Drawing conclusions about those sales
4. Computing future sales from the data and making predictions

5. Helping businesses increase or decrease product inventories appropriately

2. RELATED WORK
Regression models are used to predict crime, health outcomes, home values, and sales, among other things; XGBoost, for example, has been used for cardiovascular risk assessment. Sales forecasting has been applied to predict the sales of products sold at several Big Mart outlets. As more items are produced and the regions served expand, forecasting by hand becomes increasingly difficult. Here, Python is the programming language and Jupyter Notebook the tool, and supervised machine learning, specifically regression, is employed, mainly to forecast the company's future store sales.

The techniques involved include data processing, feature engineering, model design, and testing. The regression task forecasts sales using a number of algorithms, and it requires work on data identification, cleaning, and transformation. Because the profits generated by a business are closely tied to accurate sales projections, supermarkets want a reliable forecasting method so that the firm incurs no losses. Our experiments indicate that our method produces more accurate forecasts than alternative techniques such as decision trees.

3. PROPOSED MODEL
Description of the supermarket sales dataset: the dataset is named "SUPERMARKET" and is made up of several attributes. Item_Outlet_Sales is the response variable, while the other attributes are used mostly as predictors. The data cover diverse items sold at outlets in several cities.

Advantages of proposed model

➢ Improved pricing accuracy, which helps supermarkets set competitive prices for their products and so maximize revenue and profit.
➢ Feature importance: XGBoost provides insights into the most important features affecting pricing, helping supermarkets make data-driven decisions.
➢ Performance: XGBoost is optimized for speed, making it capable of real-time or near-real-time predictions, which is vital in a dynamic retail environment.
➢ Flexibility: XGBoost's support for hyperparameter tuning allows models to be fine-tuned to the specific needs of a supermarket sales price prediction system.
➢ Robustness: XGBoost handles missing values natively, and its tree-based splits are relatively insensitive to outliers in the input features; both are common challenges in supermarket datasets.
3.1 The Data
1. Item_Weight: This feature represents the weight of the item being sold. It's typically measured in units like kilograms
or pounds.
2. Item_Fat_Content: This feature describes the fat content of the item. It has multiple categories such as "Low Fat," "Regular," "LF," "low fat," and "reg." These inconsistent labels must be consolidated during preprocessing.
3. Item_Visibility: This feature indicates how prominently the item is displayed in the store. It might be measured as a
percentage or another numeric value.

4. Item_Type: This feature categorizes the item into types such as "Baking Goods," "Dairy," "Frozen Foods," and so on.
5. Item_MRP: This is the Maximum Retail Price (MRP) of the item. It represents the highest price at which the item can
be sold.
6. Outlet_Identifier: Each outlet has a unique identifier, and this feature represents that. Different outlets may have distinct characteristics.
7. Outlet_Establishment_Year: This feature represents the year when each outlet was established. It's important for
understanding the age of the outlet.
8. Outlet_Size: This feature describes the size of the retail outlet, categorized as "High," "Medium," or "Small."
9. Outlet_Location_Type: It indicates the location of the outlet, such as "Tier 1," "Tier 2," or "Tier 3." These categories
might signify different levels of urbanization or geographical areas.
10. Outlet_Type: This feature tells us the type of retail outlet, such as "Grocery Store" or different types of supermarkets.
11. Item_Outlet_Sales: This is the target variable we want to predict. It represents the sales of the item in the outlet and is used to train and evaluate the predictive model.
The dataset contains a mix of numerical and categorical features. We performed preprocessing steps to handle missing values, encode categorical variables, and scale numerical variables.
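The inconsistent Item_Fat_Content labels listed above can be collapsed with a simple mapping. The following is a minimal sketch, assuming the data are held in a pandas DataFrame; the sample rows and the choice of "Low Fat" and "Regular" as canonical labels are illustrative assumptions, not taken from the authors' notebook.

```python
import pandas as pd

# Hypothetical sample rows; the real data would be read from the Kaggle CSV.
df = pd.DataFrame(
    {"Item_Fat_Content": ["Low Fat", "LF", "low fat", "Regular", "reg"]}
)

# Map each variant spelling onto one of two canonical labels (assumed choice).
FAT_CONTENT_MAP = {"LF": "Low Fat", "low fat": "Low Fat", "reg": "Regular"}
df["Item_Fat_Content"] = df["Item_Fat_Content"].replace(FAT_CONTENT_MAP)

# Only the two canonical labels should remain.
print(df["Item_Fat_Content"].value_counts())
```

Doing this before encoding matters: otherwise "LF" and "Low Fat" would be treated as two different categories.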

3.2 Data Preprocessing

Handling missing values
We loaded and explored the provided dataset, including both the training and the testing data. During this investigation, missing values were identified in two key columns: Item_Weight and Outlet_Size. The following steps were taken to resolve this issue:
- Missing values in the "Item_Weight" column were filled with the mean of the column.
- Missing values in the "Outlet_Size" column were filled with the mode.
After this step, no values are missing from the records.
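The two imputation steps above can be sketched as follows. This is a minimal illustration on a hypothetical four-row sample, assuming pandas; the values are invented and only the column names come from the dataset description.

```python
import numpy as np
import pandas as pd

# Hypothetical sample; column names follow the dataset description.
df = pd.DataFrame(
    {
        "Item_Weight": [9.3, np.nan, 17.5, 8.2],
        "Outlet_Size": ["Medium", None, "Medium", "Small"],
    }
)

# Numeric column: fill gaps with the column mean.
df["Item_Weight"] = df["Item_Weight"].fillna(df["Item_Weight"].mean())

# Categorical column: fill gaps with the most frequent value (the mode).
df["Outlet_Size"] = df["Outlet_Size"].fillna(df["Outlet_Size"].mode()[0])

# Count of remaining missing values per column.
print(df.isnull().sum())
```

When separate training and testing files are used, a common precaution is to compute the mean and mode on the training data only and reuse them for the test data, so that no information leaks from the test set.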
To gain a deeper understanding of the data, we conducted an exploratory data analysis, using libraries including `pandas-profiling`, `klib`, and `seaborn`, to:
- Visualize data distributions, correlations, and patterns.
- Uncover insights into the dataset's characteristics.

3.3 Feature Engineering

As part of feature engineering, we performed the following actions:
- Dropped unnecessary columns, including 'Item_Identifier' and 'Outlet_Identifier.'
- Applied label encoding to convert categorical variables into numerical representations.
- Split the data into training and testing sets and standardized the features.
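The three feature engineering steps can be sketched as below. This is a minimal illustration using only pandas and NumPy on an invented five-row sample; the 80/20 split ratio, the random seed, and the choice of `Item_Type` as the only categorical column are assumptions made for the example.

```python
import numpy as np
import pandas as pd

# Hypothetical miniature dataset; values are invented for illustration.
df = pd.DataFrame(
    {
        "Item_Identifier": ["A1", "B2", "C3", "D4", "E5"],
        "Outlet_Identifier": ["O1", "O2", "O1", "O3", "O2"],
        "Item_Type": ["Dairy", "Soft Drinks", "Meat", "Dairy", "Household"],
        "Item_MRP": [250.0, 50.0, 140.0, 180.0, 55.0],
        "Item_Outlet_Sales": [3700.0, 440.0, 2100.0, 730.0, 990.0],
    }
)

# 1. Drop the identifier columns.
df = df.drop(columns=["Item_Identifier", "Outlet_Identifier"])

# 2. Label-encode categorical columns: each category becomes an integer code.
categorical_cols = ["Item_Type"]
for col in categorical_cols:
    df[col] = df[col].astype("category").cat.codes

# 3. Split into training and testing sets (80/20, fixed seed).
rng = np.random.default_rng(0)
idx = rng.permutation(len(df))
n_test = max(1, int(0.2 * len(df)))
test_idx, train_idx = idx[:n_test], idx[n_test:]

X = df.drop(columns="Item_Outlet_Sales").to_numpy(dtype=float)
y = df["Item_Outlet_Sales"].to_numpy(dtype=float)
X_train, X_test = X[train_idx], X[test_idx]
y_train, y_test = y[train_idx], y[test_idx]

# 4. Standardize with statistics computed on the training set only.
mean, std = X_train.mean(axis=0), X_train.std(axis=0)
std[std == 0] = 1.0  # guard against constant columns
X_train = (X_train - mean) / std
X_test = (X_test - mean) / std
```

Label encoding imposes an arbitrary ordering on the categories. Tree-based models such as XGBoost are fairly tolerant of this, but for linear models such as Lasso and Ridge it can distort the fit.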

3.4 Model Building

For modeling, we trained regression models, including XGBoost, and tuned the hyperparameters of the XGBoost model using grid search. To evaluate the models' performance, we used metrics such as RMSE and R-squared.
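Grid search simply fits the model once for every combination of candidate hyperparameter values and keeps the combination with the best validation score. The sketch below shows the idea with a small stand-in model (polynomial ridge regression) on synthetic data, so that it runs with NumPy alone. The model, the data, and the grid values are all assumptions for illustration. The paper does not state which XGBoost hyperparameters were tuned; with XGBoost the grid would typically cover parameters such as `max_depth`, `learning_rate`, and `n_estimators`.

```python
import itertools
import numpy as np

def rmse(y_true, y_pred):
    """Root mean squared error between two arrays."""
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

def fit_predict(x_train, y_train, x_val, degree, lam):
    """Stand-in model: polynomial ridge regression solved in closed form."""
    def design(x):
        # Columns 1, x, x^2, ..., x^degree
        return np.vander(x, degree + 1, increasing=True)
    A = design(x_train)
    w = np.linalg.solve(A.T @ A + lam * np.eye(degree + 1), A.T @ y_train)
    return design(x_val) @ w

# Synthetic data (assumed for illustration): a noisy quadratic.
rng = np.random.default_rng(0)
x = np.linspace(-1.0, 1.0, 40)
y = 1.0 + 2.0 * x - 3.0 * x ** 2 + rng.normal(scale=0.05, size=x.size)
x_train, y_train = x[::2], y[::2]
x_val, y_val = x[1::2], y[1::2]

# Every combination in the grid is fitted and scored on the validation set.
param_grid = {"degree": [1, 2, 3], "lam": [0.01, 1.0, 100.0]}
results = []
for degree, lam in itertools.product(param_grid["degree"], param_grid["lam"]):
    score = rmse(y_val, fit_predict(x_train, y_train, x_val, degree, lam))
    results.append(({"degree": degree, "lam": lam}, score))

# Keep the combination with the lowest validation RMSE.
best_params, best_score = min(results, key=lambda item: item[1])
print(best_params, round(best_score, 4))
```

The cost grows multiplicatively with the number of values per hyperparameter (here 3 × 3 = 9 fits), which is why grids are usually kept small.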

4. ALGORITHMS USED
4.1 Lasso Regression
Lasso stands for Least Absolute Shrinkage and Selection Operator. Ordinary linear regression assumes a linear relationship between the input variables and the target variable. Lasso regression is linear regression with an L1 penalty, which shrinks the coefficients of input variables that contribute little to the prediction. The L1 penalty allows some coefficients to become exactly zero, which removes those input variables from the model and so performs automatic feature selection. The Lasso objective is the residual sum of squares plus λ times the sum of the absolute values of the coefficients, where λ controls the degree of shrinkage:
$$\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^2 + \lambda\sum_{j=1}^{p}\left|\beta_j\right|$$
Here ŷ_i is the linear model's prediction for observation i and β_j is the coefficient of the j-th input variable. With λ = 0, all features are retained and the model reduces to ordinary linear regression, in which only the sum of squares is minimized. As λ → ∞, all coefficients are driven to zero and no features are retained. As λ increases, the bias increases; as λ decreases, the variance increases.
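To make the feature-selection effect of the L1 penalty concrete, the following sketch minimizes the Lasso objective (sum of squares plus λ times the sum of absolute coefficients) by coordinate descent with soft-thresholding. It is an illustrative implementation on synthetic data, not the solver used in the study; the function names, the data, and the λ values are assumptions, and the intercept is omitted for brevity.

```python
import numpy as np

def soft_threshold(z, gamma):
    """Shrink z toward zero by gamma; values within [-gamma, gamma] become 0."""
    return np.sign(z) * max(abs(z) - gamma, 0.0)

def lasso_coordinate_descent(X, y, lam, n_iter=200):
    """Minimize sum((y - X @ beta)**2) + lam * sum(|beta|), no intercept."""
    n, p = X.shape
    beta = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0)
    for _ in range(n_iter):
        for j in range(p):
            # Residual with feature j's current contribution removed.
            r_j = y - X @ beta + X[:, j] * beta[j]
            rho = X[:, j] @ r_j
            # Exact one-dimensional minimizer of the objective in beta[j].
            beta[j] = soft_threshold(rho, lam / 2.0) / col_sq[j]
    return beta

# Synthetic data (assumed): only the first of three features matters.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
y = 3.0 * X[:, 0] + rng.normal(scale=0.1, size=100)

beta_no_penalty = lasso_coordinate_descent(X, y, lam=0.0)   # plain least squares
beta_sparse = lasso_coordinate_descent(X, y, lam=50.0)      # irrelevant features dropped
beta_all_zero = lasso_coordinate_descent(X, y, lam=1e6)     # penalty dominates

print(beta_no_penalty, beta_sparse, beta_all_zero)
```

With a moderate λ the coefficients of the two irrelevant features are set exactly to zero while the relevant coefficient is only slightly shrunk, which is the behaviour described above.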

4.2 Ridge Regression

Ridge regression is a common regression technique for estimating the coefficients of an equation that has no unique solution. This difficulty of selecting among many candidate solutions is common in machine learning, particularly when little data is available. Ridge regression is a well-known and widely used variation of linear regression that adds an L2 penalty on the coefficients. It stands out because it addresses one of the major problems of traditional linear regression: multicollinearity.

Multicollinearity often occurs in supermarket sales forecasting because independent factors such as seasonal trends, promotions, and area demographics are interrelated. This can lead to unstable and unreliable regression results. Ridge regression's ability to manage multicollinearity makes it a very useful tool in this situation. The data are divided into three sets: a training set, a validation set, and a test set. The model is trained on the training set, after which it can be used to produce predictions; the test set is used to evaluate the ML algorithms.
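The effect of the ridge penalty under multicollinearity can be illustrated with the closed-form solution β = (XᵀX + λI)⁻¹Xᵀy. The sketch below is a minimal NumPy example on synthetic data with two nearly identical features; the data, the λ value, and the omission of an intercept are assumptions made for illustration.

```python
import numpy as np

def ridge_fit(X, y, lam):
    """Closed-form ridge solution (X^T X + lam * I)^(-1) X^T y, no intercept."""
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)

# Synthetic data (assumed): x2 is almost an exact copy of x1.
rng = np.random.default_rng(0)
x1 = rng.normal(size=200)
x2 = x1 + rng.normal(scale=1e-3, size=200)
X = np.column_stack([x1, x2])
y = x1 + x2 + rng.normal(scale=0.1, size=200)

beta_ols = ridge_fit(X, y, lam=0.0)    # ordinary least squares
beta_ridge = ridge_fit(X, y, lam=1.0)  # ridge with a modest penalty

print("OLS:  ", beta_ols)
print("Ridge:", beta_ridge)
```

Because the two columns are nearly collinear, least squares can only pin down the sum of the two coefficients, and the individual values tend to be unstable. The penalty resolves the ambiguity by preferring the smallest coefficients, which here places both close to 1.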

4.3 XGBoost Algorithm

XGBoost is one of the most widely used machine learning algorithms today for both regression and classification tasks. It is an implementation of gradient-boosted decision trees designed for performance and speed, and it often produces better results than other machine learning algorithms on structured (tabular) data, for which it has become a state-of-the-art technique. XGBoost is distributed as an open-source gradient boosting library that can be downloaded and installed on a local computer.
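The core idea of gradient-boosted trees, fitting each new tree to the errors left by the trees before it, can be shown in a few lines. The sketch below uses depth-1 trees (stumps) and squared loss on synthetic one-dimensional data. It is a simplified illustration only and is not XGBoost itself, which additionally uses a regularized objective, second-order gradient information, and many engineering optimizations. All names, data, and settings here are assumptions.

```python
import numpy as np

def fit_stump(x, residual):
    """Find the threshold split on a 1-D feature that minimizes squared error."""
    best = None
    for t in np.unique(x)[:-1]:  # excluding the max keeps the right side non-empty
        left, right = residual[x <= t], residual[x > t]
        sse = ((left - left.mean()) ** 2).sum() + ((right - right.mean()) ** 2).sum()
        if best is None or sse < best[0]:
            best = (sse, t, left.mean(), right.mean())
    return best[1:]  # (threshold, left_value, right_value)

def boost(x, y, n_rounds=100, learning_rate=0.1):
    """Gradient boosting for squared loss: each stump fits the current residuals."""
    base = y.mean()
    pred = np.full_like(y, base, dtype=float)
    stumps = []
    for _ in range(n_rounds):
        t, left_value, right_value = fit_stump(x, y - pred)
        # Add a damped copy of the stump's prediction to the ensemble.
        pred += learning_rate * np.where(x <= t, left_value, right_value)
        stumps.append((t, left_value, right_value))
    return base, stumps, pred

# Synthetic nonlinear target (assumed for illustration).
x = np.linspace(0.0, 10.0, 100)
y = np.sin(x)

base, stumps, pred = boost(x, y)
mse_before = float(np.mean((y - base) ** 2))  # error of predicting the mean
mse_after = float(np.mean((y - pred) ** 2))   # error of the boosted ensemble
print(mse_before, mse_after)
```

For squared loss the residuals are the negative gradient of the loss, which is where the name gradient boosting comes from. The learning rate damps each step, trading more rounds for a smoother fit.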

XGBoost (short for Extreme Gradient Boosting) has gained wide popularity for its predictive capabilities. It processes complex and diverse datasets efficiently, which makes it a good fit for supermarket sales forecasting. The goal of this research is to use XGBoost to create a reliable and accurate model for predicting sales in the food retail industry. Like any other retail business, supermarkets face many factors that affect sales, including seasonality, geography, and marketing. As an ensemble learning method, XGBoost is well suited to such problems: it can identify subtle, nonlinear relationships and patterns that traditional linear models fail to capture. This research uses XGBoost to develop a model that predicts product sales across multiple supermarkets. Such a model could improve retailers' ability to make data-driven decisions, manage inventory effectively, and improve overall performance. Beyond helping retailers, the project serves as an example of how modern machine learning algorithms can be applied to difficult real-world problems.

5. RESULTS
This section presents the results of the various models. They were obtained by applying Lasso regression, Ridge regression, and XGBoost to the supermarket training and testing data.

5.1 Performance Metric

We use the mean absolute error (MAE) when evaluating the models: the lower the MAE, the better the model.
The performance metrics were chosen because the task is a regression task. MAE, like RMSE and R-squared, is a tested and reliable metric that provides a good measure of model performance.

5.1.1 Mean Absolute Error

The mean absolute error (MAE) measures the difference between two continuous variables. Assume X and Y are variables holding paired observations, where X is the known (observed) value and Y is the value predicted by the machine learning model.
The MAE is the average vertical distance between each observed point and its predicted point.
It is calculated with the following formula:
$$\mathrm{MAE} = \frac{1}{n}\sum_{i=1}^{n}\left|y_i - x_i\right|$$
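As a worked example of the MAE formula, the sketch below computes it in plain Python. The function name and the sample values are invented for illustration.

```python
def mean_absolute_error(x, y):
    """MAE between known values x and predicted values y."""
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("x and y must be non-empty and of equal length")
    return sum(abs(yi - xi) for xi, yi in zip(x, y)) / len(x)

# Hypothetical sales figures: absolute errors are 10, 10 and 30.
observed = [100.0, 200.0, 300.0]
predicted = [110.0, 190.0, 330.0]

mae = mean_absolute_error(observed, predicted)
print(mae)  # (10 + 10 + 30) / 3
```

MAE is expressed in the same units as the target, so here it reads directly as the average size of the sales forecasting error.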

5.1.2 RMSE
Root mean square error (RMSE) is a commonly used statistic for assessing the accuracy of predictive models such as regression models. It estimates how well the model's predictions match the observed values by quantifying the average size of the error between predicted and actual values. A lower RMSE indicates a better fit of the model to the data. The RMSE formula is as follows:
$$\mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^2}$$
Where,
• n is the total number of data points or observations.
• y_i is the actual (observed) value of the i-th observation.
• ŷ_i is the value predicted by the model for the i-th observation.
• ∑ denotes the summation of the squared differences between actual and predicted values.
• Finally, the square root of the entire expression is taken.
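The RMSE formula can be checked with a short plain-Python sketch. The function name and the sample values are invented for illustration.

```python
import math

def root_mean_squared_error(y_true, y_pred):
    """RMSE between actual values y_true and predicted values y_pred."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(a - p) ** 2 for a, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))

# Hypothetical values: errors are 1, 0, 1 and 4.
actual = [3.0, 5.0, 7.0, 9.0]
predicted = [2.0, 5.0, 8.0, 13.0]

rmse = root_mean_squared_error(actual, predicted)
mae = sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)
print(rmse, mae)  # sqrt(18 / 4) and 6 / 4
```

Because the errors are squared before averaging, the single large error of 4 dominates, and the RMSE comes out larger than the MAE for the same predictions.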

5.1.3 R-square method

R-squared (R²) is a statistical measure of the proportion of variance in a dependent variable that is explained by the independent variables in the regression model. Correlation describes the strength of the relationship between independent and dependent variables; R-squared describes the extent to which the variance in one variable explains the variance in a second variable. So, if the model's R² is 0.50, approximately half of the observed variation can be explained by the model inputs.
The formula for calculating R-squared is:
$$R^2 = 1 - \frac{SSR}{SST}$$
where SSR = Σ(y_i − ŷ_i)² is the sum of squared residuals and SST = Σ(y_i − ȳ)² is the total sum of squares about the mean ȳ.
• For a least-squares model evaluated on its own training data, R² ranges from 0 to 1; on held-out data it can be negative when the model fits worse than predicting the mean.
The meanings of the various R-squared values are as follows:
• R² = 1: the model perfectly explains the variation in the data.
• R² = 0: the model does not explain any of the variation in the data.
• 0 < R² < 1: the model explains some of the variation in the data, and higher values indicate a better fit.
It is important to note that R-squared provides information about goodness of fit, but it does not necessarily indicate the overall quality of the model or its ability to make accurate predictions.
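The special cases listed above can be verified numerically. The sketch below is a plain-Python illustration of R² = 1 − SSR/SST; the function name and the sample values are invented.

```python
def r_squared(y_true, y_pred):
    """R-squared = 1 - SSR/SST for actual values y_true and predictions y_pred."""
    mean_y = sum(y_true) / len(y_true)
    ssr = sum((a - p) ** 2 for a, p in zip(y_true, y_pred))  # squared residuals
    sst = sum((a - mean_y) ** 2 for a in y_true)             # total variation
    return 1.0 - ssr / sst

actual = [2.0, 4.0, 6.0, 8.0]  # mean is 5, SST is 20

r2_perfect = r_squared(actual, actual)                # SSR = 0
r2_mean = r_squared(actual, [5.0, 5.0, 5.0, 5.0])     # SSR = SST
r2_good = r_squared(actual, [3.0, 4.0, 6.0, 7.0])     # SSR = 2
r2_bad = r_squared(actual, [8.0, 6.0, 4.0, 2.0])      # SSR = 80

print(r2_perfect, r2_mean, r2_good, r2_bad)
```

The last case shows that predictions worse than simply guessing the mean produce a negative value.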

Error Measurements & R-Squared:

The table below shows the RMSE and R-squared results. XGBoost performs best of the three, with the lowest RMSE (1031.61) and the highest R-squared (0.608451). The Lasso and Ridge models are nearly identical, with Lasso's RMSE only slightly lower (1207.25 versus 1209.34).

Algorithm            RMSE                  R-squared

XGBoost              1031.6085175933238    0.60845185009
Lasso Regression     1207.2491022080023    0.46377245678
Ridge Regression     1209.3436327744663    0.46190597103

6. CONCLUSION
This project explains the fundamentals of machine learning, along with the related data processing and modeling methods, and applies them to forecasting the sales of various supermarket products. Among the factors examined, the outlet with the highest sales was medium-sized, suggesting that other outlets might adopt a comparable format to boost sales. Further parameters and additional factors can be incorporated to forecast sales more effectively and in new ways.

In prediction systems, accuracy is crucial and may improve considerably as more parameters are used. In addition, the way the sub-models operate may improve the system's efficiency.

Since the accuracy of the sales estimates directly relates to the profit made, the big stores strive to make accurate predictions to
prevent losses for the business.

In this study, we developed a model using the XGBoost method and compared it with Lasso regression and Ridge regression on the supermarket sales dataset, estimating a product's sales at a particular outlet. Experiments confirm that our approach, XGBoost, yields more accurate predictions than the alternative methods, with an R-squared of about 0.61 against about 0.46 for both Lasso and Ridge.

No ratings yet
Short-Term Wind Speed Forecasting by An Adaptive Network-Based Fuzzy Inference System (ANFIS) : An Attempt Towards An..
11 pages
FF29
No ratings yet
FF29
57 pages
Customer Churn Prediction Using Machine Learning Techniques: The Case of Lion Insurance
No ratings yet
Customer Churn Prediction Using Machine Learning Techniques: The Case of Lion Insurance
14 pages
Design and Analysis of Mixed Flow Pump Impeller
No ratings yet
Design and Analysis of Mixed Flow Pump Impeller
5 pages
Correlational Studies & Scatterplots
No ratings yet
Correlational Studies & Scatterplots
12 pages
Concept Bottleneck Models
No ratings yet
Concept Bottleneck Models
19 pages
2017 - Van Schaik Risk Perception
No ratings yet
2017 - Van Schaik Risk Perception
43 pages
A Model To Predict The Performance of Roadheaders Based On The Rock Mass Brittleness Index
No ratings yet
A Model To Predict The Performance of Roadheaders Based On The Rock Mass Brittleness Index
10 pages


XGBoost yields more accurate predictions than the alternative methods.
