Big Mart Sales Prediction using Machine Learning

Authors

  • Koh Ya Wen Asia Pacific University of Technology & Innovation image/svg+xml
  • Minnu Helen Joseph Asia Pacific University of Technology & Innovation image/svg+xml
  • V. Sivakumar Asia Pacific University of Technology & Innovation image/svg+xml

DOI:

https://doi.org/10.4108/eetiot.6453

Keywords:

Big Mart, sales prediction, machine learning, prediction model, regression, linear regression, decision tree, random forest, XGBoost regression, K-nearest neighbours

Abstract

INTRODUCTION: Sales prediction, also known as revenue forecasting or sales forecasting, refers to the process of accurately and timely estimating future revenue for manufacturers, distributors, and retailers, providing them with valuable insights. Sales prediction plays a crucial role in various industries, particularly in sectors such as retail, automotive leasing, real estate transactions, and other conventional businesses.
OBJECTIVES: This paper focuses on developing a sales prediction model for Big Mart, a supermarket chain, using machine learning algorithms. The developed model aims to provide Big Mart with accurate sales forecasts, enabling better decision-making, improved profitability, and enhanced customer service.
METHODS: The study utilises the CRISP-DM methodology and explores various machine learning algorithms, including Linear Regression, Decision Tree, Random Forest, XGBoost, Stacked Ensemble Model, and K-Nearest Neighbours (KNN). The dataset used for model development is sourced from Kaggle and includes information about products, stores, and sales. Pre-processing techniques are applied to handle missing data and feature engineering.
RESULTS: The XGBoost Regression Model Tuned with RandomizedSearchCV outperforms the existing models with an RMSE of 1018.82 and an R² of 0.6181.
CONCLUSION: This research contributes to the field of sales forecasting in the retail industry and provides insights for businesses looking to enhance their revenue prediction capabilities.

Downloads

Download data is not yet available.
<br data-mce-bogus="1"> <br data-mce-bogus="1">

References

Bajaj, P., Ray, R., Shedge, S., Vidhate, S., & Shardoor, N. (2020). Sales prediction using machine learning. International Research Journal of Engineering and Technology (IRJET), 7(6), 3619–3625. https://doi.org/10.1063/5.0078390 DOI: https://doi.org/10.1063/5.0078390

Batta, M. (2018). Machine Learning Algorithms - A Review. International Journal of Science and Research (IJSR), 18(8), 381–386. https://doi.org/10.21275/ART20203995 DOI: https://doi.org/10.21275/ART20203995

Beheshti-kashi, S., Karimi, H. R., Thoben, K., Lütjen, M., & Teucke, M. (2015). A survey on retail sales forecasting and prediction in fashion markets. Systems Science & Control Engineering: An Open Access Journal, 6, 37–41. https://doi.org/10.1080/21642583.2014.999389 DOI: https://doi.org/10.1080/21642583.2014.999389

Boyapati, S. N., & Mummidi, R. (2020). Predicting sales using Machine Learning Techniques. May. https://www.diva-portal.org/smash/get/diva2:1455353/FULLTEXT02

Brij. (2017). BigMart Sales Data. https://www.kaggle.com/datasets/brijbhushannanda1979/bigmart-sales-data

Hotz, N. (2022). What is CRISP DM? - Data Science Process Alliance. Data Science Process Alliance. https://www.datascience-pm.com/crisp-dm-2/

Malik, N., & Singh, K. (2020). Sales Prediction Model for Big Mart. Parichay: Maharaja Surajmal Institute Journal of Applied Research, 3(1), 22–32. https://www.researchgate.net/publication/344099746_SALES_PREDICTION_MODEL_FOR_BIG_MART

Nagar, R., & Singh, Y. (2019). A literature survey on Machine Learning Algorithms. Journal of Emerging Technologies and Innovative Research (JETIR), 6(4), 471–474. https://doi.org/10.22214/ijraset.2021.37969 DOI: https://doi.org/10.22214/ijraset.2021.37969

Niu, Y. (2020). Walmart Sales Forecasting using XGBoost algorithm and Feature engineering. Proceedings - 2020 International Conference on Big Data and Artificial Intelligence and Software Engineering, ICBASE 2020, 458–461. https://doi.org/10.1109/ICBASE51474.2020.00103 DOI: https://doi.org/10.1109/ICBASE51474.2020.00103

Odegua, R. (2020). Applied Machine Learning for Supermarket Sales Prediction. https://www.researchgate.net/publication/338681895

Ray, S. (2019). A Quick Review of Machine Learning Algorithms. International Conference on Machine Learning, Big Data, Cloud and Parallel Computing, 35–39. https://ieeexplore.ieee.org/abstract/document/8862451 DOI: https://doi.org/10.1109/COMITCon.2019.8862451

Sav, R., Shinde, P., & Gaikwad, S. (2021). Big Mart Sales Prediction Using Machine Learning. International Journal of Creative Research Thoughts (IJCRT), 9(6), 674–678. https://ijcrt.org/papers/IJCRT2106802.pdf

Tom, M., Raju, N., Isaac, A., James, J., & R, R. S. (2021). Supermarket Sales Prediction Using Regression. International Journal of Advanced Trends in Computer Science and Engineering, 10(2), 1153–1157. https://doi.org/10.30534/ijatcse/2021/951022021 DOI: https://doi.org/10.30534/ijatcse/2021/951022021

Vengatesan, K., Visuvanathan, E., Kumar, A., Yuvaraj, S., & Tanesh, P. S. (2020). An approach of sales prediction system of customers using data analytics techniques. Advances in Mathematics: Scientific Journal, 9(7), 5049–5056. https://doi.org/10.37418/amsj.9.7.70 DOI: https://doi.org/10.37418/amsj.9.7.70

Downloads

Published

28-06-2024

How to Cite

[1]
K. Y. Wen, M. H. Joseph, and V. Sivakumar, “Big Mart Sales Prediction using Machine Learning”, EAI Endorsed Trans IoT, vol. 10, Jun. 2024.