E-COMMERCE DEMAND FEATURE IDENTIFICATION AND HIGH-PRECISION FORECASTING BASED ON ENSEMBLE CLUSTERING AND REGULARIZED REGRESSION
Keywords:
Hierarchical clustering, Entropy weighting method, LASSO regressionAbstract
To address the issues of uneven distribution of e-commerce warehouse allocation demand and the difficulty in forecasting short-term fluctuations, this study proposes a comprehensive analytical framework that integrates feature extraction, preference identification, and time-series forecasting. Using hierarchical clustering algorithms to construct a clustering tree, combined with Ward’s method, the study divides the national distribution centers into four clusters with similar consumption characteristics. By utilizing the entropy weighting method to quantify the contribution of key indicators to consumption preferences, the study identifies transaction value as the core factor driving the distribution of distribution center demand. Based on this, a 62-dimensional high-dimensional feature space encompassing product attributes, user behavior, and trend characteristics was constructed, and a LASSO regression model with L1 regularization was introduced. By applying an absolute value penalty to the coefficients, this model automatically screened 32 core features. Experimental results demonstrate that this approach delivers superior performance: the R-squared value for the training set reaches 0.9999, and the residuals follow a normal distribution with a mean of 0.03 and exhibit white noise characteristics. In stability tests, the model exhibits exceptional robustness, with a coefficient of variation in R-squared of only 0.09%. This study not only reveals the regional patterns of demand for branch warehouses but also provides a decision-making basis for precise replenishment at the central warehouse over the next 14 days.References
[1] Li Tujin. A Study on the Application of Artificial Intelligence in E-commerce Logistics. Modernization of Commerce, 2025(15): 53-55.
[2] Li Chengyue, Fang Rui, Lin Ziqi, et al. A Study on E-commerce Product Demand Forecasting and Multi-objective Warehouse Allocation Optimization Strategies Based on Historical Data. Journal of Shantou University (Natural Science Edition), 2025, 40(03): 52-66.
[3] Liao Hongfu, Hao Qian. Research on Cutting-Edge Technologies and Innovative Integration Pathways for the Deep Integration of AIGC and RPA E-commerce Robots. Science and Technology Innovation and Productivity, 2025, 46(08): 115-119.
[4] Qiu Yang. A Study on AIGC Empowering Quality Improvement, Efficiency Enhancement, and Innovation in Zhejiang’s Cross-Border E-Commerce SMEs: A Case Study of Yiwu. Marketing World, 2025(15): 175-177.
[5] Cui Tianxu, Ding Rijia, Hua Guowei, et al. How Does Federated Learning Reshape Data Sharing in E-commerce Supply Chains? Methods and Case Studies. Chinese Journal of Management Science, 2026: 1-18.
[6] Jin Yiwén. Path Optimization of Rural E-commerce Logistics and Distribution Empowered by Big Data. Rural Economy and Technology, 2025, 36(14): 201-203.
[7] Yu Shenshen. Artificial Intelligence Aids Precise Marketing of Rural Agricultural Products. Modern Business, 2025(14): 8-11.
[8] Zhu Jing. Research on Big Data-Driven Last-Mile Delivery in Rural E-commerce Logistics. Village Committee Chairperson, 2025(14): 244-246.
[9] Huang Huacheng, Teng Xun, Lin Tianli. A Study on Precision Marketing Strategies for Agricultural E-commerce Based on Big Data Analysis. Marketing World, 2025(14): 76-78.
[10] Dai Yulin. A Study on Strategies and Pathways for Cross-Border E-Commerce to Enhance Supply Chain Resilience in Foreign Trade Enterprises. Modernization of Commerce, 2025(14): 56-61.