Research on e-commerce user stratification and coupon intelligent placement optimisation based on multi-behavioural RFM and deep reinforcement learning
DOI:
https://doi.org/10.54097/46xab844Keywords:
E-commerce platform, Customer segmentation, Multi-behavior RFM, Deep reinforcement learning, Intelligent coupon delivery.Abstract
With the ongoing accumulation of user behavioral data on e-commerce platforms, the precision of coupon allocation based on individual characteristics has emerged as a pivotal challenge for enhancing both resource allocation efficiency and marketing performance. Addressing the limitations of the traditional RFM model—particularly its reliance on single-dimensional user stratification and inability to capture complex interactive behaviors—this study introduces a novel customer segmentation approach that integrates a multi-behavioral RFM model with an enhanced self-organizing map (SOM) algorithm. This integrated method enables a more comprehensive identification of user value and facilitates fine-grained segmentation. Building on this stratification, an intelligent coupon delivery optimization framework powered by deep reinforcement learning (DRL) is further developed. In this framework, user stratification outcomes are embedded into the state space alongside real-time behavioral data, allowing dynamic learning of individual response patterns through a state-action-reward mechanism. Consequently, the system autonomously generates personalized and optimized coupon distribution strategies. Experimental results demonstrate that, under budgetary constraints, the proposed approach significantly increases the activity levels and repurchase frequencies of key users. Furthermore, it substantially outperforms static and random delivery strategies across critical metrics, including coupon conversion rate, delivery efficiency, and overall marketing effectiveness. These findings not only confirm the method’s flexibility and adaptability in practical e-commerce settings but also offer a viable pathway and robust theoretical support for advancing precision marketing and optimizing resource allocation.
Downloads
References
[1] Yin Chunli, Han Jinglong. Dynamic pricing model of e-commerce platforms based on deep reinforcement learning [J]. Computer Modeling in Engineering & Sciences, 2021, 127 (1): 291–307.
[2] Zhang Qi, et al. Optimization of dynamic pricing models for consumer segmentation markets and analysis of big data-driven marketing strategies [J]. Journal of Organizational and End User Computing (JOEUC), 2025, 37 (1): 1–33.
[3] Li Cheng, Chu Min, Zhou Chen, et al. Two-period discount pricing strategies for an e-commerce platform with strategic consumers [J]. Computers & Industrial Engineering, 2020, 147: 106640.
[4] Wang Yihua, Minner Stefan. Deep reinforcement learning for demand fulfillment in online retail[J]. International Journal of Production Economics, 2024, 269: 109133.
[5] Wei Zhehuan, Yan Liang, Zhang Chunxi. Optimization strategies in consumer choice behavior for personalized recommendation systems based on deep reinforcement learning [J]. Journal of Organizational and End User Computing (JOEUC), 2025, 37 (1): 1–35.
[6] Islam Md Rafiqul, Shawon Reza E Rabbi, Sumsuzoha Md. Personalized marketing strategies in the US retail industry: leveraging machine learning for better customer engagement [J]. International Journal of Machine Learning Research in Cybersecurity and Artificial Intelligence, 2023, 14 (1): 750–774.
[7] Tang Zhipeng, et al. E-commerce platforms offer different free shipping coupon services: surplus or loss? [J]. Journal of Systems Science and Complexity, 2025: 1–28.
[8] Day Min-Yuh, Yang Ching-Ying, Ni Yensen. Portfolio dynamic trading strategies using deep reinforcement learning [J]. Soft Computing, 2024, 28 (15): 8715–8730.
[9] Zhong Xingyu, et al. Deep reinforcement learning for dynamic strategy interchange in financial markets [J]. Applied Intelligence, 2025, 55 (1): 1–19.
[10] Hu Li, Zhang Mengwei, Wen Xin. Optimal distribution strategy of coupons on e-commerce platforms: sufficient or scarce? [J]. International Journal of Production Economics, 2023, 266: 109031.
[11] Zhou Shuang, Hudin Norlaile Salleh. Advancing e-commerce user purchase prediction: integration of time-series attention with event-based timestamp encoding and graph neural network-enhanced user profiling [J]. PLOS ONE, 2024, 19 (4): e0299087.
[12] Gabryel Marcin, et al. Accelerating user profiling in e-commerce using conditional GAN networks for synthetic data generation [J]. Journal of Artificial Intelligence and Soft Computing Research, 2024, 14 (4): 309–319.
[13] Liu Yishu, Hou Jia, Zhao Wei. Deep learning and user consumption trends classification and analysis based on shopping behavior [J]. Journal of Organizational and End User Computing (JOEUC), 2024, 36 (1): 1–23.
[14] Liu Yishu, Chen Chen. Improved RFM model for customer segmentation using hybrid meta-heuristic algorithm in medical IoT applications [J]. International Journal on Artificial Intelligence Tools, 2022, 31 (1): 2250009.
[15] Liao Juan, et al. Multi-behavior RFM model based on improved SOM neural network algorithm for customer segmentation [J]. IEEE Access, 2022, 10: 122501–122512.
[16] Rungruang Chongkolnee, et al. RFM model customer segmentation based on hierarchical approach using FCA [J]. Expert Systems with Applications, 2024, 237: 121449.
[17] Li Liangwei, et al. Spending money wisely: online electronic coupon allocation based on real-time user intent detection [C]// Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 2020.
[18] Uehara Yuki, et al. Robust portfolio optimization model for electronic coupon allocation [J]. INFOR: Information Systems and Operational Research, 2024, 62 (4): 646–660.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Highlights in Business, Economics and Management

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







