Gu Hengsheng
School of Economics & Management, Tongji University, Shanghai, 200092, China.
Sci Rep. 2025 Jul 1;15(1):20942. doi: 10.1038/s41598-025-06180-w.
This study aims to explore the enterprise fission path optimization strategy based on the Soft Actor-Critic (SAC) algorithm and analyze its impact on the parent company and the overall operational efficiency. Firstly, the enterprise finance and marketing data provided by the National Bureau of Statistics public dataset are used for data pre-processing. Secondly, a multi-level reward function is designed that covers short-term financial and market indices. Meanwhile, it incorporates long-term indices that measure dynamic capabilities, such as innovation, market agility, and resource integration. Finally, by introducing the reinforcement learning algorithm of SAC, the enterprise fission scenario is constructed into a complicated decision environment, in which the state space includes the current financial situation, market performance, and dynamic capability level of the enterprise. The action space encompasses various strategic choices of enterprise fission to simulate the enterprise fission decision process. The SAC algorithm's entropy regularization feature prompts the model to strike a balance between exploration and utilization to optimize the dynamic capability construction. The experimental results show that the fission path optimized by deep reinforcement learning (DRL) markedly improves the resource allocation efficiency and market response speed by an average of 20.4% and 25.2%, respectively. More importantly, dynamic capability construction has been significantly enhanced, with the innovation capability index increasing by 15.4%, market agility improving by 12.3%, and resource integration capability also enhancing by 10.5%. This indicates that the strategy can help accelerate the formation of industrial clusters. Therefore, the SAC algorithm-based enterprise fission path optimization strategy constructed in this study can bring lasting competitive advantages to enterprises.
本研究旨在探索基于软演员-评论家(SAC)算法的企业裂变路径优化策略,并分析其对母公司及整体运营效率的影响。首先,使用国家统计局公开数据集提供的企业财务和营销数据进行数据预处理。其次,设计了一个涵盖短期财务和市场指标的多层次奖励函数。同时,纳入了衡量动态能力的长期指标,如创新、市场敏捷性和资源整合。最后,通过引入SAC强化学习算法,将企业裂变场景构建为一个复杂的决策环境,其中状态空间包括企业当前的财务状况、市场表现和动态能力水平。行动空间涵盖企业裂变的各种战略选择,以模拟企业裂变决策过程。SAC算法的熵正则化特征促使模型在探索和利用之间取得平衡,以优化动态能力建设。实验结果表明,通过深度强化学习(DRL)优化的裂变路径显著提高了资源配置效率和市场响应速度,分别平均提高了20.4%和25.2%。更重要的是,动态能力建设得到了显著增强,创新能力指数提高了15.4%,市场敏捷性提高了12.3%,资源整合能力也提高了10.5%。这表明该策略有助于加速产业集群的形成。因此,本研究构建的基于SAC算法的企业裂变路径优化策略能够为企业带来持久的竞争优势。