College of Landscape Architecture, Nanjing Forestry University, Nanjing, China.
Institute of Industrial Economics of CASS, Beijing, China.
PLoS One. 2021 Aug 18;16(8):e0256162. doi: 10.1371/journal.pone.0256162. eCollection 2021.
The development of China's manufacturing industry has received global attention. However, research on the distribution pattern, changes, and driving forces of the manufacturing industry has been limited by the accessibility of data. This study proposes a method for classifying based on natural language processing. A case study was conducted employing this method, hotspot detection and driving force analysis, wherein the driving forces industrial development during the "13th Five-Year plan" period in Jiangsu province were determined. The main conclusions of the empirical case study are as follows. 1) Through the acquisition of Amap's point-of-interest (POI, a special point location that commonly used in modern automotive navigation systems.) data, an industry type classification algorithm based on the natural language processing of POI names is proposed, with Jiangsu Province serving as an example. The empirical test shows that the accuracy was 95%, and the kappa coefficient was 0.872. 2) The seven types of manufacturing industries including the pulp and paper (PP) industry, metallurgical chemical (MC) industry, pharmaceutical manufacturing (PM) industry, machinery and electronics (ME) industry, wood furniture (WF) industry, textile clothing (TC) industry, and agricultural and food product processing (AF) industry are drawn through a 1 km× 1km projection grid. The evolution map of the spatial pattern and the density field hotspots are also drawn. 3) After analyzing the driving forces of the changes in the number of manufacturing industries mentioned above, we found that manufacturing base, distance from town, population, GDP per capita, distance from the railway station were the significant driving factors of changes in the manufacturing industries mentioned above. The results of this research can help guide the development of manufacturing industries, maximize the advantages of regional factors and conditions, and provide insight into how the spatial layout of the manufacturing industry could be optimized.
中国制造业的发展受到了全球的关注。然而,由于数据的可获得性限制,对于制造业的分布格局、变化和驱动力的研究一直受到限制。本研究提出了一种基于自然语言处理的分类方法。通过采用该方法进行案例研究,即热点检测和驱动力分析,确定了江苏省“十三五”期间制造业发展的驱动力。实证案例研究的主要结论如下:1)通过获取高德地图的兴趣点(POI)数据,提出了一种基于 POI 名称自然语言处理的产业类型分类算法,并以江苏省为例进行了实证检验。结果表明,该算法的准确率为 95%,kappa 系数为 0.872。2)通过 1km×1km 的投影网格,将制造业分为纸浆和造纸(PP)业、冶金化工(MC)业、制药制造业(PM)业、机械和电子(ME)业、木材家具(WF)业、纺织服装(TC)业和农产品加工业(AF)等七种类型,并绘制了空间格局演化图和密度场热点图。3)在分析上述制造业数量变化的驱动因素后,发现制造业基地、距离城镇的远近、人口、人均 GDP、距离火车站的远近是制造业变化的显著驱动因素。本研究的结果可以帮助指导制造业的发展,充分发挥区域因素和条件的优势,并深入了解如何优化制造业的空间布局。