Ketcham Mahasak, Ganokratanaa Thittaporn, Sridoung Nattapat
Department of Information Technology Management, King Mongkut's University of Technology North Bangkok, Bangkok, Thailand.
Applied Computer Science Programme, Department of Mathematics, King Mongkut's University of Technology Thonburi, Bangkok, Thailand.
MethodsX. 2023 Aug 30;11:102346. doi: 10.1016/j.mex.2023.102346. eCollection 2023 Dec.
The Broadband Internet industry is highly competitive, with service providers investing heavily in network development to meet customer demands and competing on pricing. Effective cost management is crucial for profitability in this market. This work proposes a model for classifying broadband network devices based on text mining techniques applied to a device list from a leading broadband network company in Thailand. The device descriptions are used to generate a feature vector, which is then employed by a classification algorithm to categorize devices into core, access, and last mile hierarchies. Various algorithms including decision tree, naïve Bayes, Bayesian network, k-nearest neighbor, support vector machine, and deep neural network are compared, with support vector machine achieving the highest accuracy of 90.35%. The results are visualized to provide insights into network hierarchy, device replacement dates, and budget requirements, enabling support for cost management, budget planning, maintenance, and investment decision-making. The methodology outline includes,•Obtaining a device list from a major broadband network company and extracting device descriptions through text mining and generating a feature vector.•Using a support vector machine for classification and comparing algorithm performances.•Visualizing the results for actionable insights in cost management, budget planning, and investment decisions.
宽带互联网行业竞争激烈,服务提供商在网络开发方面投入巨大,以满足客户需求并在价格上展开竞争。有效的成本管理对于该市场的盈利能力至关重要。这项工作提出了一种基于文本挖掘技术的宽带网络设备分类模型,该技术应用于泰国一家领先宽带网络公司的设备列表。设备描述用于生成特征向量,然后由分类算法将设备分类为核心、接入和最后一英里层次结构。比较了包括决策树、朴素贝叶斯、贝叶斯网络、k近邻、支持向量机和深度神经网络在内的各种算法,支持向量机的准确率最高,达到了90.35%。结果进行了可视化处理,以提供有关网络层次结构、设备更换日期和预算要求的见解,从而为成本管理、预算规划、维护和投资决策提供支持。方法概述包括:
从一家主要的宽带网络公司获取设备列表,并通过文本挖掘提取设备描述并生成特征向量。
使用支持向量机进行分类并比较算法性能。
对结果进行可视化处理,以获得成本管理、预算规划和投资决策方面的可操作见解。