Stefanovič Pavel, Štrimaitis Rokas, Kurasova Olga
Faculty of Fundamental Science, Vilnius Gediminas Technical University, Saulėtekio al. 11, LT-10223 Vilnius, Lithuania.
Institute of Data Science and Digital Technologies, Vilnius University, Akademijos str. 4, LT-08663 Vilnius, Lithuania.
Comput Intell Neurosci. 2020 Oct 26;2020:8878681. doi: 10.1155/2020/8878681. eCollection 2020.
In the paper, the flight time deviation of Lithuania airports has been analyzed. The supervised machine learning model has been implemented to predict the interval of time delay deviation of new flights. The analysis has been made using seven algorithms: probabilistic neural network, multilayer perceptron, decision trees, random forest, tree ensemble, gradient boosted trees, and support vector machines. To find the best parameters which give the highest accuracy for each algorithm, the grid search has been used. To evaluate the quality of each algorithm, the five measures have been calculated: sensitivity/recall, precision, specificity, -measure, and accuracy. All experimental investigation has been made using the newly collected dataset from Lithuania airports and weather information on departure/landing time. The departure flights and arrival flights have been investigated separately. To balance the dataset, the SMOTE technique is used. The research results showed that the highest accuracy is obtained using the tree model classifiers and the best algorithm of this type to predict is gradient boosted trees.
本文分析了立陶宛机场的航班飞行时间偏差。已实施监督式机器学习模型来预测新航班的时间延迟偏差区间。使用了七种算法进行分析:概率神经网络、多层感知器、决策树、随机森林、树集成、梯度提升树和支持向量机。为找到每种算法具有最高准确率的最佳参数,使用了网格搜索。为评估每种算法的质量,计算了五个指标:灵敏度/召回率、精确率、特异性、F1值和准确率。所有实验研究均使用从立陶宛机场新收集的数据集以及出发/降落时间的天气信息。出发航班和抵达航班分别进行了调查。为平衡数据集,使用了SMOTE技术。研究结果表明,使用树模型分类器可获得最高准确率,此类最佳预测算法为梯度提升树。