Han Jaewon, Woo Ayoung, Lee Sugie
Department of Urban Planning & Engineering, Hanyang University, Seoul, South Korea.
Graduate School of Urban Studies, Hanyang University, Seoul, South Korea.
PLoS One. 2025 May 21;20(5):e0323495. doi: 10.1371/journal.pone.0323495. eCollection 2025.
Previous studies using the conventional Hedonic Price Model to predict existing housing prices may have limitations in addressing the relationship between house prices and streetscapes as visually perceived at the human eye level, due to the constraints of streetscape estimations. Therefore, in this study, we analyzed the relationship between streetscapes visually perceived at eye level and single-family home prices in Seoul, Korea, using computer vision technology and machine learning algorithms. We used transaction data for 13,776 single-family housing sales between 2017 and 2019. To measure visually perceived streetscapes, this study used the Deeplab V3 + deep-learning model with 233,106 Google Street View panoramic images. Then, the best machine-learning model was selected by comparing the explanatory powers of the hedonic price model and all alternative machine-learning models. According to the results, the Gradient Boost model, a representative ensemble machine learning model, performed better than XGBoost, Random Forest, and Linear Regression models in predicting single-family house prices. In addition, this study used an interpretable machine learning model of the SHAP method to identify key features that affect single-family home price prediction. This solves the "black box" problem of machine learning models. Finally, by analyzing the nonlinear relationship and interaction effects between perceived streetscape characteristics and house prices, we easily and quickly identified the relationship between variables the hedonic price model partially considers.
以往使用传统享乐价格模型预测现有房价的研究,由于街景估计的限制,在处理房价与人眼视觉层面所感知的街景之间的关系时可能存在局限性。因此,在本研究中,我们运用计算机视觉技术和机器学习算法,分析了韩国首尔人眼视觉层面所感知的街景与独栋房屋价格之间的关系。我们使用了2017年至2019年间13776笔独栋房屋销售的交易数据。为了测量视觉上感知到的街景,本研究使用了带有233106张谷歌街景全景图像的深度卷积神经网络语义分割模型(Deeplab V3+)深度学习模型。然后,通过比较享乐价格模型和所有替代机器学习模型的解释力,选择了最佳机器学习模型。结果显示,作为代表性集成机器学习模型的梯度提升模型在预测独栋房屋价格方面比XGBoost、随机森林和线性回归模型表现更好。此外,本研究使用了SHAP方法的可解释机器学习模型来识别影响独栋房屋价格预测的关键特征。这解决了机器学习模型的“黑箱”问题。最后,通过分析感知到的街景特征与房价之间的非线性关系和交互效应,我们轻松快速地确定了享乐价格模型部分考虑的变量之间的关系。