• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用文本挖掘分析和网络拓扑发现交通致死事故叙述中的潜在主题。

Discovering latent themes in traffic fatal crash narratives using text mining analytics and network topology.

机构信息

Dept. of Civil and Construction Engineering, Western Michigan Univ., 4601 Campus Dr., G-238, Kalamazoo, MI, 49008-5316, United States.

Dept. of Statistics, Western Michigan Univ., 1903 W Michigan Ave, Kalamazoo, MI, 49008-5152, United States.

出版信息

Accid Anal Prev. 2021 Feb;150:105899. doi: 10.1016/j.aap.2020.105899. Epub 2020 Dec 4.

DOI:10.1016/j.aap.2020.105899
PMID:33285445
Abstract

The proliferation of digital textual archives in the transportation safety domain makes it imperative for the inventions of efficient ways of extracting information from the textual data sources. The present study aims at utilizing crash narratives complemented by crash metadata to discern the prevalence and co-occurrence of themes that contribute to crash incidents. Ten years (2009-2018) of Michigan traffic fatal crash narratives were used as a case study. The structural topic modeling (STM) and network topology analysis were used to generate and examine the prevalence and interaction of themes from the crash narratives that were mainly categorized into pre-crash events, crash locations and involved parties in the traffic crashes. The main advantage of the STM over the other topic modeling approaches is that it allows the researchers to discover themes from documents and estimate how the topic relates to the document metadata. Topics with the highest prevalence for the angle, head-on, rear-end, sideswipe and single motor vehicle crashes were crash at stop-sign, crossing the centerline, unable to stop, lane change maneuver and run-off-road crash, respectively. Eigenvector centrality measure in network topology showed that event-related topics were consistently central in articulating the crash occurrence. The centrality and association between topics varied across crash types. The efficacy of generated topics in classifying crashes by type was tested using a machine learning algorithm, Random Forest. The classification accuracy in the held-out sample ranged between 89.3 % for sideswipe crashes to 99.2 % for single motor vehicle crashes. High classification accuracy suggests that automation of crash typing and consistency checks can be accomplished effectively by using extracted latent themes from the crash narratives.

摘要

交通安全领域数字文本档案的大量涌现,使得从文本数据源中高效提取信息的发明变得势在必行。本研究旨在利用事故叙述,并辅以事故元数据,辨别导致事故的主题的普遍性和共同出现。选取密歇根州十年(2009-2018 年)的交通致命事故叙述作为案例研究。采用结构主题模型(STM)和网络拓扑分析,从事故叙述中生成和检验主题的普遍性和相互作用,这些主题主要分为事故前事件、事故地点和事故涉及方。与其他主题建模方法相比,STM 的主要优势在于,它允许研究人员从文档中发现主题,并估计主题与文档元数据的关系。角度碰撞、正面碰撞、追尾碰撞、侧面碰撞和单辆机动车碰撞的最高出现率主题分别是停车标志处碰撞、越过中心线、无法停车、变道机动和驶离道路碰撞。网络拓扑中的特征向量中心性度量表明,与事件相关的主题在阐述事故发生时始终处于中心位置。主题的中心性和关联在不同的碰撞类型之间有所不同。使用机器学习算法随机森林(Random Forest)对生成的主题进行分类的效果进行了测试。在保留样本中的分类准确率范围从侧面碰撞的 89.3%到单辆机动车碰撞的 99.2%。高分类准确率表明,可以通过从事故叙述中提取潜在主题来有效地实现碰撞类型的自动化和一致性检查。

相似文献

1
Discovering latent themes in traffic fatal crash narratives using text mining analytics and network topology.利用文本挖掘分析和网络拓扑发现交通致死事故叙述中的潜在主题。
Accid Anal Prev. 2021 Feb;150:105899. doi: 10.1016/j.aap.2020.105899. Epub 2020 Dec 4.
2
Advancing investigation of automated vehicle crashes using text analytics of crash narratives and Bayesian analysis.利用事故叙述的文本分析和贝叶斯分析推进自动驾驶汽车事故的调查。
Accid Anal Prev. 2023 Mar;181:106932. doi: 10.1016/j.aap.2022.106932. Epub 2022 Dec 27.
3
How instantaneous driving behavior contributes to crashes at intersections: Extracting useful information from connected vehicle message data.瞬时驾驶行为如何导致交叉口事故:从车联网消息数据中提取有用信息。
Accid Anal Prev. 2019 Jun;127:118-133. doi: 10.1016/j.aap.2019.01.014. Epub 2019 Mar 7.
4
Analyzing relationships between latent topics in autonomous vehicle crash narratives and crash severity using natural language processing techniques and explainable XGBoost.利用自然语言处理技术和可解释的 XGBoost 分析自动驾驶汽车事故叙述中潜在主题与事故严重程度之间的关系。
Accid Anal Prev. 2024 Aug;203:107605. doi: 10.1016/j.aap.2024.107605. Epub 2024 May 13.
5
Assessing the crash risk of mixed traffic on multilane rural highways using a proactive safety approach.采用主动安全方法评估多车道农村公路混合交通的碰撞风险。
Accid Anal Prev. 2023 Aug;188:107099. doi: 10.1016/j.aap.2023.107099. Epub 2023 May 7.
6
Injury severity analysis of pedestrian and bicyclist trespassing crashes at non-crossings: A hybrid predictive text analytics and heterogeneity-based statistical modeling approach.行人和自行车非交叉口闯入事故严重程度分析:一种基于混合预测文本分析和异质性的统计建模方法。
Accid Anal Prev. 2021 Feb;150:105835. doi: 10.1016/j.aap.2020.105835. Epub 2020 Dec 9.
7
Modeling crash outcome probabilities at rural intersections: application of hierarchical binomial logistic models.农村十字路口碰撞事故结果概率建模:分层二项逻辑模型的应用。
Accid Anal Prev. 2007 Jan;39(1):125-34. doi: 10.1016/j.aap.2006.06.011. Epub 2006 Aug 22.
8
Understanding the effects of vehicle platoons on crash type and severity.理解车阵对事故类型和严重程度的影响。
Accid Anal Prev. 2021 Jan;149:105858. doi: 10.1016/j.aap.2020.105858. Epub 2020 Nov 18.
9
Comparison of teen and adult driver crash scenarios in a nationally representative sample of serious crashes.在全国具有代表性的严重撞车事故样本中,青少年与成年驾驶员撞车场景的比较。
Accid Anal Prev. 2014 Nov;72:302-8. doi: 10.1016/j.aap.2014.07.016. Epub 2014 Aug 5.
10
The Association between Regional Environmental Factors and Road Trauma Rates: A Geospatial Analysis of 10 Years of Road Traffic Crashes in British Columbia, Canada.区域环境因素与道路创伤率之间的关联:对加拿大不列颠哥伦比亚省十年道路交通碰撞事故的地理空间分析
PLoS One. 2016 Apr 21;11(4):e0153742. doi: 10.1371/journal.pone.0153742. eCollection 2016.

引用本文的文献

1
Multivariate analysis of roadway multi-fatality crashes using association rules mining and rules graph structures: A case study in China.基于关联规则挖掘和规则图结构的道路多死事故的多变量分析:中国的案例研究。
PLoS One. 2022 Oct 27;17(10):e0276817. doi: 10.1371/journal.pone.0276817. eCollection 2022.