Kujala Rainer, Weckström Christoffer, Darst Richard K, Mladenović Miloš N, Saramäki Jari
Department of Computer Science, Aalto University, P.O. Box 15400, FI-00076 Aalto/Espoo, Finland.
Department of Built Environment, Aalto University, P.O. Box 14100, FI-00076 Aalto/Espoo, Finland.
Sci Data. 2018 May 15;5:180089. doi: 10.1038/sdata.2018.89.
Many public transport (PT) agencies publish their route and timetable information in the General Transit Feed Specification (GTFS), the standard open format. Timetable data are commonly used for PT passenger routing. They can also be used to study the structure and organization of PT networks, as well as the accessibility and level of service these networks provide. However, using raw GTFS data is challenging: researchers need to understand the details of the GTFS data format and ensure that the data cover all relevant modes of public transport and contain no errors. To lower the barrier to using GTFS data in research, we publish a curated collection of 25 cities' public transport networks in multiple easy-to-use formats, including network edge lists, temporal network event lists, SQLite databases, GeoJSON files, and the GTFS data format. This collection promotes the study of how PT is organized across the globe and provides a testbed for developing tools for PT network analysis and PT routing algorithms.