Suppr超能文献

Leveraging the process mining technique to optimize data preparation time in a database used as an automated data delivery center.

作者信息

Abrehdari Seyed Hossein

机构信息

University of Tehran, Tehran, Iran.

Institute of Geophysics and Engineering Seismology, National Academy of Sciences, Armenia.

出版信息

MethodsX. 2025 Jun 12;15:103428. doi: 10.1016/j.mex.2025.103428. eCollection 2025 Dec.

Abstract

This study investigates the development and implementation of a seismic database utilizing process mining techniques. This data format is generated and stored in seismic centers, such as the U.S. Geological Survey (USGS). The study explored the various stages involved in the preparation, delivery, and processing of a database containing almost 900 earthquake waveform records (considered big data) by utilizing process mining techniques. The data were gathered from a region spanning 388,111.5 km², located between 44°-51°E and 38°-42.5°N, over the period from 1999 to 2018, and sourced from the USGS. The findings of this study indicate that the use of process mining methodologies decreases the time needed for database creation, including request, collection, preparation, and delivery, from 25 days with manual processing to approximately 8 days. In parallel, custom-built software scripts (computer codes) were deployed as unmanned tools to streamline the time-consuming phases of database creation. The idea presented in this study can help optimize the time for creating, storing, and delivering the database in seismological centers or other data centers, especially in an era where efficient management of large scientific datasets is increasingly vital. In total, process mining techniques were employed to analyze the workflow involved in creating a large database, including the steps of data request, preparation, and delivery.

摘要
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f292/12268689/407009a5c81f/ga1.jpg

相似文献

1
5
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
6
Comparison of self-administered survey questionnaire responses collected using mobile apps versus other methods.
Cochrane Database Syst Rev. 2015 Jul 27;2015(7):MR000042. doi: 10.1002/14651858.MR000042.pub2.
7
The Subscapularis-Sparing Windowed Anterior Technique (SWAT) for Anatomic Total Shoulder Arthroplasty.
JBJS Essent Surg Tech. 2025 Jul 17;15(3). doi: 10.2106/JBJS.ST.24.00007. eCollection 2025 Jul-Sep.
8
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
9
Eliciting adverse effects data from participants in clinical trials.
Cochrane Database Syst Rev. 2018 Jan 16;1(1):MR000039. doi: 10.1002/14651858.MR000039.pub2.
10
Automated monitoring compared to standard care for the early detection of sepsis in critically ill patients.
Cochrane Database Syst Rev. 2018 Jun 25;6(6):CD012404. doi: 10.1002/14651858.CD012404.pub2.

本文引用的文献

1
Principles of data mining.
Drug Saf. 2007;30(7):621-2. doi: 10.2165/00002018-200730070-00010.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验