Department of Infectious Diseases, Peking University Third Hospital, Beijing, 100191, China.
College of Engineering, Peking University, Beijing, 100871,
JASA Express Lett. 2024 May 1;4(5). doi: 10.1121/10.0025851.
Machine learning enabled auscultating diagnosis can provide promising solutions especially for prescreening purposes. The bottleneck for its potential success is that high-quality datasets for training are still scarce. An open auscultation dataset that consists of samples and annotations from patients and healthy individuals is established in this work for the respiratory diagnosis studies with machine learning, which is of both scientific importance and practical potential. A machine learning approach is examined to showcase the use of this new dataset for lung sound classifications with different diseases. The open dataset is available to the public online.
机器学习辅助听诊诊断特别有助于筛查目的,有望提供解决方案。但目前其成功的瓶颈在于,用于训练的高质量数据集仍然稀缺。本研究建立了一个公开听诊数据集,包含来自患者和健康个体的样本和注释,可用于机器学习的呼吸诊断研究,具有重要的科学意义和实际应用潜力。本研究还采用机器学习方法,展示了使用这个新数据集进行不同疾病的肺部声音分类。该公开数据集可在线向公众开放。