Suppr超能文献

一个关于人类健康皮肤和皮肤癌所获取的光谱及临床特征的数据集。

A dataset of optical spectra and clinical features acquired on human healthy skin and on skin carcinomas.

作者信息

Elsen Thomas, Fauvel Clément, Khairallah Grégoire, Zghal Ahmed, Delconte Alain, Kupriyanov Valentin, Blondel Walter, Amouroux Marine

机构信息

Université de Lorraine, CNRS, CRAN UMR 7039, Vandoeuvre-lès-Nancy, 54500, France.

Metz-Thionville Regional Hospital, Department of plastic, aesthetic, and reconstructive surgery, Ars-Laquenexy, 57530, France.

出版信息

Data Brief. 2024 Feb 6;53:110163. doi: 10.1016/j.dib.2024.110163. eCollection 2024 Apr.

Abstract

Optical spectroscopy is studied to contribute to skin cancer diagnosis. Indeed, optical spectra are modified along cancer progression and provide complementary information (e.g., on metabolism and tissue structure) to clinical examination for surgical guidance [1,2]. The current original dataset is made of autofluorescence and diffuse reflectance spectra acquired on 131 patients' skin with the SpectroLive device [3,4]. Spatially-resolved spectroscopy measurements were performed using a multi-fiber optic probe featuring 4 distances (0.4-1 mm) between excitation and collection optical fibers: spatial resolution allows spectra acquired at different distances to carry information from different depths in skin tissues. Five types of autofluorescence spectra were acquired using five different wavelength excitations (on the 365-415 nm spectral range) in order to collect information on several skin endogenous fluorophores (e.g., flavins, collagen). A sixth light source (white broadband) was used to acquire diffuse reflectance spectra carrying information about skin scattering properties and skin endogenous absorbers such as melanin and hemoglobin. Patients were proposed to be included into the clinical trial if they were suspected of suffering from actinic keratoses (precancerous skin lesions) or from basal or squamous cell carcinomas: in all cases, complete diagnostics is provided in the dataset. To increase the interest of the dataset and evaluate the dependence of optical spectra (intensity, shape) not only on pathological states but also on healthy skin features (civil age, skin age, gender, phototype, anatomical site), spectra were acquired for all 131 patients on two so-called "reference" skin sites known to rarely suffer from skin cancer: palm of the hand (featuring a thick skin type) and inner wrist (featuring thin skin). Spectra are available in .tab files: first column displays the spectral range on which intensity spectra were recorded (317-788 nm) and each following column provides an intensity spectrum acquired by each spectrometer for a given combination of light source excitation and distance. Each of the 131 folders corresponding to each of the 131 patients contains a .json file providing patients clinical features: gender, civil age, skin age, phototype score and class. All .tab files names include anatomical site and anatomopathological diagnostics of the skin site on which spectra were acquired: codes were defined to match a letter or an acronym to each diagnostic and anatomical site. To ensure quality control, a spectrum was acquired on the same calibration standard before starting spectra acquisition on each patient. It is therefore possible to follow the impact of the acquisition optical chain ageing during the 4.5 years that the patients were included. This dataset can be used by epidemiologists for the characterization of populations affected by skin cancers (gender ratio, mean age, anatomical sites typically affected, etc.); it may also be used by researchers in artificial intelligence to develop innovative methods to process such data and contribute to non-invasive diagnostics of skin cancers whose incidence is steadily increasing.

摘要

对光谱学进行研究以助力皮肤癌诊断。实际上,光谱会随着癌症进展而发生改变,并为临床检查提供补充信息(例如关于新陈代谢和组织结构的信息),以用于手术指导[1,2]。当前的原始数据集由使用SpectroLive设备在131名患者皮肤上采集的自体荧光和漫反射光谱组成[3,4]。使用多光纤探头进行空间分辨光谱测量,激发光纤和收集光纤之间有4个距离(0.4 - 1毫米):空间分辨率使在不同距离采集的光谱能够携带来自皮肤组织不同深度的信息。使用5种不同波长的激发光(在365 - 415纳米光谱范围内)采集了5种类型的自体荧光光谱,以便收集关于多种皮肤内源性荧光团(例如黄素、胶原蛋白)的信息。使用第六个光源(白色宽带光源)采集漫反射光谱,该光谱携带有关皮肤散射特性以及皮肤内源性吸收体(如黑色素和血红蛋白)的信息。如果患者被怀疑患有光化性角化病(癌前皮肤病变)或基底细胞癌或鳞状细胞癌,则建议将其纳入临床试验:在所有情况下,数据集中都提供了完整的诊断信息。为了增加数据集的价值,并评估光谱(强度、形状)不仅对病理状态的依赖性,还对健康皮肤特征(实际年龄、皮肤年龄、性别、光型、解剖部位)的依赖性,对131名患者在两个所谓的“参考”皮肤部位采集了光谱,已知这两个部位很少患皮肤癌:手掌(厚皮类型)和手腕内侧(薄皮类型)。光谱以.tab文件形式提供:第一列显示记录强度光谱的光谱范围(317 - 788纳米),随后的每一列提供每个光谱仪针对给定光源激发和距离组合采集的强度光谱。与131名患者对应的131个文件夹中的每一个都包含一个.json文件,提供患者的临床特征:性别、实际年龄、皮肤年龄、光型评分和类别。所有.tab文件的名称都包括采集光谱的皮肤部位的解剖部位和解剖病理学诊断:定义了代码,将每个诊断和解剖部位与一个字母或首字母缩写相匹配。为确保质量控制,在开始对每个患者进行光谱采集之前,在相同的校准标准上采集了一个光谱。因此,可以跟踪在纳入患者的4.5年期间采集光学链老化的影响。该数据集可供流行病学家用于表征受皮肤癌影响的人群(性别比例、平均年龄、通常受影响的解剖部位等);人工智能领域的研究人员也可以使用它来开发创新方法来处理此类数据,并为皮肤癌的非侵入性诊断做出贡献,皮肤癌的发病率正在稳步上升。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d791/10875235/d63a225ce806/gr1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验