Paramita Rafika Indah, Panigoro Sonar Soni, Fadilah Fadilah, Wanandi Septelia Inawati, Sutandyo Noorwati
Doctoral Program in Biomedical Sciences, Faculty of Medicine, Universitas Indonesia, Jalan Salemba Raya number 4, Jakarta, 10430, Indonesia.
Department of Medical Chemistry, Faculty of Medicine, Universitas Indonesia, Jalan Salemba Raya number 4, Jakarta, 10430, Indonesia.
Data Brief. 2025 Jan 27;59:111343. doi: 10.1016/j.dib.2025.111343. eCollection 2025 Apr.
Breast cancer is commonly acknowledged as the primary type of cancer on a global scale, exerting a substantial influence on death rates, particularly in developing countries. The aforementioned discovery provides evidence in favor of the concept that genetic factors may contribute to the onset of breast cancer. This paper presents the unprocessed idat data containing single nucleotide polymorphisms (SNPs) acquired from breast cancer patients and a control group comprising of healthy women. The DNA was obtained from stored blood samples that were collected from a total of 48 female patients diagnosed with breast cancer at Cipto Mangunkusumo National Hospital Jakarta and Dharmais National Cancer Center Hospital Jakarta. Additionally, 24 healthy women were included as control subjects. Subsequently, the DNA samples were subjected to hybridization onto Infinium Asian Screening Array (ASA)'s beadchips. The chip was then subjected to fluorescence intensity measurements using an iScan machine manufactured by Illumina. The data output is produced in the form of a .idat file for each sample. Subsequently, further quality control measures and population stratification analysis were conducted using PLINK (v1.9). After the conclusion of the quality control procedure, 72 individuals and a dataset consisting of 424,285 genetic variants were selected for further analysis. The idat raw data files have been added to the Gene Expression Omnibus (GEO) with accession number: GSE245794 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE245794)
乳腺癌被公认为全球主要的癌症类型,对死亡率有重大影响,尤其是在发展中国家。上述发现为遗传因素可能导致乳腺癌发病这一概念提供了证据。本文展示了从未经处理的包含单核苷酸多态性(SNP)的idat数据,这些数据取自乳腺癌患者以及由健康女性组成的对照组。DNA是从存储的血液样本中获取的,这些样本来自雅加达的Cipto Mangunkusumo国家医院和雅加达的Dharmais国家癌症中心医院总共48名被诊断患有乳腺癌的女性患者。此外,还纳入了24名健康女性作为对照对象。随后,将DNA样本与Infinium亚洲筛查阵列(ASA)的芯片进行杂交。然后使用Illumina制造的iScan机器对芯片进行荧光强度测量。每个样本的数据输出以.idat文件的形式生成。随后,使用PLINK(v1.9)进行了进一步的质量控制措施和群体分层分析。在质量控制程序结束后,选择了72名个体和一个由424,285个基因变异组成的数据集进行进一步分析。idat原始数据文件已添加到基因表达综合数据库(GEO)中,登录号为:GSE245794(https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE245794)