利用多种类型基因组数据的综合分析鉴定复杂疾病相关基因。

Identification of genes for complex diseases using integrated analysis of multiple types of genomic data.

机构信息

Department of Biomedical Engineering, Tulane University, New Orleans, Louisiana, United States of America.

出版信息

PLoS One. 2012;7(9):e42755. doi: 10.1371/journal.pone.0042755. Epub 2012 Sep 5.

DOI:10.1371/journal.pone.0042755

PMID:22957024

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3434191/

Abstract

Various types of genomic data (e.g., SNPs and mRNA transcripts) have been employed to identify risk genes for complex diseases. However, the analysis of these data has largely been performed in isolation. Combining these multiple data for integrative analysis can take advantage of complementary information and thus can have higher power to identify genes (and/or their functions) that would otherwise be impossible with individual data analysis. Due to the different nature, structure, and format of diverse sets of genomic data, multiple genomic data integration is challenging. Here we address the problem by developing a sparse representation based clustering (SRC) method for integrative data analysis. As an example, we applied the SRC method to the integrative analysis of 376821 SNPs in 200 subjects (100 cases and 100 controls) and expression data for 22283 genes in 80 subjects (40 cases and 40 controls) to identify significant genes for osteoporosis (OP). Comparing our results with previous studies, we identified some genes known related to OP risk (e.g., 'THSD4', 'CRHR1', 'HSD11B1', 'THSD7A', 'BMPR1B' 'ADCY10', 'PRL', 'CA8','ESRRA', 'CALM1', 'CALM1', 'SPARC', and 'LRP1'). Moreover, we uncovered novel osteoporosis susceptible genes ('DICER1', 'PTMA', etc.) that were not found previously but play functionally important roles in osteoporosis etiology from existing studies. In addition, the SRC method identified genes can lead to higher accuracy for the diagnosis/classification of osteoporosis subjects when compared with the traditional T-test and Fisher-exact test, which further validates the proposed SRC approach for integrative analysis.

摘要

各种类型的基因组数据（例如，SNP 和 mRNA 转录本）已被用于鉴定复杂疾病的风险基因。然而，这些数据的分析在很大程度上是孤立进行的。将这些多种数据进行整合分析可以利用互补信息，从而可以更有效地识别个体数据分析不可能识别的基因（和/或其功能）。由于不同类型、结构和格式的基因组数据的不同性质，多种基因组数据的整合具有挑战性。在这里，我们通过开发一种基于稀疏表示的聚类（SRC）方法来解决这个问题，用于整合数据分析。例如，我们将 SRC 方法应用于 200 名受试者（100 例和 100 例对照）的 376821 个 SNP 和 80 名受试者（40 例和 40 例对照）的 22283 个基因表达数据的整合分析，以鉴定骨质疏松症（OP）的显著基因。将我们的结果与以前的研究进行比较，我们鉴定了一些已知与 OP 风险相关的基因（例如，'THSD4'、'CRHR1'、'HSD11B1'、'THSD7A'、'BMPR1B'、'ADCY10'、'PRL'、'CA8'、'ESRRA'、'CALM1'、'CALM1'、'SPARC' 和'LRP1'）。此外，我们还发现了一些以前没有发现但在现有研究中对骨质疏松症病因学具有重要功能作用的新的骨质疏松症易感基因（'DICER1'、'PTMA'等）。此外，与传统的 T 检验和 Fisher 精确检验相比，SRC 方法识别的基因可以提高骨质疏松症患者诊断/分类的准确性，进一步验证了该 SRC 方法在整合分析中的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3baa/3434191/24be57c369ee/pone.0042755.g001.jpg

相似文献

Identification of genes for complex diseases using integrated analysis of multiple types of genomic data.

PLoS One. 2012;7(9):e42755. doi: 10.1371/journal.pone.0042755. Epub 2012 Sep 5.

Identification of genes for complex diseases by integrating multiple types of genomic data.

Annu Int Conf IEEE Eng Med Biol Soc. 2012;2012:5541-4. doi: 10.1109/EMBC.2012.6347249.

Sparse representation based biomarker selection for schizophrenia with integrated analysis of fMRI and SNPs.

Neuroimage. 2014 Nov 15;102 Pt 1:220-8. doi: 10.1016/j.neuroimage.2014.01.021. Epub 2014 Feb 12.

Towards precise classification of cancers based on robust gene functional expression profiles.

BMC Bioinformatics. 2005 Mar 17;6:58. doi: 10.1186/1471-2105-6-58.

Integrative genomics analysis of eQTL and GWAS summary data identifies PPP1CB as a novel bone mineral density risk genes.

Biosci Rep. 2020 Apr 30;40(4). doi: 10.1042/BSR20193185.

Integrative genomic analysis predicts novel functional enhancer-SNPs for bone mineral density.

Hum Genet. 2019 Feb;138(2):167-185. doi: 10.1007/s00439-019-01971-4. Epub 2019 Jan 17.

InterSIM: Simulation tool for multiple integrative 'omic datasets'.

Comput Methods Programs Biomed. 2016 May;128:69-74. doi: 10.1016/j.cmpb.2016.02.011. Epub 2016 Feb 27.

Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis.

Bioinformatics. 2009 Nov 15;25(22):2906-12. doi: 10.1093/bioinformatics/btp543. Epub 2009 Sep 16.

A single nucleotide polymorphism in the TGF-β1 gene (rs1982073 C>T) may contribute to increased risks of bone fracture, osteoporosis, and osteoarthritis: a meta-analysis.

Clin Rheumatol. 2016 Apr;35(4):973-85. doi: 10.1007/s10067-014-2840-7. Epub 2014 Dec 13.

An integrative systems genetics approach reveals potential causal genes and pathways related to obesity.

Genome Med. 2015 Oct 20;7:105. doi: 10.1186/s13073-015-0229-0.

引用本文的文献

Identification of Serum Exosome-Derived circRNA-miRNA-TF-mRNA Regulatory Network in Postmenopausal Osteoporosis Using Bioinformatics Analysis and Validation in Peripheral Blood-Derived Mononuclear Cells.

Front Endocrinol (Lausanne). 2022 Jun 9;13:899503. doi: 10.3389/fendo.2022.899503. eCollection 2022.

A hybrid gene selection method based on gene scoring strategy and improved particle swarm optimization.

BMC Bioinformatics. 2019 Jun 10;20(Suppl 8):289. doi: 10.1186/s12859-019-2773-x.

Lrp1 in osteoblasts controls osteoclast activity and protects against osteoporosis by limiting PDGF-RANKL signaling.

Bone Res. 2018 Feb 26;6:4. doi: 10.1038/s41413-017-0006-3. eCollection 2018.

Integrating multiple genomic data: sparse representation based biomarker selection for blood pressure.

BMC Proc. 2016 Oct 18;10(Suppl 7):283-288. doi: 10.1186/s12919-016-0044-7. eCollection 2016.

A novel strategy for gene selection of microarray data based on gene-to-class sensitivity information.

PLoS One. 2014 May 20;9(5):e97530. doi: 10.1371/journal.pone.0097530. eCollection 2014.

Sparse representation based biomarker selection for schizophrenia with integrated analysis of fMRI and SNPs.

Neuroimage. 2014 Nov 15;102 Pt 1:220-8. doi: 10.1016/j.neuroimage.2014.01.021. Epub 2014 Feb 12.

本文引用的文献

Classification of multicolor fluorescence in situ hybridization (M-FISH) images with sparse representation.

IEEE Trans Nanobioscience. 2012 Jun;11(2):111-8. doi: 10.1109/TNB.2012.2189414.

An integrative study ascertained SOD2 as a susceptibility gene for osteoporosis in Chinese.

J Bone Miner Res. 2011 Nov;26(11):2695-701. doi: 10.1002/jbmr.471.

A microRNA expression signature of osteoclastogenesis.

Blood. 2011 Mar 31;117(13):3648-57. doi: 10.1182/blood-2010-10-311415. Epub 2011 Jan 27.

A Hybrid Machine Learning Method for Fusing fMRI and Genetic Data: Combining both Improves Classification of Schizophrenia.

Front Hum Neurosci. 2010 Oct 25;4:192. doi: 10.3389/fnhum.2010.00192. eCollection 2010.

An integration of genome-wide association study and gene expression profiling to prioritize the discovery of novel susceptibility Loci for osteoporosis-related traits.

PLoS Genet. 2010 Jun 10;6(6):e1000977. doi: 10.1371/journal.pgen.1000977.

Integrative analysis of gene expression and copy number alterations using canonical correlation analysis.

BMC Bioinformatics. 2010 Apr 15;11:191. doi: 10.1186/1471-2105-11-191.

Molecular genetic studies of gene identification for osteoporosis: the 2009 update.

Endocr Rev. 2010 Aug;31(4):447-505. doi: 10.1210/er.2009-0032. Epub 2010 Mar 31.

Dicer inactivation in osteoprogenitor cells compromises fetal survival and bone formation, while excision in differentiated osteoblasts increases bone mass in the adult mouse.

Dev Biol. 2010 Apr 1;340(1):10-21. doi: 10.1016/j.ydbio.2010.01.008. Epub 2010 Jan 15.

Osteoclast-specific Dicer gene deficiency suppresses osteoclastic bone resorption.

J Cell Biochem. 2010 Apr 1;109(5):866-75. doi: 10.1002/jcb.22228.

Genome-wide association and follow-up replication studies identified ADAMTS18 and TGFBR3 as bone mass candidate genes in different ethnic groups.

Am J Hum Genet. 2009 Mar;84(3):388-98. doi: 10.1016/j.ajhg.2009.01.025. Epub 2009 Feb 26.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用多种类型基因组数据的综合分析鉴定复杂疾病相关基因。

Identification of genes for complex diseases using integrated analysis of multiple types of genomic data.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献