Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
Division of Allergy and Immunology, Department of Pediatrics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
Cell Rep. 2021 Apr 13;35(2):108975. doi: 10.1016/j.celrep.2021.108975.
Although clinical and laboratory data have long been used to guide medical practice, this information is rarely integrated with multi-omic data to identify endotypes. We present Merged Affinity Network Association Clustering (MANAclust), a coding-free, automated pipeline enabling integration of categorical and numeric data spanning clinical and multi-omic profiles for unsupervised clustering to identify disease subsets. Using simulations and real-world data from The Cancer Genome Atlas, we demonstrate that MANAclust's feature selection algorithms are accurate and outperform competitors. We also apply MANAclust to a clinically and multi-omically phenotyped asthma cohort. MANAclust identifies clinically and molecularly distinct clusters, including heterogeneous groups of "healthy controls" and viral and allergy-driven subsets of asthmatic subjects. We also find that subjects with similar clinical presentations have disparate molecular profiles, highlighting the need for additional testing to uncover asthma endotypes. This work facilitates data-driven personalized medicine through integration of clinical parameters with multi-omics. MANAclust is freely available at https://bitbucket.org/scottyler892/manaclust/src/master/.
尽管临床和实验室数据长期以来一直被用于指导医学实践,但这些信息很少与多组学数据整合,以识别表型。我们提出了合并亲和网络关联聚类(MANAclust),这是一个无编码、自动化的管道,能够整合临床和多组学特征,跨越类别和数值数据,进行无监督聚类,以识别疾病亚群。通过使用模拟和来自癌症基因组图谱的真实世界数据,我们证明了 MANAclust 的特征选择算法是准确的,并优于竞争对手。我们还将 MANAclust 应用于具有临床和多组学表型的哮喘队列。MANAclust 识别出具有临床和分子特征的不同聚类,包括“健康对照”的异质组和哮喘患者的病毒和过敏驱动亚组。我们还发现,具有相似临床表现的患者具有不同的分子特征,这突出表明需要进行额外的测试来发现哮喘的表型。这项工作通过将临床参数与多组学整合,促进了数据驱动的个性化医疗。MANAclust 可在 https://bitbucket.org/scottyler892/manaclust/src/master/ 上免费获得。