• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高斯镜与 Model-X 伪影模型的系统比较。

The systematic comparison between Gaussian mirror and Model-X knockoff models.

机构信息

Department of Health Statistics, School of Public Health, Shanxi Medical University, No 56 Xinjian South Road, Yingze District, Taiyuan, Shanxi Province, China.

Department of Statistics, University of Auckland, 38 Princes Street, Auckland Central, Auckland, New Zealand, 1010.

出版信息

Sci Rep. 2023 Apr 4;13(1):5478. doi: 10.1038/s41598-023-32605-5.

DOI:10.1038/s41598-023-32605-5
PMID:37015993
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10073103/
Abstract

While the high-dimensional biological data have provided unprecedented data resources for the identification of biomarkers, consensus is still lacking on how to best analyze them. The recently developed Gaussian mirror (GM) and Model-X (MX) knockoff-based methods have much related model assumptions, which makes them appealing for the detection of new biomarkers. However, there are no guidelines for their practical use. In this research, we systematically compared the performance of MX-based and GM methods, where the impacts of the distribution of explanatory variables, their relatedness and the signal-to-noise ratio were evaluated. MX with knockoff generated using the second-order approximates (MX-SO) has the best performance as compared to other MX-based methods. MX-SO and GM have similar levels of power and computational speed under most of the simulations, but GM is more robust in the control of false discovery rate (FDR). In particular, MX-SO can only control the FDR well when there are weak correlations among explanatory variables and the sample size is at least moderate. On the contrary, GM can have the desired FDR as long as explanatory variables are not highly correlated. We further used GM and MX-based methods to detect biomarkers that are associated with the Alzheimer's disease-related PET-imaging trait and the Parkinson's disease-related T-tau of cerebrospinal fluid. We found that MX-based and GM methods are both powerful for the analysis of big biological data. Although genes selected from MX-based methods are more similar as compared to those from the GM method, both MX-based and GM methods can identify the well-known disease-associated genes for each disease. While MX-based methods can have a slightly higher power than that of the GM method, it is less robust, especially for data with small sample sizes, unknown distributions, and high correlations.

摘要

虽然高维生物学数据为识别生物标志物提供了前所未有的数据资源,但如何最好地分析这些数据仍缺乏共识。最近开发的高斯镜 (GM) 和基于 Model-X (MX) knockoff 的方法具有许多相关的模型假设,这使得它们非常适合检测新的生物标志物。然而,目前还没有关于它们实际使用的指南。在这项研究中,我们系统地比较了基于 MX 和 GM 方法的性能,评估了解释变量的分布、相关性和信噪比的影响。与其他基于 MX 的方法相比,使用二阶近似 (MX-SO) 生成 knockoff 的 MX 表现最佳。在大多数模拟中,MX-SO 和 GM 的性能和计算速度相似,但 GM 在控制假发现率 (FDR) 方面更稳健。特别是,只有当解释变量之间相关性较弱且样本量至少适中时,MX-SO 才能很好地控制 FDR。相反,只要解释变量之间不高度相关,GM 就可以达到期望的 FDR。我们进一步使用 GM 和基于 MX 的方法来检测与阿尔茨海默病相关的 PET 成像特征和帕金森病相关的脑脊液 T-tau 相关的生物标志物。我们发现,基于 MX 和 GM 的方法都非常适合分析大型生物学数据。虽然基于 MX 的方法选择的基因与 GM 方法相比更相似,但基于 MX 和 GM 的方法都可以识别出每种疾病的已知与疾病相关的基因。虽然基于 MX 的方法的功效略高于 GM 方法,但它的稳健性较差,尤其是对于样本量较小、分布未知和相关性较高的数据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac12/10073103/f51e2e469419/41598_2023_32605_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac12/10073103/fd5f4a8fec41/41598_2023_32605_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac12/10073103/95c8d80f0c91/41598_2023_32605_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac12/10073103/e3c9c35da790/41598_2023_32605_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac12/10073103/f51e2e469419/41598_2023_32605_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac12/10073103/fd5f4a8fec41/41598_2023_32605_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac12/10073103/95c8d80f0c91/41598_2023_32605_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac12/10073103/e3c9c35da790/41598_2023_32605_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac12/10073103/f51e2e469419/41598_2023_32605_Fig4_HTML.jpg

相似文献

1
The systematic comparison between Gaussian mirror and Model-X knockoff models.高斯镜与 Model-X 伪影模型的系统比较。
Sci Rep. 2023 Apr 4;13(1):5478. doi: 10.1038/s41598-023-32605-5.
2
Longitudinal structural cerebral changes related to core CSF biomarkers in preclinical Alzheimer's disease: A study of two independent datasets.与临床前阿尔茨海默病核心 CSF 生物标志物相关的纵向结构脑变化:两项独立数据集研究。
Neuroimage Clin. 2018 Apr 16;19:190-201. doi: 10.1016/j.nicl.2018.04.016. eCollection 2018.
3
Alzheimer's disease markers, hypertension, and gray matter damage in normal elderly.正常老年人的阿尔茨海默病标志物、高血压和灰质损伤。
Neurobiol Aging. 2012 Jul;33(7):1215-27. doi: 10.1016/j.neurobiolaging.2011.02.012. Epub 2011 May 6.
4
The association between biomarkers in cerebrospinal fluid and structural changes in the brain in patients with Alzheimer's disease.阿尔茨海默病患者脑脊液生物标志物与大脑结构变化的关系。
J Intern Med. 2014 Apr;275(4):418-27. doi: 10.1111/joim.12164. Epub 2013 Dec 11.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
PTI-125 Reduces Biomarkers of Alzheimer's Disease in Patients.PTI-125 降低阿尔茨海默病患者的生物标志物。
J Prev Alzheimers Dis. 2020;7(4):256-264. doi: 10.14283/jpad.2020.6.
7
CSF and blood biomarkers for the diagnosis of Alzheimer's disease: a systematic review and meta-analysis.脑脊液和血液生物标志物在阿尔茨海默病诊断中的应用:系统评价和荟萃分析。
Lancet Neurol. 2016 Jun;15(7):673-684. doi: 10.1016/S1474-4422(16)00070-3. Epub 2016 Apr 8.
8
Knockoff procedure for false discovery rate control in high-dimensional data streams.高维数据流中用于错误发现率控制的仿冒程序。
J Appl Stat. 2023 May 15;50(14):2970-2983. doi: 10.1080/02664763.2023.2200496. eCollection 2023.
9
Knockoff boosted tree for model-free variable selection.无模型变量选择的仿射提升树。
Bioinformatics. 2021 May 17;37(7):976-983. doi: 10.1093/bioinformatics/btaa770.
10
Evolution of Alzheimer's Disease Cerebrospinal Fluid Biomarkers in Early Parkinson's Disease.阿尔茨海默病脑脊液生物标志物在早期帕金森病中的演变。
Ann Neurol. 2020 Sep;88(3):574-587. doi: 10.1002/ana.25811. Epub 2020 Jul 2.

本文引用的文献

1
Clipper: p-value-free FDR control on high-throughput data from two conditions.Clipper:针对两种条件下高通量数据的 p 值自由 FDR 控制。
Genome Biol. 2021 Oct 11;22(1):288. doi: 10.1186/s13059-021-02506-9.
2
False discovery rate control in genome-wide association studies with population structure.基于群体结构的全基因组关联研究中的假发现率控制。
Proc Natl Acad Sci U S A. 2021 Oct 5;118(40). doi: 10.1073/pnas.2105841118.
3
A U-statistics for integrative analysis of multilayer omics data.用于多层组学数据综合分析的 U 统计量。
Bioinformatics. 2020 Apr 15;36(8):2365-2374. doi: 10.1093/bioinformatics/btaa004.
4
Apolipoprotein E and Alzheimer disease: pathobiology and targeting strategies.载脂蛋白 E 与阿尔茨海默病:发病机制与靶向治疗策略。
Nat Rev Neurol. 2019 Sep;15(9):501-518. doi: 10.1038/s41582-019-0228-7. Epub 2019 Jul 31.
5
Non-coding variability at the APOE locus contributes to the Alzheimer's risk.APOE 基因座的非编码变异与阿尔茨海默病风险相关。
Nat Commun. 2019 Jul 25;10(1):3310. doi: 10.1038/s41467-019-10945-z.
6
A multivariate Poisson-log normal mixture model for clustering transcriptome sequencing data.一种用于转录组测序数据聚类的多元泊松-对数正态混合模型。
BMC Bioinformatics. 2019 Jul 16;20(1):394. doi: 10.1186/s12859-019-2916-0.
7
Gene hunting with hidden Markov model knockoffs.使用隐马尔可夫模型仿样进行基因搜寻。
Biometrika. 2019 Mar;106(1):1-18. doi: 10.1093/biomet/asy033. Epub 2018 Aug 4.
8
The prognostic landscape of interactive biological processes presents treatment responses in cancer.交互式生物过程的预后图景呈现出癌症的治疗反应。
EBioMedicine. 2019 Mar;41:120-133. doi: 10.1016/j.ebiom.2019.01.064. Epub 2019 Feb 22.
9
Discovery and functional prioritization of Parkinson's disease candidate genes from large-scale whole exome sequencing.通过大规模全外显子组测序发现帕金森病候选基因并进行功能优先级排序。
Genome Biol. 2017 Jan 30;18(1):22. doi: 10.1186/s13059-017-1147-9.
10
Beta-Poisson model for single-cell RNA-seq data analyses.单细胞 RNA-seq 数据分析的 Beta-Poisson 模型。
Bioinformatics. 2016 Jul 15;32(14):2128-35. doi: 10.1093/bioinformatics/btw202. Epub 2016 Apr 19.