• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过豪斯霍尔德反射实现的反射仿冒品:在蛋白质组学和基因精细定位中的应用

Reflection Knockoffs via Householder Reflection: Applications in Proteomics and Genetic Fine Mapping.

作者信息

Guan Yongtao, Levy Daniel

出版信息

bioRxiv. 2025 May 29:2025.01.16.633369. doi: 10.1101/2025.01.16.633369.

DOI:10.1101/2025.01.16.633369
PMID:40568129
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12190757/
Abstract

We present a novel knockoff construction method, and demonstrate its superior performance in two applications: identifying proteomic signatures of age and genetic fine mapping. Both applications involve datasets of highly correlated features, but they differ in the abundance of driver associations. Our primary contribution is the invention of the reflection knockoff, which is constructed from mirror images - obtained via Householder reflection - of the original features. The reflection knockoffs substantially outperform Model-X knockoffs in feature selection, particularly when features are highly correlated. Our secondary contribution is a simple method to aggregate multiple sets of identically distributed knockoff statistics to improve the consistency of knockoff filters. In the study of proteomic signatures of age, single feature tests showed overly abundant proteomic association with age. Knockoff filters using reflection knockoffs and aggregation, however, revealed that a majority of these associations are hitchhikers instead of drivers. When applied to genetic fine mapping, knockoff filters using reflection knockoffs and aggregation outperform a state-of-the-art method. We discuss a potentially exciting application of reflection knockoffs: sharing genetic data without raising concerns about privacy and regulatory violations.

摘要

我们提出了一种新颖的替代构建方法,并在两个应用中展示了其卓越性能:识别年龄的蛋白质组学特征以及基因精细定位。这两个应用都涉及高度相关特征的数据集,但驱动关联的丰富程度有所不同。我们的主要贡献是发明了反射替代,它是由原始特征通过豪斯霍尔德反射获得的镜像构建而成。在特征选择方面,反射替代显著优于X模型替代,特别是当特征高度相关时。我们的次要贡献是一种简单的方法,用于聚合多组同分布的替代统计量,以提高替代筛选器的一致性。在年龄的蛋白质组学特征研究中,单特征测试显示与年龄的蛋白质组关联过多。然而,使用反射替代和聚合的替代筛选器表明,这些关联中的大多数是搭便车者而非驱动因素。当应用于基因精细定位时,使用反射替代和聚合的替代筛选器优于一种先进方法。我们讨论了反射替代一个潜在的令人兴奋的应用:在不引发隐私和监管违规担忧的情况下共享基因数据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a72/12190757/2e349f309e44/nihpp-2025.01.16.633369v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a72/12190757/c3c4c5299301/nihpp-2025.01.16.633369v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a72/12190757/737a8fca713a/nihpp-2025.01.16.633369v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a72/12190757/2e349f309e44/nihpp-2025.01.16.633369v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a72/12190757/c3c4c5299301/nihpp-2025.01.16.633369v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a72/12190757/737a8fca713a/nihpp-2025.01.16.633369v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a72/12190757/2e349f309e44/nihpp-2025.01.16.633369v2-f0003.jpg

相似文献

1
Reflection Knockoffs via Householder Reflection: Applications in Proteomics and Genetic Fine Mapping.通过豪斯霍尔德反射实现的反射仿冒品:在蛋白质组学和基因精细定位中的应用
bioRxiv. 2025 May 29:2025.01.16.633369. doi: 10.1101/2025.01.16.633369.
2
Reflection Knockoffs via Householder Reflection: Applications in Proteomics and Genetic Fine Mapping.通过豪斯霍尔德反射实现反射仿冒品:在蛋白质组学和基因精细定位中的应用
Genetics. 2025 Aug 29. doi: 10.1093/genetics/iyaf178.
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
Second-order group knockoffs with applications to genome-wide association studies.二阶群组置换检验及其在全基因组关联研究中的应用。
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae580.
5
It's a wrap: deriving distinct discoveries with FDR control after a GWAS pipeline.大功告成:在全基因组关联研究流程之后通过错误发现率控制得出不同的发现。
bioRxiv. 2025 Jul 9:2025.06.05.658138. doi: 10.1101/2025.06.05.658138.
6
Knockoff-Based Fine Mapping of MS-Associated SNPs in Sardinian Trios.基于替代法对撒丁岛三人组中与多发性硬化症相关的单核苷酸多态性进行精细定位。
Biochem Genet. 2025 Aug 30. doi: 10.1007/s10528-025-11238-5.
7
PLSKO: a robust knockoff generator to control false discovery rate in omics variable selection.PLSKO:一种用于在组学变量选择中控制错误发现率的强大替代变量生成器。
Bioinformatics. 2025 Aug 29. doi: 10.1093/bioinformatics/btaf475.
8
Aspects of Genetic Diversity, Host Specificity and Public Health Significance of Single-Celled Intestinal Parasites Commonly Observed in Humans and Mostly Referred to as 'Non-Pathogenic'.人类常见且大多被称为“非致病性”的单细胞肠道寄生虫的遗传多样性、宿主特异性及公共卫生意义
APMIS. 2025 Sep;133(9):e70036. doi: 10.1111/apm.70036.
9
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
10
Search strategies to identify diagnostic accuracy studies in MEDLINE and EMBASE.在MEDLINE和EMBASE中识别诊断准确性研究的检索策略。
Cochrane Database Syst Rev. 2013 Sep 11;2013(9):MR000022. doi: 10.1002/14651858.MR000022.pub3.

本文引用的文献

1
Asymptotically exact fit for linear mixed model in genetic association studies.遗传关联研究中线性混合模型的渐近精确拟合。
Genetics. 2024 Oct 7;228(2). doi: 10.1093/genetics/iyae143.
2
Fine-mapping across diverse ancestries drives the discovery of putative causal variants underlying human complex traits and diseases.在不同的血统中进行精细映射可以发现人类复杂特征和疾病背后的潜在因果变异。
Nat Genet. 2024 Sep;56(9):1841-1850. doi: 10.1038/s41588-024-01870-z. Epub 2024 Aug 26.
3
Estimation of inbreeding and kinship coefficients via latent identity-by-descent states.
基于潜在的亲缘关系状态估计近亲系数和亲缘系数。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae082.
4
XMAP: Cross-population fine-mapping by leveraging genetic diversity and accounting for confounding bias.XMAP:利用遗传多样性并考虑混杂偏差进行跨人群精细映射。
Nat Commun. 2023 Oct 28;14(1):6870. doi: 10.1038/s41467-023-42614-7.
5
Plasma proteomic associations with genetics and health in the UK Biobank.英国生物库中血浆蛋白质组与遗传学和健康的关联。
Nature. 2023 Oct;622(7982):329-338. doi: 10.1038/s41586-023-06592-6. Epub 2023 Oct 4.
6
A simple new approach to variable selection in regression, with application to genetic fine mapping.一种用于回归中变量选择的简单新方法及其在基因精细定位中的应用。
J R Stat Soc Series B Stat Methodol. 2020 Dec;82(5):1273-1300. doi: 10.1111/rssb.12388. Epub 2020 Jul 10.
7
GhostKnockoff inference empowers identification of putative causal variants in genome-wide association studies.幽灵复刻推断使全基因组关联研究中假定因果变异的识别成为可能。
Nat Commun. 2022 Nov 23;13(1):7209. doi: 10.1038/s41467-022-34932-z.
8
Identification of putative causal loci in whole-genome sequencing data via knockoff statistics.基于置换统计量的全基因组测序数据中假定因果基因座的识别。
Nat Commun. 2021 May 25;12(1):3152. doi: 10.1038/s41467-021-22889-4.
9
Plasma proteomic biomarker signature of age predicts health and life span.血浆蛋白质组生物标志物特征可预测年龄与寿命和健康。
Elife. 2020 Nov 19;9:e61073. doi: 10.7554/eLife.61073.
10
Multi-resolution localization of causal variants across the genome.全基因组因果变异的多分辨率定位。
Nat Commun. 2020 Feb 27;11(1):1093. doi: 10.1038/s41467-020-14791-2.