• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于组学全关联分析的具有错误发现率控制的新颖且稳健的特征选择方法。

A novel and robust feature selection method with FDR control for omics-wide association analysis.

作者信息

Chen Zhibo, Lu Zi-Tong, Song Xue-Ting, Gao Yu-Fan, Xiao Jian

机构信息

School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan, Hubei, People's Republic of China.

出版信息

PLoS One. 2025 Aug 22;20(8):e0300490. doi: 10.1371/journal.pone.0300490. eCollection 2025.

DOI:10.1371/journal.pone.0300490
PMID:40845052
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12373251/
Abstract

Omics-wide association analysis is a very important tool for medicine and human health study. However, the modern omics data sets collected often exhibit the high-dimensionality, unknown distribution response, unknown distribution features and unknown complex association relationships between the response and its explanatory features. Reliable association analysis results depend on an accurate modeling for such data sets. Most of the existing association analysis methods rely on the specific model assumptions and lack effective false discovery rate (FDR) control. To address these limitations, the paper firstly applies a single index model for omics data. The model shows robust performance in allowing the relationships between the response variable and linear combination of covariates to be connected by any unknown monotonic link function, and both the random error and the covariates can follow any unknown distribution. Then based on this model, the paper combines rank-based approach and symmetrized data aggregation approach to develop a novel and robust feature selection method for achieving fine-mapping of risk features while controlling the false positive rate of selection. The theoretical results support the proposed method and the analysis results of simulated data show the new method possesses effective and robust performance for all the scenarios. The new method is also used to analyze the two real datasets and identifies some risk features unreported by the existing finds.

摘要

全基因组关联分析是医学和人类健康研究中非常重要的工具。然而,收集到的现代组学数据集常常呈现出高维度、响应分布未知、特征分布未知以及响应与其解释性特征之间复杂关联关系未知的特点。可靠的关联分析结果依赖于对此类数据集进行准确建模。现有的大多数关联分析方法依赖于特定的模型假设,并且缺乏有效的错误发现率(FDR)控制。为了解决这些局限性,本文首先将单指标模型应用于组学数据。该模型表现出强大的性能,它允许响应变量与协变量的线性组合之间的关系通过任何未知的单调链接函数来连接,并且随机误差和协变量都可以遵循任何未知分布。然后基于此模型,本文结合基于秩的方法和对称数据聚合方法,开发了一种新颖且强大的特征选择方法,以在控制选择假阳性率的同时实现风险特征的精细定位。理论结果支持了所提出的方法,模拟数据的分析结果表明新方法在所有场景下都具有有效且稳健的性能。新方法还被用于分析两个真实数据集,并识别出一些现有研究未报道的风险特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/dd4c00ef055f/pone.0300490.g022.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/8f5820f71625/pone.0300490.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/61899cfa7d9c/pone.0300490.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/b4920bdc71f8/pone.0300490.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/65a0ad4e2281/pone.0300490.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/89dbd481cc86/pone.0300490.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/c601d3723e27/pone.0300490.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/ddcbcbd887b6/pone.0300490.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/b96e19e27624/pone.0300490.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/92a95ddeb61d/pone.0300490.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/e7292bd4b76e/pone.0300490.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/c24f76477ed5/pone.0300490.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/8d96b058f6b7/pone.0300490.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/a2bab1f86897/pone.0300490.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/c8173dcfff5c/pone.0300490.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/7c5c3b1d7589/pone.0300490.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/c45e5e65f5a1/pone.0300490.g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/f3d9b3dfe83c/pone.0300490.g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/e4132c84982d/pone.0300490.g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/1c387039392e/pone.0300490.g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/43b2d52eb39a/pone.0300490.g020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/4d7b824bec9a/pone.0300490.g021.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/dd4c00ef055f/pone.0300490.g022.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/8f5820f71625/pone.0300490.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/61899cfa7d9c/pone.0300490.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/b4920bdc71f8/pone.0300490.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/65a0ad4e2281/pone.0300490.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/89dbd481cc86/pone.0300490.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/c601d3723e27/pone.0300490.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/ddcbcbd887b6/pone.0300490.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/b96e19e27624/pone.0300490.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/92a95ddeb61d/pone.0300490.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/e7292bd4b76e/pone.0300490.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/c24f76477ed5/pone.0300490.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/8d96b058f6b7/pone.0300490.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/a2bab1f86897/pone.0300490.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/c8173dcfff5c/pone.0300490.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/7c5c3b1d7589/pone.0300490.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/c45e5e65f5a1/pone.0300490.g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/f3d9b3dfe83c/pone.0300490.g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/e4132c84982d/pone.0300490.g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/1c387039392e/pone.0300490.g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/43b2d52eb39a/pone.0300490.g020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/4d7b824bec9a/pone.0300490.g021.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77e1/12373251/dd4c00ef055f/pone.0300490.g022.jpg

相似文献

1
A novel and robust feature selection method with FDR control for omics-wide association analysis.一种用于组学全关联分析的具有错误发现率控制的新颖且稳健的特征选择方法。
PLoS One. 2025 Aug 22;20(8):e0300490. doi: 10.1371/journal.pone.0300490. eCollection 2025.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.
4
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
5
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
6
Sexual Harassment and Prevention Training性骚扰与预防培训
7
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
8
Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.染色体臂 1p 和 19q 缺失的检测在胶质瘤患者中的诊断准确性和成本效益。
Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.
9
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
10
Eliciting adverse effects data from participants in clinical trials.从临床试验参与者中获取不良反应数据。
Cochrane Database Syst Rev. 2018 Jan 16;1(1):MR000039. doi: 10.1002/14651858.MR000039.pub2.

本文引用的文献

1
Tree-aggregated predictive modeling of microbiome data.基于树的微生物组数据预测模型构建。
Sci Rep. 2021 Jul 15;11(1):14505. doi: 10.1038/s41598-021-93645-3.
2
Testing for mediation effect with application to human microbiome data.人类微生物组数据中介效应检验及应用
Stat Biosci. 2021 Jul;13(2):313-328. doi: 10.1007/s12561-019-09253-3. Epub 2019 Jul 27.
3
Faecal microbiota transplantation halts progression of human new-onset type 1 diabetes in a randomised controlled trial.粪便微生物群移植在随机对照试验中阻止了人类新发 1 型糖尿病的进展。
Gut. 2021 Jan;70(1):92-105. doi: 10.1136/gutjnl-2020-322630. Epub 2020 Oct 26.
4
Integration of single-cell multi-omics for gene regulatory network inference.整合单细胞多组学以推断基因调控网络。
Comput Struct Biotechnol J. 2020 Jun 29;18:1925-1938. doi: 10.1016/j.csbj.2020.06.033. eCollection 2020.
5
Tara Oceans: towards global ocean ecosystems biology.塔拉海洋:走向全球海洋生态系统生物学。
Nat Rev Microbiol. 2020 Aug;18(8):428-445. doi: 10.1038/s41579-020-0364-5. Epub 2020 May 12.
6
High-dimensional regression in practice: an empirical study of finite-sample prediction, variable selection and ranking.高维回归的实际应用:有限样本预测、变量选择与排序的实证研究
Stat Comput. 2020;30(3):697-719. doi: 10.1007/s11222-019-09914-9. Epub 2019 Dec 19.
7
The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation.马修斯相关系数(MCC)在二分类评估中优于 F1 得分和准确率的优势。
BMC Genomics. 2020 Jan 2;21(1):6. doi: 10.1186/s12864-019-6413-7.
8
A two-stage microbial association mapping framework with advanced FDR control.一种具有先进 FDR 控制的两阶段微生物关联映射框架。
Microbiome. 2018 Jul 25;6(1):131. doi: 10.1186/s40168-018-0517-1.
9
Generalized linear models with linear constraints for microbiome compositional data.用于微生物组组成数据的具有线性约束的广义线性模型。
Biometrics. 2019 Mar;75(1):235-244. doi: 10.1111/biom.12956. Epub 2018 Aug 10.
10
False discovery rate control incorporating phylogenetic tree increases detection power in microbiome-wide multiple testing.将系统发育树纳入假发现率控制可提高微生物组广泛多重检验中的检测能力。
Bioinformatics. 2017 Sep 15;33(18):2873-2881. doi: 10.1093/bioinformatics/btx311.