Exact Integral Formulas for False Discovery Rate and the Variance of False Discovery Proportion.

Affiliation

Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch, 301 University Blvd, Galveston, Texas 77555, United States.

Publication information

J Proteome Res. 2024 Jun 7;23(6):2298-2305. doi: 10.1021/acs.jproteome.3c00842. Epub 2024 May 29.

DOI:10.1021/acs.jproteome.3c00842
PMID:38809146
Abstract

Multiple hypothesis testing is an integral component of data analysis for large-scale technologies such as proteomics, transcriptomics, or metabolomics, for which the false discovery rate (FDR) and positive FDR (pFDR) have been accepted as error estimation and control measures. The pFDR is the expectation of the false discovery proportion (FDP), conditional on at least one rejection; the FDP is the ratio of the number of rejected true null hypotheses to the number of all rejected hypotheses. In practice, the expectation of the ratio is approximated by the ratio of expectations; however, the conditions under which the former can be replaced by the latter have not been investigated. This work derives exact integral expressions for the expectation (pFDR) and variance of the FDP. The widely used approximation (ratio of expectations) is shown to be a particular case of the integral formula for pFDR, in the limit of a large sample size. A recurrence formula is provided to compute the pFDR for a predefined number of null hypotheses. The variance of the FDP is approximated for a practical application in peptide identification using forward and reversed protein sequences. The simulations demonstrate that the integral expression is more accurate than the approximate formula when the number of hypotheses is small. For large sample sizes, the pFDRs obtained by the integral expression and by the approximation do not differ substantially. Applications to proteomics data sets are included.
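The gap between the expectation of a ratio and the ratio of expectations can be illustrated with a small numerical sketch. It is not the integral or recurrence formula from the paper, which the abstract does not reproduce. It assumes a deliberately simple model in which a fixed number `m0` of true nulls are each rejected independently with probability `alpha`, and a fixed number `m1` of true alternatives are each rejected independently with probability `power`. The function names and parameters are hypothetical and chosen only for illustration.

```python
from math import comb


def binom_pmf(k, n, p):
    """Probability of k successes in n independent trials with success rate p."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def exact_pfdr(m0, m1, alpha, power):
    """E[V / (V + S) | V + S > 0] by brute-force enumeration.

    Assumed model (not taken from the paper):
      V ~ Binomial(m0, alpha)  false rejections among m0 true nulls
      S ~ Binomial(m1, power)  true rejections among m1 true alternatives
    with V and S independent.
    """
    p_no_rejection = binom_pmf(0, m0, alpha) * binom_pmf(0, m1, power)
    total = 0.0
    for v in range(m0 + 1):
        pv = binom_pmf(v, m0, alpha)
        for s in range(m1 + 1):
            if v + s == 0:
                continue  # FDP is undefined when nothing is rejected
            total += pv * binom_pmf(s, m1, power) * v / (v + s)
    return total / (1 - p_no_rejection)


def ratio_of_expectations(m0, m1, alpha, power):
    """The common approximation E[V] / E[V + S] under the same assumed model."""
    return m0 * alpha / (m0 * alpha + m1 * power)


if __name__ == "__main__":
    for m in (1, 5, 20, 100):
        exact = exact_pfdr(m, m, 0.1, 0.9)
        approx = ratio_of_expectations(m, m, 0.1, 0.9)
        print(f"m0=m1={m:3d}  exact={exact:.5f}  approx={approx:.5f}")
```

Under these assumptions the two quantities differ noticeably when only a few hypotheses are tested and move closer together as the number of hypotheses grows, which is consistent with the behavior the abstract describes.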
