• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

解决样本混淆问题:大规模多组学研究的工具与方法

Addressing Sample Mix-Ups: Tools and Approaches for Large-Scale Multi-Omics Studies.

作者信息

Fu Yingxue, Yuan Zuo-Fei, Wu Long, Peng Junmin, Wang Xusheng, High Anthony A

机构信息

Center for Proteomics and Metabolomics, St. Jude Children's Research Hospital, Memphis, Tennessee, USA.

Department of Structural Biology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA.

出版信息

Proteomics. 2025 Jan;25(1-2):e202400271. doi: 10.1002/pmic.202400271. Epub 2024 Dec 10.

DOI:10.1002/pmic.202400271
PMID:39659081
Abstract

Advances in high-throughput omics technologies have enabled system-wide characterization of biological samples across multiple molecular levels, such as the genome, transcriptome, and proteome. However, as sample sizes rapidly increase in large-scale multi-omics studies, sample mix-ups have become a prevalent issue, compromising data integrity and leading to erroneous conclusions. The interconnected nature of multi-omics data presents an opportunity to identify and correct these errors. This review examines the potential sources of sample mix-ups and evaluates the methodologies and tools developed for detecting and correcting these errors, with an emphasis on approaches applicable to proteomics data. We categorize existing tools into three main groups: expression/protein quantitative trait loci-based, genotype concordance-based, and gene/protein expression correlation-based approaches. Notably, only a handful of tools currently utilize the proteogenomics approach for correcting sample mix-ups at the proteomics level. Integrating the strengths of current tools across diverse data types could enable the development of more versatile and comprehensive solutions. In conclusion, verifying sample identity is a critical first step to reduce bias and increase precision in subsequent analyses for large-scale multi-omics studies. By leveraging these tools for identifying and correcting sample mix-ups, researchers can significantly improve the reliability and reproducibility of biomedical research.

摘要

高通量组学技术的进步使得能够在多个分子水平上对生物样本进行全系统表征,如基因组、转录组和蛋白质组。然而,在大规模多组学研究中,随着样本量迅速增加,样本混淆已成为一个普遍问题,损害了数据完整性并导致错误结论。多组学数据的相互关联特性为识别和纠正这些错误提供了契机。本综述探讨了样本混淆的潜在来源,并评估了为检测和纠正这些错误而开发的方法和工具,重点关注适用于蛋白质组学数据的方法。我们将现有工具分为三大类:基于表达/蛋白质数量性状位点的方法、基于基因型一致性的方法以及基于基因/蛋白质表达相关性的方法。值得注意的是,目前只有少数工具利用蛋白质基因组学方法在蛋白质组学水平上纠正样本混淆。整合当前工具在不同数据类型中的优势,有望开发出更通用、更全面的解决方案。总之,验证样本身份是减少大规模多组学研究后续分析中的偏差并提高精度的关键第一步。通过利用这些工具来识别和纠正样本混淆,研究人员能够显著提高生物医学研究的可靠性和可重复性。

相似文献

1
Addressing Sample Mix-Ups: Tools and Approaches for Large-Scale Multi-Omics Studies.解决样本混淆问题:大规模多组学研究的工具与方法
Proteomics. 2025 Jan;25(1-2):e202400271. doi: 10.1002/pmic.202400271. Epub 2024 Dec 10.
2
An overview of technologies for MS-based proteomics-centric multi-omics.基于 MS 的蛋白质组学中心型多组学技术概述。
Expert Rev Proteomics. 2022 Mar;19(3):165-181. doi: 10.1080/14789450.2022.2070476. Epub 2022 May 2.
3
DRAMS: A tool to detect and re-align mixed-up samples for integrative studies of multi-omics data.DRAMS:一种用于检测和重新对齐混合样本的工具,用于多组学数据的综合研究。
PLoS Comput Biol. 2020 Apr 13;16(4):e1007522. doi: 10.1371/journal.pcbi.1007522. eCollection 2020 Apr.
4
The Addition of Transcriptomics to the Bead-Enabled Accelerated Monophasic Multi-Omics Method: A Step toward Universal Sample Preparation.转录组学在珠加速单相多组学方法中的应用:迈向通用样品制备的一步。
Anal Chem. 2024 Nov 19;96(46):18343-18348. doi: 10.1021/acs.analchem.4c02835. Epub 2024 Oct 9.
5
MixupMapper: correcting sample mix-ups in genome-wide datasets increases power to detect small genetic effects.MixupMapper:纠正全基因组数据集的样本混淆可提高检测微小遗传效应的能力。
Bioinformatics. 2011 Aug 1;27(15):2104-11. doi: 10.1093/bioinformatics/btr323. Epub 2011 Jun 7.
6
A multi-omics data simulator for complex disease studies and its application to evaluate multi-omics data analysis methods for disease classification.用于复杂疾病研究的多组学数据模拟器及其在评估疾病分类的多组学数据分析方法中的应用。
Gigascience. 2019 May 1;8(5). doi: 10.1093/gigascience/giz045.
7
Identification of novel therapeutic targets for chronic kidney disease and kidney function by integrating multi-omics proteome with transcriptome.通过整合多组学蛋白质组学和转录组学,鉴定慢性肾脏病和肾功能的新治疗靶点。
Genome Med. 2024 Jun 19;16(1):84. doi: 10.1186/s13073-024-01356-x.
8
Unravelling disease complexity: integrative analysis of multi-omic data in clinical research.解析疾病复杂性:临床研究中多组学数据的综合分析
Expert Rev Proteomics. 2025 Apr;22(4):149-162. doi: 10.1080/14789450.2025.2491357. Epub 2025 Apr 13.
9
Advance computational tools for multiomics data learning.多组学数据学习的先进计算工具。
Biotechnol Adv. 2024 Dec;77:108447. doi: 10.1016/j.biotechadv.2024.108447. Epub 2024 Sep 7.
10
Integrative Analysis of Multi-omics Data for Discovery and Functional Studies of Complex Human Diseases.用于复杂人类疾病发现和功能研究的多组学数据综合分析
Adv Genet. 2016;93:147-90. doi: 10.1016/bs.adgen.2015.11.004. Epub 2016 Jan 25.