• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

癌症人群研究中的样本混淆。

Calling sample mix-ups in cancer population studies.

机构信息

Department of Oncology, University of Cambridge, Cambridge, Cambridgeshire, United Kingdom.

出版信息

PLoS One. 2012;7(8):e41815. doi: 10.1371/journal.pone.0041815. Epub 2012 Aug 9.

DOI:10.1371/journal.pone.0041815
PMID:22912679
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3415393/
Abstract

Sample tracking errors have been and always will be a part of the practical implementation of large experiments. It has recently been proposed that expression quantitative trait loci (eQTLs) and their associated effects could be used to identify sample mix-ups and this approach has been applied to a number of large population genomics studies to illustrate the prevalence of the problem. We had adopted a similar approach, termed 'BADGER', in the METABRIC project. METABRIC is a large breast cancer study that may have been the first in which eQTL-based detection of mismatches was used during the study, rather than after the event, to aid quality assurance. We report here on the particular issues associated with large cancer studies performed using historical samples, which complicate the interpretation of such approaches. In particular we identify the complications of using tumour samples, of considering cellularity and RNA quality, of distinct subgroups existing in the study population (including family structures), and of choosing eQTLs to use. We also present some results regarding the design of experiments given consideration of these matters. The eQTL-based approach to identifying sample tracking errors is seen to be of value to these studies, but requiring care in its implementation.

摘要

样本跟踪误差一直以来都是大型实验实际实施过程中的一部分。最近有人提出,可以利用表达数量性状基因座(eQTL)及其相关效应来识别样本混淆,并且这种方法已经应用于许多大型人群基因组学研究中,以说明该问题的普遍性。在 METABRIC 项目中,我们采用了一种类似的方法,称为“BADGER”。METABRIC 是一项大型乳腺癌研究,可能是首次在研究过程中使用基于 eQTL 的错配检测,而不是在事件发生后,以帮助质量保证。我们在此报告与使用历史样本进行的大型癌症研究相关的特定问题,这些问题使此类方法的解释变得复杂。特别是,我们确定了使用肿瘤样本、考虑细胞数量和 RNA 质量、研究人群中存在的不同亚组(包括家族结构)以及选择要使用的 eQTL 所带来的复杂性。我们还根据这些因素的考虑,介绍了一些关于实验设计的结果。基于 eQTL 的方法识别样本跟踪误差被认为对这些研究具有价值,但在实施过程中需要谨慎。

相似文献

1
Calling sample mix-ups in cancer population studies.癌症人群研究中的样本混淆。
PLoS One. 2012;7(8):e41815. doi: 10.1371/journal.pone.0041815. Epub 2012 Aug 9.
2
Identification and Correction of Sample Mix-Ups in Expression Genetic Data: A Case Study.表达遗传数据中样本混淆的识别与纠正:一个案例研究
G3 (Bethesda). 2015 Aug 19;5(10):2177-86. doi: 10.1534/g3.115.019778.
3
MixupMapper: correcting sample mix-ups in genome-wide datasets increases power to detect small genetic effects.MixupMapper:纠正全基因组数据集的样本混淆可提高检测微小遗传效应的能力。
Bioinformatics. 2011 Aug 1;27(15):2104-11. doi: 10.1093/bioinformatics/btr323. Epub 2011 Jun 7.
4
Sequence polymorphisms cause many false cis eQTLs.序列多态性导致许多假的顺式 eQTL。
PLoS One. 2007 Jul 18;2(7):e622. doi: 10.1371/journal.pone.0000622.
5
Genetical toxicogenomics in Drosophila identifies master-modulatory loci that are regulated by developmental exposure to lead.果蝇遗传毒理基因组学鉴定出受铅暴露调控的主调节基因座
Neurotoxicology. 2009 Nov;30(6):898-914. doi: 10.1016/j.neuro.2009.08.011. Epub 2009 Sep 6.
6
Integrated genome-wide analysis of expression quantitative trait loci aids interpretation of genomic association studies.表达数量性状基因座的全基因组综合分析有助于解释基因组关联研究。
Genome Biol. 2017 Jan 25;18(1):16. doi: 10.1186/s13059-016-1142-6.
7
Expression Quantitative Trait Loci (eQTL) Analysis in Cancer.癌症中的表达数量性状基因座(eQTL)分析
Methods Mol Biol. 2020;2082:189-199. doi: 10.1007/978-1-0716-0026-9_13.
8
Weighting by heritability for detection of quantitative trait loci with microarray estimates of gene expression.利用基因表达的微阵列估计值通过遗传力加权来检测数量性状基因座。
Genome Biol. 2005;6(3):R27. doi: 10.1186/gb-2005-6-3-r27. Epub 2005 Feb 28.
9
ncRNA-eQTL: a database to systematically evaluate the effects of SNPs on non-coding RNA expression across cancer types.ncRNA-eQTL:一个系统评估 SNPs 对多种癌症中非编码 RNA 表达影响的数据库。
Nucleic Acids Res. 2020 Jan 8;48(D1):D956-D963. doi: 10.1093/nar/gkz711.
10
Expression quantitative trait loci (eQTL) mapping in Puerto Rican children.波多黎各儿童的表达数量性状基因座(eQTL)定位
PLoS One. 2015 Mar 27;10(3):e0122464. doi: 10.1371/journal.pone.0122464. eCollection 2015.

引用本文的文献

1
Rapid brain tumor classification from sparse epigenomic data.基于稀疏表观基因组数据的快速脑肿瘤分类
Nat Med. 2025 Mar;31(3):840-848. doi: 10.1038/s41591-024-03435-3. Epub 2025 Feb 28.
2
reGenotyper: Detecting mislabeled samples in genetic data.reGenotyper:检测基因数据中标记错误的样本。
PLoS One. 2017 Feb 13;12(2):e0171324. doi: 10.1371/journal.pone.0171324. eCollection 2017.
3
Integration of copy number and transcriptomics provides risk stratification in prostate cancer: A discovery and validation cohort study.

本文引用的文献

1
Patterns of cis regulatory variation in diverse human populations.不同人类群体中顺式调控变异的模式。
PLoS Genet. 2012;8(4):e1002639. doi: 10.1371/journal.pgen.1002639. Epub 2012 Apr 19.
2
The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups.2000 个乳腺肿瘤的基因组和转录组结构揭示了新的亚群。
Nature. 2012 Apr 18;486(7403):346-52. doi: 10.1038/nature10983.
3
Bayesian method to predict individual SNP genotypes from gene expression data.贝叶斯方法从基因表达数据预测个体 SNP 基因型。
拷贝数与转录组学整合为前列腺癌提供风险分层:一项发现与验证队列研究。
EBioMedicine. 2015 Jul 29;2(9):1133-44. doi: 10.1016/j.ebiom.2015.07.017. eCollection 2015 Sep.
4
Identification and Correction of Sample Mix-Ups in Expression Genetic Data: A Case Study.表达遗传数据中样本混淆的识别与纠正:一个案例研究
G3 (Bethesda). 2015 Aug 19;5(10):2177-86. doi: 10.1534/g3.115.019778.
5
A tumor DNA complex aberration index is an independent predictor of survival in breast and ovarian cancer.肿瘤DNA复合畸变指数是乳腺癌和卵巢癌生存的独立预测指标。
Mol Oncol. 2015 Jan;9(1):115-27. doi: 10.1016/j.molonc.2014.07.019. Epub 2014 Aug 8.
Nat Genet. 2012 May;44(5):603-8. doi: 10.1038/ng.2248.
4
BeadArray expression analysis using bioconductor.使用 bioconductor 进行 BeadArray 表达谱分析。
PLoS Comput Biol. 2011 Dec;7(12):e1002276. doi: 10.1371/journal.pcbi.1002276. Epub 2011 Dec 1.
5
MixupMapper: correcting sample mix-ups in genome-wide datasets increases power to detect small genetic effects.MixupMapper:纠正全基因组数据集的样本混淆可提高检测微小遗传效应的能力。
Bioinformatics. 2011 Aug 1;27(15):2104-11. doi: 10.1093/bioinformatics/btr323. Epub 2011 Jun 7.
6
Neurospora crassa mat A-2 and mat A-3 proteins weakly interact in the yeast two-hybrid system and affect yeast growth.粗糙脉孢菌 mat A-2 和 mat A-3 蛋白在酵母双杂交系统中弱相互作用,并影响酵母生长。
Genet Mol Biol. 2009 Apr;32(2):354-61. doi: 10.1590/S1415-47572009000200023. Epub 2009 Jun 1.
7
The architecture of gene regulatory variation across multiple human tissues: the MuTHER study.人类多种组织中基因调控变异的结构:MuTHER 研究。
PLoS Genet. 2011 Feb 3;7(2):e1002003. doi: 10.1371/journal.pgen.1002003.
8
Utilization of AFFX spike-in control probes to monitor sample identity throughout Affymetrix GeneChip Array processing.利用 AFFX Spike-in 对照探针在整个 Affymetrix GeneChip 阵列处理过程中监测样本身份。
Biotechniques. 2010 May;48(5):371-8. doi: 10.2144/000113421.
9
Genetics and beyond--the transcriptome of human monocytes and disease susceptibility.遗传学及其他——人类单核细胞的转录组与疾病易感性。
PLoS One. 2010 May 18;5(5):e10693. doi: 10.1371/journal.pone.0010693.
10
A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data.Illumina BeadArray 重注释管道:改善基因表达数据的解读。
Nucleic Acids Res. 2010 Jan;38(3):e17. doi: 10.1093/nar/gkp942. Epub 2009 Nov 18.