患者来源模型基因组分析中鼠污染的影响及稳健分析的最佳实践。

Impact of mouse contamination in genomic profiling of patient-derived models and best practice for robust analysis.

机构信息

Department of Biomedical Systems Informatics and Brain Korea 21 PLUS Project for Medical Science, Yonsei University College of Medicine, Seoul, 03722, South Korea.

出版信息

Genome Biol. 2019 Nov 11;20(1):231. doi: 10.1186/s13059-019-1849-2.

DOI:10.1186/s13059-019-1849-2

PMID:31707992

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6844030/

Abstract

BACKGROUND

Patient-derived xenograft and cell line models are popular models for clinical cancer research. However, the inevitable inclusion of a mouse genome in a patient-derived model is a remaining concern in the analysis. Although multiple tools and filtering strategies have been developed to account for this, research has yet to demonstrate the exact impact of the mouse genome and the optimal use of these tools and filtering strategies in an analysis pipeline.

RESULTS

We construct a benchmark dataset of 5 liver tissues from 3 mouse strains using human whole-exome sequencing kit. Next-generation sequencing reads from mouse tissues are mappable to 49% of the human genome and 409 cancer genes. In total, 1,207,556 mouse-specific alleles are aligned to the human genome reference, including 467,232 (38.7%) alleles with high sensitivity to contamination, which are pervasive causes of false cancer mutations in public databases and are signatures for predicting global contamination. Next, we assess the performance of 8 filtering methods in terms of mouse read filtration and reduction of mouse-specific alleles. All filtering tools generally perform well, although differences in algorithm strictness and efficiency of mouse allele removal are observed. Therefore, we develop a best practice pipeline that contains the estimation of contamination level, mouse read filtration, and variant filtration.

CONCLUSIONS

The inclusion of mouse cells in patient-derived models hinders genomic analysis and should be addressed carefully. Our suggested guidelines improve the robustness and maximize the utility of genomic analysis of these models.

摘要

背景

患者来源的异种移植和细胞系模型是临床癌症研究中常用的模型。然而，在分析中不可避免地包含了小鼠基因组，这仍然是一个令人关注的问题。尽管已经开发了多种工具和过滤策略来解决这个问题，但研究尚未证明小鼠基因组的确切影响，以及在分析管道中最佳使用这些工具和过滤策略。

结果

我们使用人类全外显子测序试剂盒构建了 3 个小鼠品系 5 个肝组织的基准数据集。来自小鼠组织的下一代测序reads 可映射到人类基因组的 49%和 409 个癌症基因。总共，1207556 个小鼠特异性等位基因与人类基因组参考序列对齐，包括 467232 个（38.7%）具有高污染敏感性的等位基因，这些等位基因是公共数据库中假癌症突变的普遍原因，也是预测全局污染的特征。接下来，我们评估了 8 种过滤方法在过滤小鼠reads 和减少小鼠特异性等位基因方面的性能。所有过滤工具通常表现良好，尽管观察到算法严格性和去除小鼠等位基因的效率存在差异。因此，我们开发了一种最佳实践管道，其中包含污染水平估计、小鼠 read 过滤和变异过滤。

结论

患者来源模型中包含的小鼠细胞阻碍了基因组分析，应谨慎处理。我们提出的指南提高了这些模型的基因组分析的稳健性和最大利用价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb62/6844030/adfdb0d4d4d6/13059_2019_1849_Fig1_HTML.jpg

相似文献

Impact of mouse contamination in genomic profiling of patient-derived models and best practice for robust analysis.患者来源模型基因组分析中鼠污染的影响及稳健分析的最佳实践。

Genome Biol. 2019 Nov 11;20(1):231. doi: 10.1186/s13059-019-1849-2.

Medical implications of technical accuracy in genome sequencing.基因组测序技术准确性的医学意义。

Genome Med. 2016 Mar 2;8(1):24. doi: 10.1186/s13073-016-0269-0.

Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts?在处理来自患者来源的肿瘤异种移植的测序读数时，特殊的读段比对策略是否必要且具有成本效益？

BMC Genomics. 2014 Dec 23;15(1):1172. doi: 10.1186/1471-2164-15-1172.

tarSVM: Improving the accuracy of variant calls derived from microfluidic PCR-based targeted next generation sequencing using a support vector machine.tarSVM：使用支持向量机提高基于微流控PCR的靶向新一代测序得出的变异检测准确性。

BMC Bioinformatics. 2016 Jun 10;17(1):233. doi: 10.1186/s12859-016-1108-4.

Improved multiple displacement amplification (iMDA) and ultraclean reagents.改良的多重置换扩增（iMDA）和超净试剂。

BMC Genomics. 2014 Jun 6;15(1):443. doi: 10.1186/1471-2164-15-443.

Mendelian Inconsistent Signatures from 1314 Ancestrally Diverse Family Trios Distinguish Biological Variation from Sequencing Error.来自1314个具有不同祖先的三联体家庭的孟德尔不一致特征区分了生物学变异与测序错误。

J Comput Biol. 2019 May;26(5):405-419. doi: 10.1089/cmb.2018.0253. Epub 2019 Apr 3.

Human Contamination in Public Genome Assemblies.公共基因组组装中的人类污染

PLoS One. 2016 Sep 9;11(9):e0162424. doi: 10.1371/journal.pone.0162424. eCollection 2016.

Variant detection sensitivity and biases in whole genome and exome sequencing.全基因组和外显子组测序中的变异检测灵敏度和偏倚。

BMC Bioinformatics. 2014 Jul 19;15(1):247. doi: 10.1186/1471-2105-15-247.

Comparison of solution-based exome capture methods for next generation sequencing.基于溶液的外显子组捕获方法在下一代测序中的比较。

Genome Biol. 2011 Sep 28;12(9):R94. doi: 10.1186/gb-2011-12-9-r94.

Large scale comparison of non-human sequences in human sequencing data.人类测序数据中非人类序列的大规模比较。

Genomics. 2014 Dec;104(6 Pt B):453-8. doi: 10.1016/j.ygeno.2014.08.009. Epub 2014 Aug 27.

引用本文的文献

Dissecting cell-free DNA fragmentation variation in tumors using cell line-derived xenograft mouse.利用细胞系衍生的异种移植小鼠剖析肿瘤中游离DNA的片段化变异

PLoS One. 2025 Jul 15;20(7):e0327483. doi: 10.1371/journal.pone.0327483. eCollection 2025.

Residual DNA impurities in AAV vectors-nature and transcription.腺相关病毒载体中的残留DNA杂质——性质与转录

Mol Ther Methods Clin Dev. 2025 Jun 4;33(3):101503. doi: 10.1016/j.omtm.2025.101503. eCollection 2025 Sep 11.

Benchmarking mouse contamination removing protocols in patient-derived xenografts genomic profiling.在患者来源的异种移植基因组分析中对小鼠污染去除方案进行基准测试。

NPJ Precis Oncol. 2025 Apr 17;9(1):113. doi: 10.1038/s41698-025-00902-z.

Reversion of pathogenic L1780P mutation confers resistance to PARP and ATM inhibitor in breast cancer.致病性L1780P突变的逆转赋予乳腺癌对PARP和ATM抑制剂的抗性。

iScience. 2024 Jul 6;27(8):110469. doi: 10.1016/j.isci.2024.110469. eCollection 2024 Aug 16.

Extracellular matrix regulation of cell spheroid invasion in a 3D bioprinted solid tumor-on-a-chip.三维生物打印固体肿瘤芯片中细胞球体浸润的细胞外基质调节。

Acta Biomater. 2024 Sep 15;186:156-166. doi: 10.1016/j.actbio.2024.07.040. Epub 2024 Aug 7.

Molecular phenotyping of small cell lung cancer using targeted cfDNA profiling of transcriptional regulatory regions.使用靶向转录调控区域的 cfDNA 分析对小细胞肺癌进行分子表型分析。

Sci Adv. 2024 Apr 12;10(15):eadk2082. doi: 10.1126/sciadv.adk2082. Epub 2024 Apr 10.

Genomic comparison between an in vitro three-dimensional culture model of melanoma and the original primary tumor.黑色素瘤体外三维培养模型与原始原发性肿瘤的基因组比较。

Arch Dermatol Res. 2023 Jul;315(5):1225-1231. doi: 10.1007/s00403-022-02502-4. Epub 2022 Dec 13.

Nucleosome Patterns in Circulating Tumor DNA Reveal Transcriptional Regulation of Advanced Prostate Cancer Phenotypes.循环肿瘤 DNA 中的核小体模式揭示了晚期前列腺癌表型的转录调控。

Cancer Discov. 2023 Mar 1;13(3):632-653. doi: 10.1158/2159-8290.CD-22-0692.

Characterization of Leukemic Resistance to CD19-Targeted CAR T-cell Therapy through Deep Genomic Sequencing.通过深度基因组测序分析白血病对 CD19 靶向 CAR T 细胞治疗的耐药性。

Cancer Immunol Res. 2023 Jan 3;11(1):13-19. doi: 10.1158/2326-6066.CIR-22-0095.

Weight-bearing activity impairs nuclear membrane and genome integrity via YAP activation in plantar melanoma.负重活动通过 YAP 激活破坏足底黑素瘤的核膜和基因组完整性。

Nat Commun. 2022 Apr 25;13(1):2214. doi: 10.1038/s41467-022-29925-x.

本文引用的文献

The use of technical replication for detection of low-level somatic mutations in next-generation sequencing.利用技术复制检测下一代测序中的低水平体细胞突变。

Nat Commun. 2019 Mar 5;10(1):1047. doi: 10.1038/s41467-019-09026-y.

The COSMIC Cancer Gene Census: describing genetic dysfunction across all human cancers.COSMIC 癌症基因目录：描述所有人类癌症中的遗传功能障碍。

Nat Rev Cancer. 2018 Nov;18(11):696-705. doi: 10.1038/s41568-018-0060-1.

XenofilteR: computational deconvolution of mouse and human reads in tumor xenograft sequence data.XenofilteR：肿瘤异种移植序列数据中鼠和人读取的计算去卷积。

BMC Bioinformatics. 2018 Oct 4;19(1):366. doi: 10.1186/s12859-018-2353-5.

A comparison of next-generation sequencing analysis methods for cancer xenograft samples.用于癌症异种移植物样本的下一代测序分析方法的比较。

J Genet Genomics. 2018 Jul 20;45(7):345-350. doi: 10.1016/j.jgg.2018.07.001. Epub 2018 Jul 25.

Efficient algorithms for polyploid haplotype phasing.高效的多倍体单体型相位算法。

BMC Genomics. 2018 May 9;19(Suppl 2):110. doi: 10.1186/s12864-018-4464-9.

A review of somatic single nucleotide variant calling algorithms for next-generation sequencing data.用于下一代测序数据的体细胞单核苷酸变异检测算法综述。

Comput Struct Biotechnol J. 2018 Feb 6;16:15-24. doi: 10.1016/j.csbj.2018.01.003. eCollection 2018.

Using PDX for Preclinical Cancer Drug Discovery: The Evolving Field.利用人源肿瘤异种移植模型进行临床前癌症药物研发：不断发展的领域。

J Clin Med. 2018 Mar 2;7(3):41. doi: 10.3390/jcm7030041.

Computational deconvolution of transcriptomics data from mixed cell populations.计算从混合细胞群体中转录组数据的去卷积。

Bioinformatics. 2018 Jun 1;34(11):1969-1979. doi: 10.1093/bioinformatics/bty019.

Computational approach to discriminate human and mouse sequences in patient-derived tumour xenografts.计算方法区分患者来源的肿瘤异种移植物中的人源和鼠源序列。

BMC Genomics. 2018 Jan 5;19(1):19. doi: 10.1186/s12864-017-4414-y.

Human primary liver cancer-derived organoid cultures for disease modeling and drug screening.用于疾病建模和药物筛选的人原发性肝癌来源的类器官培养物。

Nat Med. 2017 Dec;23(12):1424-1435. doi: 10.1038/nm.4438. Epub 2017 Nov 13.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

患者来源模型基因组分析中鼠污染的影响及稳健分析的最佳实践。

Impact of mouse contamination in genomic profiling of patient-derived models and best practice for robust analysis.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献