• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PRESM:用于癌症基因组学中体细胞突变发现的个性化参考编辑器。

PRESM: personalized reference editor for somatic mutation discovery in cancer genomics.

机构信息

Departments of Biochemistry & Molecular Biology and Medical Genetics, Alberta Children's Hospital Research Institute, University of Calgary, Calgary, Canada.

Department of Cancer Biology, Wake Forest School of Medicine, Winston-Salem, NC, USA.

出版信息

Bioinformatics. 2019 May 1;35(9):1445-1452. doi: 10.1093/bioinformatics/bty812.

DOI:10.1093/bioinformatics/bty812
PMID:30247633
Abstract

MOTIVATION

Accurate detection of somatic mutations is a crucial step toward understanding cancer. Various tools have been developed to detect somatic mutations from cancer genome sequencing data by mapping reads to a universal reference genome and inferring likelihoods from complex statistical models. However, read mapping is frequently obstructed by mismatches between germline and somatic mutations on a read and the reference genome. Previous attempts to develop personalized genome tools are not compatible with downstream statistical models for somatic mutation detection.

RESULTS

We present PRESM, a tool that builds personalized reference genomes by integrating germline mutations into the reference genome. The aforementioned obstacle is circumvented by using a two-step germline substitution procedure, maintaining positional fidelity using an innovative workaround. Reads derived from tumor tissue can be positioned more accurately along a personalized reference than a universal reference due to the reduced genetic distance between the subject (tumor genome) and the target (the personalized genome). Application of PRESM's personalized genome reduced false-positive (FP) somatic mutation calls by as much as 55.5%, and facilitated the discovery of a novel somatic point mutation on a germline insertion in PDE1A, a phosphodiesterase associated with melanoma. Moreover, all improvements in calling accuracy were achieved without parameter optimization, as PRESM itself is parameter-free. Hence, similar increases in read mapping and decreases in the FP rate will persist when PRESM-built genomes are applied to any user-provided dataset.

AVAILABILITY AND IMPLEMENTATION

The software is available at https://github.com/precisionomics/PRESM.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

准确检测体细胞突变是理解癌症的关键步骤。已经开发了各种工具,通过将读取映射到通用参考基因组并从复杂的统计模型推断可能性,从癌症基因组测序数据中检测体细胞突变。然而,读取映射经常受到读取和参考基因组上种系和体细胞突变之间不匹配的阻碍。以前开发个性化基因组工具的尝试与用于体细胞突变检测的下游统计模型不兼容。

结果

我们提出了 PRESM,这是一种通过将种系突变整合到参考基因组中来构建个性化参考基因组的工具。通过使用两步种系替换过程,同时使用创新的解决方法保持位置保真度,克服了上述障碍。由于主体(肿瘤基因组)和目标(个性化基因组)之间的遗传距离减小,来自肿瘤组织的读取可以更准确地沿着个性化参考基因组定位,而不是通用参考基因组。PRESM 的个性化基因组的应用减少了多达 55.5%的假阳性(FP)体细胞突变调用,并促成了在 PDE1A 上发现一种新的种系插入体细胞点突变,PDE1A 是一种与黑色素瘤相关的磷酸二酯酶。此外,所有提高呼叫准确性的改进都是在没有参数优化的情况下实现的,因为 PRESM 本身是无参数的。因此,当将 PRESM 构建的基因组应用于任何用户提供的数据集时,读取映射的类似增加和 FP 率的降低将持续存在。

可用性和实现

该软件可在 https://github.com/precisionomics/PRESM 上获得。

补充信息

补充数据可在生物信息学在线获得。

相似文献

1
PRESM: personalized reference editor for somatic mutation discovery in cancer genomics.PRESM:用于癌症基因组学中体细胞突变发现的个性化参考编辑器。
Bioinformatics. 2019 May 1;35(9):1445-1452. doi: 10.1093/bioinformatics/bty812.
2
An Individualized Approach for Somatic Variant Discovery.个体化的体细胞变异发现方法。
Methods Mol Biol. 2020;2120:11-36. doi: 10.1007/978-1-0716-0327-7_2.
3
One Size Doesn't Fit All - RefEditor: Building Personalized Diploid Reference Genome to Improve Read Mapping and Genotype Calling in Next Generation Sequencing Studies.一刀切并不适用——RefEditor:构建个性化二倍体参考基因组以改善下一代测序研究中的读段映射和基因型调用
PLoS Comput Biol. 2015 Aug 12;11(8):e1004448. doi: 10.1371/journal.pcbi.1004448. eCollection 2015 Aug.
4
Detection of oncogenic and clinically actionable mutations in cancer genomes critically depends on variant calling tools.在癌症基因组中检测致癌和具有临床可操作性的突变,关键取决于变异调用工具。
Bioinformatics. 2022 Jun 13;38(12):3181-3191. doi: 10.1093/bioinformatics/btac306.
5
VarSim: a high-fidelity simulation and validation framework for high-throughput genome sequencing with cancer applications.VarSim:一个用于癌症相关高通量基因组测序的高保真模拟与验证框架。
Bioinformatics. 2015 May 1;31(9):1469-71. doi: 10.1093/bioinformatics/btu828. Epub 2014 Dec 17.
6
Personalized genome assembly for accurate cancer somatic mutation discovery using tumor-normal paired reference samples.使用肿瘤-正常配对参考样本进行个性化基因组组装,以准确发现癌症体细胞突变。
Genome Biol. 2022 Nov 9;23(1):237. doi: 10.1186/s13059-022-02803-x.
7
Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications.Manta:用于种系和癌症测序应用的结构变异和插入缺失的快速检测。
Bioinformatics. 2016 Apr 15;32(8):1220-2. doi: 10.1093/bioinformatics/btv710. Epub 2015 Dec 8.
8
Bivartect: accurate and memory-saving breakpoint detection by direct read comparison.Bivartect:通过直接读取比较实现准确且节省内存的断点检测。
Bioinformatics. 2020 May 1;36(9):2725-2730. doi: 10.1093/bioinformatics/btaa059.
9
CScape-somatic: distinguishing driver and passenger point mutations in the cancer genome.CScape-somatic:在癌症基因组中区分驱动突变和乘客突变。
Bioinformatics. 2020 Jun 1;36(12):3637-3644. doi: 10.1093/bioinformatics/btaa242.
10
An investigation of causes of false positive single nucleotide polymorphisms using simulated reads from a small eukaryote genome.利用来自小型真核生物基因组的模拟读数对单核苷酸多态性假阳性原因的调查。
BMC Bioinformatics. 2015 Nov 11;16:382. doi: 10.1186/s12859-015-0801-z.

引用本文的文献

1
Efficient and easy gene expression and genetic variation data analysis and visualization using exvar.使用exvar进行高效便捷的基因表达和遗传变异数据分析及可视化。
Sci Rep. 2025 Apr 10;15(1):12264. doi: 10.1038/s41598-025-93067-5.
2
Review of T cell proliferation regulatory factors in treatment and prognostic prediction for solid tumors.实体瘤治疗及预后预测中T细胞增殖调节因子的综述
Heliyon. 2023 Oct 29;9(11):e21329. doi: 10.1016/j.heliyon.2023.e21329. eCollection 2023 Nov.
3
Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides.
利用随机森林识别氨基酸的关键物理化学性质,以区分抗癌肽和非抗癌肽。
Int J Mol Sci. 2023 Jun 29;24(13):10854. doi: 10.3390/ijms241310854.
4
On the application, reporting, and sharing of in silico simulations for genetic studies.关于遗传研究中计算机模拟的应用、报告和共享。
Genet Epidemiol. 2021 Mar;45(2):131-141. doi: 10.1002/gepi.22362. Epub 2020 Oct 16.