• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种新型的错义突变相关特征提取方案,用于“驱动”突变识别。

A novel missense-mutation-related feature extraction scheme for 'driver' mutation identification.

机构信息

School of Mathematical Sciences, Beijing Normal University, Laboratory of Mathematics and Complex Systems, Ministry of Education, Beijing 100875, P.R. China.

出版信息

Bioinformatics. 2012 Nov 15;28(22):2948-55. doi: 10.1093/bioinformatics/bts558. Epub 2012 Oct 7.

DOI:10.1093/bioinformatics/bts558
PMID:23044540
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3496432/
Abstract

MOTIVATION

It becomes widely accepted that human cancer is a disease involving dynamic changes in the genome and that the missense mutations constitute the bulk of human genetic variations. A multitude of computational algorithms, especially the machine learning-based ones, has consequently been proposed to distinguish missense changes that contribute to the cancer progression ('driver' mutation) from those that do not ('passenger' mutation). However, the existing methods have multifaceted shortcomings, in the sense that they either adopt incomplete feature space or depend on protein structural databases which are usually far from integrated.

RESULTS

In this article, we investigated multiple aspects of a missense mutation and identified a novel feature space that well distinguishes cancer-associated driver mutations from passenger ones. An index (DX score) was proposed to evaluate the discriminating capability of each feature, and a subset of these features which ranks top was selected to build the SVM classifier. Cross-validation showed that the classifier trained on our selected features significantly outperforms the existing ones both in precision and robustness. We applied our method to several datasets of missense mutations culled from published database and literature and obtained more reasonable results than previous studies.

AVAILABILITY

The software is available online at http://www.methodisthealth.com/software and https://sites.google.com/site/drivermutationidentification/.

CONTACT

xzhou@tmhs.org.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

人们普遍认为,人类癌症是一种涉及基因组动态变化的疾病,而错义突变构成了人类遗传变异的大部分。因此,已经提出了许多计算算法,特别是基于机器学习的算法,以区分导致癌症进展的错义变化(“驱动”突变)和那些不导致癌症进展的变化(“乘客”突变)。然而,现有的方法存在多方面的缺点,因为它们要么采用不完整的特征空间,要么依赖于蛋白质结构数据库,而这些数据库通常远未集成。

结果

在本文中,我们研究了错义突变的多个方面,并确定了一个新的特征空间,可以很好地区分与癌症相关的驱动突变和乘客突变。提出了一个指数(DX 分数)来评估每个特征的区分能力,并选择排名靠前的这些特征的子集来构建 SVM 分类器。交叉验证表明,在精度和稳健性方面,基于我们选择的特征训练的分类器明显优于现有的分类器。我们将我们的方法应用于从已发表的数据库和文献中提取的几种错义突变数据集,并获得了比以前的研究更合理的结果。

可用性

该软件可在 http://www.methodisthealth.com/software 和 https://sites.google.com/site/drivermutationidentification/ 在线使用。

联系方式

xzhou@tmhs.org。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
A novel missense-mutation-related feature extraction scheme for 'driver' mutation identification.一种新型的错义突变相关特征提取方案,用于“驱动”突变识别。
Bioinformatics. 2012 Nov 15;28(22):2948-55. doi: 10.1093/bioinformatics/bts558. Epub 2012 Oct 7.
2
Predicting the functional consequences of somatic missense mutations found in tumors.预测肿瘤中发现的体细胞错义突变的功能后果。
Methods Mol Biol. 2014;1101:135-59. doi: 10.1007/978-1-62703-721-1_8.
3
Driver Missense Mutation Identification Using Feature Selection and Model Fusion.基于特征选择和模型融合的驱动错义突变识别
J Comput Biol. 2015 Dec;22(12):1075-85. doi: 10.1089/cmb.2015.0110. Epub 2015 Sep 24.
4
Top scoring pairs for feature selection in machine learning and applications to cancer outcome prediction.机器学习中特征选择的最佳评分对及其在癌症预后预测中的应用。
BMC Bioinformatics. 2011 Sep 23;12:375. doi: 10.1186/1471-2105-12-375.
5
TargetMM: Accurate Missense Mutation Prediction by Utilizing Local and Global Sequence Information with Classifier Ensemble.利用分类器集成利用局部和全局序列信息进行准确的错义突变预测。
Comb Chem High Throughput Screen. 2022;25(1):38-52. doi: 10.2174/1386207323666201204140438.
6
CDMPred: a tool for predicting cancer driver missense mutations with high-quality passenger mutations.CDMPred:一种用于预测具有高质量乘客突变的癌症驱动点突变的工具。
PeerJ. 2024 Sep 6;12:e17991. doi: 10.7717/peerj.17991. eCollection 2024.
7
CScape-somatic: distinguishing driver and passenger point mutations in the cancer genome.CScape-somatic:在癌症基因组中区分驱动突变和乘客突变。
Bioinformatics. 2020 Jun 1;36(12):3637-3644. doi: 10.1093/bioinformatics/btaa242.
8
Identification and analysis of driver missense mutations using rotation forest with feature selection.使用带特征选择的旋转森林算法识别和分析驱动型错义突变
Biomed Res Int. 2014;2014:905951. doi: 10.1155/2014/905951. Epub 2014 Aug 27.
9
CanDrA: cancer-specific driver missense mutation annotation with optimized features.CanDrA:具有优化特征的癌症特异性驱动子错义突变注释。
PLoS One. 2013 Oct 30;8(10):e77945. doi: 10.1371/journal.pone.0077945. eCollection 2013.
10
Predicting the functional consequences of cancer-associated amino acid substitutions.预测癌症相关氨基酸取代的功能后果。
Bioinformatics. 2013 Jun 15;29(12):1504-10. doi: 10.1093/bioinformatics/btt182. Epub 2013 Apr 25.

引用本文的文献

1
iSoMAs: Finding isoform expression and somatic mutation associations in human cancers.iSoMAs:在人类癌症中寻找异构体表达与体细胞突变的关联
PLoS Comput Biol. 2025 Mar 7;21(3):e1012847. doi: 10.1371/journal.pcbi.1012847. eCollection 2025 Mar.
2
Radiogenomics-Based Risk Prediction of Glioblastoma Multiforme with Clinical Relevance.基于放射组学的与临床相关的多形性胶质母细胞瘤风险预测。
Genes (Basel). 2024 Jun 1;15(6):718. doi: 10.3390/genes15060718.
3
Differential diagnosis of pancreatic cystic neoplasms through a radiomics-assisted system.通过放射组学辅助系统对胰腺囊性肿瘤进行鉴别诊断。
Front Oncol. 2022 Dec 16;12:941744. doi: 10.3389/fonc.2022.941744. eCollection 2022.
4
Differentiating TP53 Mutation Status in Pancreatic Ductal Adenocarcinoma Using Multiparametric MRI-Derived Radiomics.利用多参数MRI衍生的放射组学鉴别胰腺导管腺癌中的TP53突变状态
Front Oncol. 2021 May 17;11:632130. doi: 10.3389/fonc.2021.632130. eCollection 2021.
5
Somatic mutation in noncoding regions: The sound of silence.非编码区域的体细胞突变:沉默之声
EBioMedicine. 2020 Nov;61:103084. doi: 10.1016/j.ebiom.2020.103084. Epub 2020 Oct 20.
6
miRactDB characterizes miRNA-gene relation switch between normal and cancer tissues across pan-cancer.miRactDB 描绘了泛癌中正常组织和癌症组织之间 miRNA-基因关系的开关。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa089.
7
A novel microRNA signature for pathological grading in lung adenocarcinoma based on TCGA and GEO data.基于 TCGA 和 GEO 数据的肺腺癌病理分级新型 microRNA 特征。
Int J Mol Med. 2020 May;45(5):1397-1408. doi: 10.3892/ijmm.2020.4526. Epub 2020 Mar 4.
8
Association of and polymorphisms with neuroblastoma risk in Eastern Chinese population: a three-center case-control study.中国东部人群中[具体基因名称1]和[具体基因名称2]多态性与神经母细胞瘤风险的关联:一项三中心病例对照研究
Biosci Rep. 2019 Jun 7;39(6). doi: 10.1042/BSR20181907. Print 2019 Jun 28.
9
Pan-cancer analysis on microRNA-associated gene activation.泛癌症分析中 miRNA 相关基因激活。
EBioMedicine. 2019 May;43:82-97. doi: 10.1016/j.ebiom.2019.03.082. Epub 2019 Apr 5.
10
Challenges in the introduction of next-generation sequencing (NGS) for diagnostics of myeloid malignancies into clinical routine use.将下一代测序(NGS)引入髓系恶性肿瘤的临床常规诊断中所面临的挑战。
Blood Cancer J. 2018 Nov 12;8(11):113. doi: 10.1038/s41408-018-0148-6.

本文引用的文献

1
Improving the assessment of the outcome of nonsynonymous SNVs with a consensus deleteriousness score, Condel.利用共识致病变异评分提高非同义 SNV 结果的评估,Condel。
Am J Hum Genet. 2011 Apr 8;88(4):440-9. doi: 10.1016/j.ajhg.2011.03.004. Epub 2011 Mar 31.
2
A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network.一种通过组合功能基因网络的功能和拓扑性质来预测合成遗传相互作用的半监督学习方法。
BMC Bioinformatics. 2010 Jun 24;11:343. doi: 10.1186/1471-2105-11-343.
3
Cancer-specific high-throughput annotation of somatic mutations: computational prediction of driver missense mutations.体细胞突变的癌症特异性高通量注释:驱动错义突变的计算预测
Cancer Res. 2009 Aug 15;69(16):6660-7. doi: 10.1158/0008-5472.CAN-09-1133. Epub 2009 Aug 4.
4
The cancer genome.癌症基因组
Nature. 2009 Apr 9;458(7239):719-24. doi: 10.1038/nature07943.
5
Core signaling pathways in human pancreatic cancers revealed by global genomic analyses.通过全基因组分析揭示的人类胰腺癌核心信号通路。
Science. 2008 Sep 26;321(5897):1801-6. doi: 10.1126/science.1164368. Epub 2008 Sep 4.
6
An integrated genomic analysis of human glioblastoma multiforme.多形性胶质母细胞瘤的综合基因组分析
Science. 2008 Sep 26;321(5897):1807-12. doi: 10.1126/science.1164382. Epub 2008 Sep 4.
7
AAindex: amino acid index database, progress report 2008.AAindex:氨基酸索引数据库,2008年进展报告。
Nucleic Acids Res. 2008 Jan;36(Database issue):D202-5. doi: 10.1093/nar/gkm998. Epub 2007 Nov 12.
8
The genomic landscapes of human breast and colorectal cancers.人类乳腺癌和结直肠癌的基因组图谱。
Science. 2007 Nov 16;318(5853):1108-13. doi: 10.1126/science.1145720. Epub 2007 Oct 11.
9
Comment on "The consensus coding sequences of human breast and colorectal cancers".关于《人类乳腺癌和结直肠癌的共有编码序列》的评论
Science. 2007 Sep 14;317(5844):1500. doi: 10.1126/science.1138956.
10
Comment on "The consensus coding sequences of human breast and colorectal cancers".关于“人类乳腺癌和结直肠癌的一致性编码序列”的评论
Science. 2007 Sep 14;317(5844):1500. doi: 10.1126/science.1138764.