• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

dbCID:一个用于探索人类癌症中驱动因子插入缺失的人工 curated 资源。

dbCID: a manually curated resource for exploring the driver indels in human cancer.

机构信息

Institute of Physical Science and Information Technology, School of Computer Science and Technology, Anhui University, Hefei, Anhui, China.

School of Life Sciences, Anhui University, Hefei, Anhui, China.

出版信息

Brief Bioinform. 2019 Sep 27;20(5):1925-1933. doi: 10.1093/bib/bby059.

DOI:10.1093/bib/bby059
PMID:30016397
Abstract

While recent advances in next-generation sequencing technologies have enabled the creation of a multitude of databases in cancer genomic research, there is no comprehensive database focusing on the annotation of driver indels (insertions and deletions) yet. Therefore, we have developed the database of Cancer driver InDels (dbCID), which is a collection of known coding indels that likely to be engaged in cancer development, progression or therapy. dbCID contains experimentally supported and putative driver indels derived from manual curation of literature and is freely available online at http://bioinfo.ahu.edu.cn:8080/dbCID. Using the data deposited in dbCID, we summarized features of driver indels in four levels (gene, DNA, transcript and protein) through comparing with putative neutral indels. We found that most of the genes containing driver indels in dbCID are known cancer genes playing a role in tumorigenesis. Contrary to the expectation, the sequences affected by driver frameshift indels are not larger than those by neutral ones. In addition, the frameshift and inframe driver indels prefer to disrupt high-conservative regions both in DNA sequences and protein domains. Finally, we developed a computational method for discriminating cancer driver from neutral frameshift indels based on the deposited data in dbCID. The proposed method outperformed other widely used non-cancer-specific predictors on an external test set, which demonstrated the usefulness of the data deposited in dbCID. We hope dbCID will be a benchmark for improving and evaluating prediction algorithms, and the characteristics summarized here may assist with investigating the mechanism of indel-cancer association.

摘要

虽然新一代测序技术的最新进展使得在癌症基因组研究中创建了众多数据库,但目前还没有一个专门针对驱动插入缺失(indels)注释的综合数据库。因此,我们开发了癌症驱动 indels 数据库(dbCID),这是一个包含已知编码 indels 的集合,这些 indels 可能参与癌症的发生、发展或治疗。dbCID 包含通过文献手动整理得到的实验支持的和推测的驱动 indels,并可在 http://bioinfo.ahu.edu.cn:8080/dbCID 上免费获取。使用 dbCID 中存储的数据,我们通过与推测的中性 indels 进行比较,总结了驱动 indels 在四个层次(基因、DNA、转录本和蛋白质)上的特征。我们发现,dbCID 中包含驱动 indels 的大多数基因都是已知的癌症基因,在肿瘤发生中发挥作用。与预期相反,受驱动移码 indels 影响的序列并不大于受中性 indels 影响的序列。此外,移码和框架内驱动 indels 倾向于破坏 DNA 序列和蛋白质结构域中高保守区域。最后,我们基于 dbCID 中存储的数据开发了一种区分癌症驱动和中性移码 indels 的计算方法。该方法在外部测试集上的表现优于其他广泛使用的非癌症特异性预测器,证明了 dbCID 中存储的数据的有用性。我们希望 dbCID 将成为改进和评估预测算法的基准,并且这里总结的特征可能有助于研究 indels 与癌症关联的机制。

相似文献

1
dbCID: a manually curated resource for exploring the driver indels in human cancer.dbCID:一个用于探索人类癌症中驱动因子插入缺失的人工 curated 资源。
Brief Bioinform. 2019 Sep 27;20(5):1925-1933. doi: 10.1093/bib/bby059.
2
PredCID: prediction of driver frameshift indels in human cancer.PredCID:人类癌症中驱动移码插入缺失的预测
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa119.
3
Identifying the Impact of Inframe Insertions and Deletions on Protein Function in Cancer.确定框内插入和缺失对癌症中蛋白质功能的影响。
J Comput Biol. 2020 May;27(5):786-795. doi: 10.1089/cmb.2018.0192. Epub 2019 Aug 28.
4
dbCPM: a manually curated database for exploring the cancer passenger mutations.dbCPM:一个用于探索癌症乘客突变的人工整理数据库。
Brief Bioinform. 2020 Jan 17;21(1):309-317. doi: 10.1093/bib/bby105.
5
Discriminating between deleterious and neutral non-frameshifting indels based on protein interaction networks and hybrid properties.基于蛋白质相互作用网络和混合特性区分有害和中性非移码插入缺失。
Mol Genet Genomics. 2015 Feb;290(1):343-52. doi: 10.1007/s00438-014-0922-5. Epub 2014 Sep 24.
6
DDIG-in: discriminating between disease-associated and neutral non-frameshifting micro-indels.DDIG-in:区分疾病相关和中性非移码微插入缺失
Genome Biol. 2013 Mar 13;14(3):R23. doi: 10.1186/gb-2013-14-3-r23.
7
Important role of indels in somatic mutations of human cancer genes.插入缺失在人类癌症基因体细胞突变中的重要作用。
BMC Med Genet. 2010 Sep 1;11:128. doi: 10.1186/1471-2350-11-128.
8
Vindel: a simple pipeline for checking indel redundancy.Vindel:一个用于检查插入缺失冗余的简单流程。
BMC Bioinformatics. 2014 Nov 19;15(1):359. doi: 10.1186/s12859-014-0359-1.
9
INDELseek: detection of complex insertions and deletions from next-generation sequencing data.INDELseek:从下一代测序数据中检测复杂插入和缺失
BMC Genomics. 2017 Jan 5;18(1):16. doi: 10.1186/s12864-016-3449-9.
10
IndelFR: a database of indels in protein structures and their flanking regions.IndelFR:蛋白质结构及其侧翼区域插入缺失数据库。
Nucleic Acids Res. 2012 Jan;40(Database issue):D512-8. doi: 10.1093/nar/gkr1107. Epub 2011 Nov 29.

引用本文的文献

1
Variant Impact Predictor database (VIPdb), version 2: trends from three decades of genetic variant impact predictors.变异影响预测器数据库(VIPdb),版本 2:三十年来遗传变异影响预测器的趋势。
Hum Genomics. 2024 Aug 28;18(1):90. doi: 10.1186/s40246-024-00663-z.
2
Variant Impact Predictor database (VIPdb), version 2: Trends from 25 years of genetic variant impact predictors.变异影响预测数据库(VIPdb),版本2:25年基因变异影响预测的趋势
bioRxiv. 2024 Jun 28:2024.06.25.600283. doi: 10.1101/2024.06.25.600283.
3
Extend the benchmarking indel set by manual review using the individual cell line sequencing data from the Sequencing Quality Control 2 (SEQC2) project.
使用来自测序质量控制 2(SEQC2)项目的个体细胞系测序数据,通过手动审查扩展基准插入缺失集。
Sci Rep. 2024 Mar 25;14(1):7028. doi: 10.1038/s41598-024-57439-7.
4
VariBench, new variation benchmark categories and data sets.VariBench,新的变异基准类别和数据集。
Front Bioinform. 2023 Sep 19;3:1248732. doi: 10.3389/fbinf.2023.1248732. eCollection 2023.
5
Structural and functional analysis of somatic coding and UTR indels in breast and lung cancer genomes.体细胞编码和 UTR 插入缺失在乳腺癌和肺癌基因组中的结构和功能分析。
Sci Rep. 2021 Oct 27;11(1):21178. doi: 10.1038/s41598-021-00583-1.
6
DLFF-ACP: prediction of ACPs based on deep learning and multi-view features fusion.DLFF-ACP:基于深度学习和多视图特征融合的急性冠脉综合征预测
PeerJ. 2021 Aug 3;9:e11906. doi: 10.7717/peerj.11906. eCollection 2021.
7
DriverSubNet: A Novel Algorithm for Identifying Cancer Driver Genes by Subnetwork Enrichment Analysis.驱动子网:一种通过子网富集分析识别癌症驱动基因的新算法。
Front Genet. 2021 Feb 19;11:607798. doi: 10.3389/fgene.2020.607798. eCollection 2020.
8
Utility of a custom designed next generation DNA sequencing gene panel to molecularly classify endometrial cancers according to The Cancer Genome Atlas subgroups.根据癌症基因组图谱(The Cancer Genome Atlas,TCGA)亚组,定制设计的下一代 DNA 测序基因面板在分子水平上对子宫内膜癌进行分类的效用。
BMC Med Genomics. 2020 Nov 30;13(1):179. doi: 10.1186/s12920-020-00824-8.
9
Predicting Hot Spot Residues at Protein-DNA Binding Interfaces Based on Sequence Information.基于序列信息预测蛋白质-DNA 结合界面的热点残基。
Interdiscip Sci. 2021 Mar;13(1):1-11. doi: 10.1007/s12539-020-00399-z. Epub 2020 Oct 17.
10
Pathogenicity and functional impact of non-frameshifting insertion/deletion variation in the human genome.人类基因组中非移码插入/缺失变异的致病性和功能影响。
PLoS Comput Biol. 2019 Jun 14;15(6):e1007112. doi: 10.1371/journal.pcbi.1007112. eCollection 2019 Jun.