• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

KNIndex:用于 k- 碱基对核苷酸的物理化学性质的综合数据库。

KNIndex: a comprehensive database of physicochemical properties for k-tuple nucleotides.

机构信息

College of Intelligence and Computing, Tianjin University.

School of Life Sciences, North China University of Science and Technology.

出版信息

Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa284.

DOI:10.1093/bib/bbaa284
PMID:33147622
Abstract

With the development of high-throughput sequencing technology, the genomic sequences increased exponentially over the last decade. In order to decode these new genomic data, machine learning methods were introduced for genome annotation and analysis. Due to the requirement of most machines learning methods, the biological sequences must be represented as fixed-length digital vectors. In this representation procedure, the physicochemical properties of k-tuple nucleotides are important information. However, the values of the physicochemical properties of k-tuple nucleotides are scattered in different resources. To facilitate the studies on genomic sequences, we developed the first comprehensive database, namely KNIndex (https://knindex.pufengdu.org), for depositing and visualizing physicochemical properties of k-tuple nucleotides. Currently, the KNIndex database contains 182 properties including one for mononucleotide (DNA), 169 for dinucleotide (147 for DNA and 22 for RNA) and 12 for trinucleotide (DNA). KNIndex database also provides a user-friendly web-based interface for the users to browse, query, visualize and download the physicochemical properties of k-tuple nucleotides. With the built-in conversion and visualization functions, users are allowed to display DNA/RNA sequences as curves of multiple physicochemical properties. We wish that the KNIndex will facilitate the related studies in computational biology.

摘要

随着高通量测序技术的发展,在过去十年中,基因组序列呈指数级增长。为了解码这些新的基因组数据,引入了机器学习方法来进行基因组注释和分析。由于大多数机器学习方法的要求,生物序列必须表示为固定长度的数字向量。在这种表示过程中,k 元核苷酸的理化性质是重要信息。然而,k 元核苷酸理化性质的值分散在不同的资源中。为了方便对基因组序列的研究,我们开发了第一个综合数据库,即 KNIndex(https://knindex.pufengdu.org),用于存储和可视化 k 元核苷酸的理化性质。目前,KNIndex 数据库包含 182 种属性,包括单核苷酸(DNA)的一种、二核苷酸(147 种 DNA 和 22 种 RNA)的 169 种和三核苷酸(DNA)的 12 种。KNIndex 数据库还为用户提供了一个用户友好的基于网络的界面,用于浏览、查询、可视化和下载 k 元核苷酸的理化性质。通过内置的转换和可视化功能,用户可以将 DNA/RNA 序列显示为多个理化性质的曲线。我们希望 KNIndex 将有助于计算生物学的相关研究。

相似文献

1
KNIndex: a comprehensive database of physicochemical properties for k-tuple nucleotides.KNIndex:用于 k- 碱基对核苷酸的物理化学性质的综合数据库。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa284.
2
PseKNC: a flexible web server for generating pseudo K-tuple nucleotide composition.PseKNC:一个用于生成伪K元核苷酸组成的灵活网络服务器。
Anal Biochem. 2014 Jul 1;456:53-60. doi: 10.1016/j.ab.2014.04.001. Epub 2014 Apr 13.
3
iDNA6mA-PseKNC: Identifying DNA N-methyladenosine sites by incorporating nucleotide physicochemical properties into PseKNC.iDNA6mA-PseKNC:通过将核苷酸理化性质纳入 PseKNC 来鉴定 DNA N6-甲基腺苷位点。
Genomics. 2019 Jan;111(1):96-102. doi: 10.1016/j.ygeno.2018.01.005. Epub 2018 Jan 31.
4
iOri-Human: identify human origin of replication by incorporating dinucleotide physicochemical properties into pseudo nucleotide composition.iOri-Human:通过将二核苷酸物理化学性质纳入伪核苷酸组成来识别人类复制起点。
Oncotarget. 2016 Oct 25;7(43):69783-69793. doi: 10.18632/oncotarget.11975.
5
repRNA: a web server for generating various feature vectors of RNA sequences.repRNA:一个用于生成RNA序列各种特征向量的网络服务器。
Mol Genet Genomics. 2016 Feb;291(1):473-81. doi: 10.1007/s00438-015-1078-7. Epub 2015 Jun 18.
6
LncACTdb 2.0: an updated database of experimentally supported ceRNA interactions curated from low- and high-throughput experiments.LncACTdb 2.0:一个经过实验验证的 ceRNA 相互作用数据库,包含来自低通量和高通量实验的数据。
Nucleic Acids Res. 2019 Jan 8;47(D1):D121-D127. doi: 10.1093/nar/gky1144.
7
PseKNC-General: a cross-platform package for generating various modes of pseudo nucleotide compositions.PseKNC通用版:一个用于生成各种伪核苷酸组成模式的跨平台软件包。
Bioinformatics. 2015 Jan 1;31(1):119-20. doi: 10.1093/bioinformatics/btu602. Epub 2014 Sep 16.
8
iEnhancer-2L: a two-layer predictor for identifying enhancers and their strength by pseudo k-tuple nucleotide composition.iEnhancer-2L:一种通过伪 k-元核苷酸组成识别增强子及其强度的两层预测器。
Bioinformatics. 2016 Feb 1;32(3):362-9. doi: 10.1093/bioinformatics/btv604. Epub 2015 Oct 17.
9
BiasAway: command-line and web server to generate nucleotide composition-matched DNA background sequences.BiasAway:用于生成核苷酸组成匹配的 DNA 背景序列的命令行和网络服务器。
Bioinformatics. 2021 Jul 12;37(11):1607-1609. doi: 10.1093/bioinformatics/btaa928.
10
Effect of k-tuple length on sample-comparison with high-throughput sequencing data.k元组长度对高通量测序数据样本比较的影响。
Biochem Biophys Res Commun. 2016 Jan 22;469(4):1021-7. doi: 10.1016/j.bbrc.2015.11.094. Epub 2015 Dec 22.

引用本文的文献

1
StackEPI: identification of cell line-specific enhancer-promoter interactions based on stacking ensemble learning.StackEPI:基于堆叠集成学习的细胞系特异性增强子-启动子相互作用识别。
BMC Bioinformatics. 2022 Jul 11;23(1):272. doi: 10.1186/s12859-022-04821-9.
2
XGEM: Predicting Essential miRNAs by the Ensembles of Various Sequence-Based Classifiers With XGBoost Algorithm.XGEM:使用XGBoost算法通过各种基于序列的分类器集成来预测必需的微小RNA
Front Genet. 2022 Mar 28;13:877409. doi: 10.3389/fgene.2022.877409. eCollection 2022.
3
Recognition of Metal Ion Ligand-Binding Residues by Adding Correlation Features and Propensity Factors.
通过添加相关特征和倾向因子识别金属离子配体结合残基
Front Genet. 2022 Jan 4;12:793800. doi: 10.3389/fgene.2021.793800. eCollection 2021.
4
mRNALocater: Enhance the prediction accuracy of eukaryotic mRNA subcellular localization by using model fusion strategy.mRNA 定位器:通过模型融合策略提高真核 mRNA 亚细胞定位的预测准确性。
Mol Ther. 2021 Aug 4;29(8):2617-2623. doi: 10.1016/j.ymthe.2021.04.004. Epub 2021 Apr 3.