• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用序列固有组成对蛋白编码和长非编码转录本进行分类。

Utilizing sequence intrinsic composition to classify protein-coding and long non-coding transcripts.

机构信息

Bioinformatics Research Group, Advanced Computing Research Laboratory, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China, College of Computer Science and Technology, Jilin University, Changchun 130012, China and Laboratory of Bioinformatics and Non-coding RNA, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China.

出版信息

Nucleic Acids Res. 2013 Sep;41(17):e166. doi: 10.1093/nar/gkt646. Epub 2013 Jul 27.

DOI:10.1093/nar/gkt646
PMID:23892401
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3783192/
Abstract

It is a challenge to classify protein-coding or non-coding transcripts, especially those re-constructed from high-throughput sequencing data of poorly annotated species. This study developed and evaluated a powerful signature tool, Coding-Non-Coding Index (CNCI), by profiling adjoining nucleotide triplets to effectively distinguish protein-coding and non-coding sequences independent of known annotations. CNCI is effective for classifying incomplete transcripts and sense-antisense pairs. The implementation of CNCI offered highly accurate classification of transcripts assembled from whole-transcriptome sequencing data in a cross-species manner, that demonstrated gene evolutionary divergence between vertebrates, and invertebrates, or between plants, and provided a long non-coding RNA catalog of orangutan. CNCI software is available at http://www.bioinfo.org/software/cnci.

摘要

对蛋白质编码或非编码转录本进行分类是一项挑战,特别是对那些来自注释较差的物种的高通量测序数据进行重建的转录本。本研究通过分析相邻的三核苷酸来开发和评估一种强大的特征工具——编码-非编码指数(CNCI),从而有效区分蛋白质编码和非编码序列,而无需依赖已知的注释。CNCI 可有效用于分类不完整的转录本和有义-反义对。CNCI 的实现以跨物种的方式对来自全转录组测序数据组装的转录本进行了高度准确的分类,这表明了脊椎动物和无脊椎动物之间,或植物和动物之间的基因进化分歧,并提供了猩猩的长非编码 RNA 目录。CNCI 软件可在 http://www.bioinfo.org/software/cnci 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ac7/3783192/926947841e45/gkt646f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ac7/3783192/1d1c031e85d4/gkt646f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ac7/3783192/926947841e45/gkt646f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ac7/3783192/1d1c031e85d4/gkt646f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ac7/3783192/926947841e45/gkt646f2p.jpg

相似文献

1
Utilizing sequence intrinsic composition to classify protein-coding and long non-coding transcripts.利用序列固有组成对蛋白编码和长非编码转录本进行分类。
Nucleic Acids Res. 2013 Sep;41(17):e166. doi: 10.1093/nar/gkt646. Epub 2013 Jul 27.
2
De novo approach to classify protein-coding and noncoding transcripts based on sequence composition.基于序列组成对蛋白质编码和非编码转录本进行分类的从头开始方法。
Methods Mol Biol. 2014;1182:203-7. doi: 10.1007/978-1-4939-1062-5_18.
3
CNIT: a fast and accurate web tool for identifying protein-coding and long non-coding transcripts based on intrinsic sequence composition.CNIT:一种快速准确的基于内在序列组成识别蛋白质编码和长非编码转录本的网络工具。
Nucleic Acids Res. 2019 Jul 2;47(W1):W516-W522. doi: 10.1093/nar/gkz400.
4
PLEK: a tool for predicting long non-coding RNAs and messenger RNAs based on an improved k-mer scheme.PLEK:一种基于改进的k-mer方案预测长链非编码RNA和信使RNA的工具。
BMC Bioinformatics. 2014 Sep 19;15(1):311. doi: 10.1186/1471-2105-15-311.
5
CPC2: a fast and accurate coding potential calculator based on sequence intrinsic features.CPC2:一种基于序列固有特征的快速准确编码潜能计算器。
Nucleic Acids Res. 2017 Jul 3;45(W1):W12-W16. doi: 10.1093/nar/gkx428.
6
TERIUS: accurate prediction of lncRNA via high-throughput sequencing data representing RNA-binding protein association.TERIUS:通过代表 RNA 结合蛋白结合的高通量测序数据进行 lncRNA 的准确预测。
BMC Bioinformatics. 2018 Feb 19;19(Suppl 1):41. doi: 10.1186/s12859-018-2013-9.
7
LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature.LncRNApred:基于具有新型混合特征的集成算法对长链非编码RNA和蛋白质编码转录本进行分类
PLoS One. 2016 May 26;11(5):e0154567. doi: 10.1371/journal.pone.0154567. eCollection 2016.
8
BASiNET-BiologicAl Sequences NETwork: a case study on coding and non-coding RNAs identification.BASiNET-生物序列 NETwork:一个关于编码和非编码 RNA 鉴定的案例研究。
Nucleic Acids Res. 2018 Sep 19;46(16):e96. doi: 10.1093/nar/gky462.
9
lncScore: alignment-free identification of long noncoding RNA from assembled novel transcripts.lncScore:基于组装新转录本的长链非编码 RNA 无比对识别
Sci Rep. 2016 Oct 6;6:34838. doi: 10.1038/srep34838.
10
Identification of long non-coding transcripts with feature selection: a comparative study.基于特征选择的长链非编码转录本识别:一项比较研究。
BMC Bioinformatics. 2017 Mar 23;18(1):187. doi: 10.1186/s12859-017-1594-z.

引用本文的文献

1
Transcriptome profiling of mRNA and lncRNA involved in wax biosynthesis in cauliflower.花椰菜蜡质生物合成中涉及的mRNA和lncRNA转录组分析
Sci Data. 2025 Aug 29;12(1):1511. doi: 10.1038/s41597-025-05816-w.
2
Integrating Full-Length and Second-Generation Transcriptomes to Elucidate the ApNPV-Induced Transcriptional Reprogramming in Midgut.整合全长转录组和第二代转录组以阐明苜蓿银纹夜蛾核多角体病毒诱导的中肠转录重编程
Insects. 2025 Jul 31;16(8):792. doi: 10.3390/insects16080792.
3
Differential gene expression drives muscle metabolic and structural differences in Liang Guang small spotted vs. large white pigs.

本文引用的文献

1
CPAT: Coding-Potential Assessment Tool using an alignment-free logistic regression model.CPAT:一种使用无比对逻辑回归模型的编码潜力评估工具。
Nucleic Acids Res. 2013 Apr 1;41(6):e74. doi: 10.1093/nar/gkt006. Epub 2013 Jan 17.
2
Long non-coding RNAs function annotation: a global prediction method based on bi-colored networks.长非编码 RNA 功能注释:一种基于双色网络的全局预测方法。
Nucleic Acids Res. 2013 Jan;41(2):e35. doi: 10.1093/nar/gks967. Epub 2012 Nov 5.
3
The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression.
差异基因表达导致两广小花猪与大白猪在肌肉代谢和结构上的差异。
Sci Rep. 2025 Aug 27;15(1):31564. doi: 10.1038/s41598-025-17179-8.
4
Integrated Transcriptomic and Metabolomic Analyses Shed Light on the Regulation of Aromatic Amino Acid Biosynthesis in a Novel Albino Tea () Mutation.整合转录组学和代谢组学分析揭示新型白化茶()突变体中芳香族氨基酸生物合成的调控机制
Curr Issues Mol Biol. 2025 Aug 12;47(8):644. doi: 10.3390/cimb47080644.
5
Whole-transcriptome insights into follicle selection: deciphering key regulatory networks in Luxi gamecock.卵泡选择的全转录组见解:解析鲁西斗鸡的关键调控网络。
Front Genet. 2025 Aug 6;16:1620058. doi: 10.3389/fgene.2025.1620058. eCollection 2025.
6
Decoding circRNA translation: challenges and advances in computational method development.环状RNA翻译的解码:计算方法开发中的挑战与进展
Front Genet. 2025 Jul 29;16:1654305. doi: 10.3389/fgene.2025.1654305. eCollection 2025.
7
Long-read sequencing uncovers key regulatory genes involved in the differentiation of preadipocytes of Chinese red steppe cattle.长读长测序揭示了参与中国红草原牛前脂肪细胞分化的关键调控基因。
Sci Rep. 2025 Aug 12;15(1):29459. doi: 10.1038/s41598-025-15106-5.
8
Preliminary investigation of the effect of ferulic acid on miRNAs and LncRNAs in Mongolian horse skeletal muscle satellite cells.阿魏酸对蒙古马骨骼肌卫星细胞中微小RNA(miRNAs)和长链非编码RNA(LncRNAs)影响的初步研究
Front Genet. 2025 Jul 18;16:1630614. doi: 10.3389/fgene.2025.1630614. eCollection 2025.
9
Exploring the regulatory role of long non-coding RNAs in pigmentation in juvenile Plectropomus leopardus.探索长链非编码RNA在豹纹鳃棘鲈幼鱼色素沉着中的调控作用。
Sci Rep. 2025 Jul 31;15(1):27977. doi: 10.1038/s41598-025-13347-y.
10
Integrated Multi-Omics Reveals DAM-Mediated Phytohormone Regulatory Networks Driving Bud Dormancy in 'Mixue' Pears.整合多组学揭示了 DAM 介导的植物激素调控网络驱动‘蜜雪’梨芽休眠
Plants (Basel). 2025 Jul 14;14(14):2172. doi: 10.3390/plants14142172.
GENCODE v7 人类长非编码 RNA 目录:基因结构、进化和表达分析。
Genome Res. 2012 Sep;22(9):1775-89. doi: 10.1101/gr.132159.111.
4
Landscape of transcription in human cells.人类细胞中的转录景观。
Nature. 2012 Sep 6;489(7414):101-8. doi: 10.1038/nature11233.
5
An integrated encyclopedia of DNA elements in the human genome.人类基因组中 DNA 元件的综合百科全书。
Nature. 2012 Sep 6;489(7414):57-74. doi: 10.1038/nature11247.
6
Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks.RNA-seq 实验中使用 TopHat 和 Cufflinks 的差异基因和转录本表达分析。
Nat Protoc. 2012 Mar 1;7(3):562-78. doi: 10.1038/nprot.2012.016.
7
Modular regulatory principles of large non-coding RNAs.大型非编码 RNA 的模块化调控原理。
Nature. 2012 Feb 15;482(7385):339-46. doi: 10.1038/nature10887.
8
NONCODE v3.0: integrative annotation of long noncoding RNAs.NONCODE v3.0:长非编码 RNA 的综合注释。
Nucleic Acids Res. 2012 Jan;40(Database issue):D210-5. doi: 10.1093/nar/gkr1175. Epub 2011 Dec 1.
9
Ensembl Genomes: an integrative resource for genome-scale data from non-vertebrate species.Ensembl Genomes:一个整合了非脊椎动物物种基因组规模数据的资源。
Nucleic Acids Res. 2012 Jan;40(Database issue):D91-7. doi: 10.1093/nar/gkr895. Epub 2011 Nov 8.
10
The evolution of gene expression levels in mammalian organs.哺乳动物器官中基因表达水平的演变。
Nature. 2011 Oct 19;478(7369):343-8. doi: 10.1038/nature10532.