• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Locating protein-coding regions in human DNA sequences by a multiple sensor-neural network approach.通过多传感器神经网络方法在人类DNA序列中定位蛋白质编码区域。
Proc Natl Acad Sci U S A. 1991 Dec 15;88(24):11261-5. doi: 10.1073/pnas.88.24.11261.
2
Locating protein coding regions in human DNA using a decision tree algorithm.使用决策树算法在人类DNA中定位蛋白质编码区域。
J Comput Biol. 1995 Fall;2(3):473-85. doi: 10.1089/cmb.1995.2.473.
3
Identification of coding regions in genomic DNA sequences: an application of dynamic programming and neural networks.基因组DNA序列中编码区域的识别:动态规划和神经网络的应用
Nucleic Acids Res. 1993 Feb 11;21(3):607-13. doi: 10.1093/nar/21.3.607.
4
Recognizing exons in genomic sequence using GRAIL II.使用GRAIL II在基因组序列中识别外显子。
Genet Eng (N Y). 1994;16:241-53.
5
Prediction of function in DNA sequence analysis.DNA序列分析中的功能预测
J Comput Biol. 1995 Spring;2(1):87-115. doi: 10.1089/cmb.1995.2.87.
6
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
7
Recognition of human genes by stochastic parsing.通过随机解析识别人类基因。
Pac Symp Biocomput. 1998:228-39.
8
Potentially functional regions of nucleic acids recognized by a Kohonen's self-organizing map.由科赫伦自组织映射识别的核酸潜在功能区域。
Comput Appl Biosci. 1993 Dec;9(6):687-93. doi: 10.1093/bioinformatics/9.6.687.
9
Artificial neural network method for discriminating coding regions of eukaryotic genes.用于鉴别真核基因编码区的人工神经网络方法
Comput Appl Biosci. 1995 Oct;11(5):497-501. doi: 10.1093/bioinformatics/11.5.497.
10
A tool for analyzing and annotating genomic sequences.一种用于分析和注释基因组序列的工具。
Genomics. 1997 Nov 15;46(1):37-45. doi: 10.1006/geno.1997.4984.

引用本文的文献

1
gene prediction for protein-coding regions.蛋白质编码区域的基因预测。
Bioinform Adv. 2023 Aug 10;3(1):vbad105. doi: 10.1093/bioadv/vbad105. eCollection 2023.
2
Exploring Yeast as a Study Model of Pantothenate Kinase-Associated Neurodegeneration and for the Identification of Therapeutic Compounds.探讨酵母作为泛酸激酶相关神经退行性疾病的研究模型及治疗化合物的鉴定。
Int J Mol Sci. 2020 Dec 30;22(1):293. doi: 10.3390/ijms22010293.
3
Modeling human Coenzyme A synthase mutation in yeast reveals altered mitochondrial function, lipid content and iron metabolism.在酵母中模拟人类辅酶A合成酶突变揭示了线粒体功能、脂质含量和铁代谢的改变。
Microb Cell. 2015 Apr 6;2(4):126-135. doi: 10.15698/mic2015.04.196.
4
IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction.IN-MACA-MCC:用于人类蛋白质编码和启动子预测的带有改进克隆分类器的集成多吸引子细胞自动机
Adv Bioinformatics. 2014;2014:261362. doi: 10.1155/2014/261362. Epub 2014 Jul 15.
5
Biologically inspired intelligent decision making: a commentary on the use of artificial neural networks in bioinformatics.受生物启发的智能决策:关于人工神经网络在生物信息学中应用的评论
Bioengineered. 2014 Mar-Apr;5(2):80-95. doi: 10.4161/bioe.26997. Epub 2013 Dec 16.
6
Machine learning and genome annotation: a match meant to be?机器学习与基因组注释:天作之合?
Genome Biol. 2013 May 29;14(5):205. doi: 10.1186/gb-2013-14-5-205.
7
The glutathione reductase GSR-1 determines stress tolerance and longevity in Caenorhabditis elegans.谷胱甘肽还原酶 GSR-1 决定秀丽隐杆线虫的应激耐受和寿命。
PLoS One. 2013 Apr 8;8(4):e60731. doi: 10.1371/journal.pone.0060731. Print 2013.
8
Normal and compound poisson approximations for pattern occurrences in NGS reads.下一代测序(NGS)读段中模式出现的正态和复合泊松近似
J Comput Biol. 2012 Jun;19(6):839-54. doi: 10.1089/cmb.2012.0029.
9
Classifying coding DNA with nucleotide statistics.利用核苷酸统计对编码DNA进行分类。
Bioinform Biol Insights. 2009 Oct 28;3:141-54. doi: 10.4137/bbi.s3030.
10
Marine organism cell biology and regulatory sequence discoveryin comparative functional genomics.海洋生物细胞生物学与调控序列发现——比较功能基因组学
Cytotechnology. 2004 Oct;46(2-3):123-37. doi: 10.1007/s10616-005-1719-5. Epub 2005 Nov 30.

本文引用的文献

1
Learning algorithms and probability distributions in feed-forward and feed-back networks.前馈和反馈网络中的学习算法和概率分布。
Proc Natl Acad Sci U S A. 1987 Dec;84(23):8429-33. doi: 10.1073/pnas.84.23.8429.
2
Fractal geometry of music.音乐的分形几何
Proc Natl Acad Sci U S A. 1990 Feb 1;87(3):938-41. doi: 10.1073/pnas.87.3.938.
3
Recognition of protein coding regions in DNA sequences.DNA序列中蛋白质编码区域的识别。
Nucleic Acids Res. 1982 Sep 11;10(17):5303-18. doi: 10.1093/nar/10.17.5303.
4
Codon preference and its use in identifying protein coding regions in long DNA sequences.密码子偏好性及其在长DNA序列中识别蛋白质编码区的应用。
Nucleic Acids Res. 1982 Jan 11;10(1):141-56. doi: 10.1093/nar/10.1.141.
5
Neural networks and physical systems with emergent collective computational abilities.具有涌现集体计算能力的神经网络与物理系统。
Proc Natl Acad Sci U S A. 1982 Apr;79(8):2554-8. doi: 10.1073/pnas.79.8.2554.
6
A comprehensive set of sequence analysis programs for the VAX.一套适用于VAX的综合序列分析程序。
Nucleic Acids Res. 1984 Jan 11;12(1 Pt 1):387-95. doi: 10.1093/nar/12.1part1.387.
7
A method for measuring the non-random bias of a codon usage table.一种测量密码子使用表非随机偏差的方法。
Nucleic Acids Res. 1984 Dec 21;12(24):9567-75. doi: 10.1093/nar/12.24.9567.
8
Do exons code for structural or functional units in proteins?外显子是否编码蛋白质中的结构或功能单元?
Proc Natl Acad Sci U S A. 1988 May;85(9):2944-8. doi: 10.1073/pnas.85.9.2944.
9
The GenBank genetic sequence data bank.基因银行基因序列数据库。
Nucleic Acids Res. 1988 Mar 11;16(5):1861-3. doi: 10.1093/nar/16.5.1861.
10
A common language for physical mapping of the human genome.人类基因组物理图谱的通用语言。
Science. 1989 Sep 29;245(4925):1434-5. doi: 10.1126/science.2781285.

通过多传感器神经网络方法在人类DNA序列中定位蛋白质编码区域。

Locating protein-coding regions in human DNA sequences by a multiple sensor-neural network approach.

作者信息

Uberbacher E C, Mural R J

机构信息

Biology Division, Oak Ridge National Laboratory, TN.

出版信息

Proc Natl Acad Sci U S A. 1991 Dec 15;88(24):11261-5. doi: 10.1073/pnas.88.24.11261.

DOI:10.1073/pnas.88.24.11261
PMID:1763041
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC53114/
Abstract

Genes in higher eukaryotes may span tens or hundreds of kilobases with the protein-coding regions accounting for only a few percent of the total sequence. Identifying genes within large regions of uncharacterized DNA is a difficult undertaking and is currently the focus of many research efforts. We describe a reliable computational approach for locating protein-coding portions of genes in anonymous DNA sequence. Using a concept suggested by robotic environmental sensing, our method combines a set of sensor algorithms and a neural network to localize the coding regions. Several algorithms that report local characteristics of the DNA sequence, and therefore act as sensors, are also described. In its current configuration the "coding recognition module" identifies 90% of coding exons of length 100 bases or greater with less than one false positive coding exon indicated per five coding exons indicated. This is a significantly lower false positive rate than any method of which we are aware. This module demonstrates a method with general applicability to sequence-pattern recognition problems and is available for current research efforts.

摘要

高等真核生物中的基因可能跨越数十或数百千碱基对,而蛋白质编码区域仅占总序列的百分之几。在大片未表征的DNA区域中识别基因是一项艰巨的任务,也是目前许多研究工作的重点。我们描述了一种可靠的计算方法,用于在匿名DNA序列中定位基因的蛋白质编码部分。利用机器人环境感知提出的概念,我们的方法结合了一组传感器算法和一个神经网络来定位编码区域。还描述了几种报告DNA序列局部特征、因此起到传感器作用的算法。在其当前配置中,“编码识别模块”识别出90%长度为100个碱基或更长的编码外显子,每指出五个编码外显子中不到一个错误阳性编码外显子。这一错误阳性率明显低于我们所知的任何方法。该模块展示了一种对序列模式识别问题具有普遍适用性的方法,可供当前的研究工作使用。