• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

生物信息:使其易于获取并整合(并尝试理解其意义)。

Biological information: making it accessible and integrated (and trying to make sense of it).

作者信息

Hubbard Tim

机构信息

Sanger Institute, Cambridgeshire, UK.

出版信息

Bioinformatics. 2002;18 Suppl 2:S140. doi: 10.1093/bioinformatics/18.suppl_2.s140.

DOI:10.1093/bioinformatics/18.suppl_2.s140
PMID:12385995
Abstract

The availability of the genome sequences of human and mouse, human sequence variation data and other large genetic data sets will lead to a revolution in understanding of the human machine and the treatment of its diseases. The success of the international genome sequencing consortiums shows what can be achieved by well coordinated large scale public domain projects and the benefits of data access to all. It is already clear that the availability of this sequence is having a huge impact on research worldwide. Complete genome sequences provide a framework to pull all biological data together such that each piece has the potential to say something about biology as a whole. Biology is too complex for any organisation to have a monopoly of ideas or data, so the collection, analysis and access to this data can be contributed to by research institutes around the world. However, although it is possible for all this data to be accessible to all through the internet, the more organisations provide data or analysis separately, the harder it becomes for anyone to collect and integrate the results. To address these problems of intergration of data, open standards for biological data exchange, such as the 'Distributed Annotation System' (DAS) are being developed and bioinformatics (Dowell et al., 2001) as a whole is now being strongly driven by the open source software (OSS) model for collaborative software development (Hubbard and Birney, 1999). The leading provider of human genome annotation, the Ensembl project (http://www.ensembl.org), is entirely an OSS project and has been widely adopted by academic and commerical organisations alike (Hubbard et al., 2002). Accurate automatic annotation of features such as genes in vertebrate genomes currently relies on supporting evidence in the form of homologies to mRNAs, ESTs or protein. However, it appears that sufficient high quality experimentally curated annotation now exists to be used as a substrate for machine learning algorithms to create effective models of biological signal sequences (Down and Hubbard, 2002). Is there hope for ab initio prediction methods after all?

摘要

人类和小鼠基因组序列、人类序列变异数据以及其他大型遗传数据集的可得性,将引发一场对人类机体及其疾病治疗理解的革命。国际基因组测序联盟的成功表明,精心协调的大规模公共领域项目能够取得怎样的成果,以及数据对所有人开放的益处。很明显,这些序列的可得性正在对全球研究产生巨大影响。完整的基因组序列提供了一个框架,可将所有生物数据整合在一起,使每一部分都有可能揭示关于整个生物学的信息。生物学过于复杂,任何一个组织都无法垄断思想或数据,因此世界各地的研究机构都可以为这些数据的收集、分析和获取做出贡献。然而,尽管所有这些数据有可能通过互联网供所有人访问,但越多的组织单独提供数据或分析,就越难有人收集和整合这些结果。为了解决数据整合的这些问题,正在开发生物数据交换的开放标准,如“分布式注释系统”(DAS),并且作为一个整体的生物信息学(Dowell等人,2001年)现在正受到开源软件(OSS)协作软件开发模式的有力推动(Hubbard和Birney,1999年)。人类基因组注释的领先提供者Ensembl项目(http://www.ensembl.org)完全是一个OSS项目,已被学术和商业组织广泛采用(Hubbard等人,2002年)。目前,脊椎动物基因组中基因等特征的准确自动注释依赖于与mRNA、EST或蛋白质同源性形式的支持证据。然而,似乎现在已经存在足够高质量的经实验整理的注释,可作为机器学习算法创建生物信号序列有效模型的基础(Down和Hubbard,2002年)。那么,从头预测方法到底还有希望吗?

相似文献

1
Biological information: making it accessible and integrated (and trying to make sense of it).生物信息:使其易于获取并整合(并尝试理解其意义)。
Bioinformatics. 2002;18 Suppl 2:S140. doi: 10.1093/bioinformatics/18.suppl_2.s140.
2
The Ensembl genome database project.Ensembl基因组数据库项目。
Nucleic Acids Res. 2002 Jan 1;30(1):38-41. doi: 10.1093/nar/30.1.38.
3
Integrating sequence and structural biology with DAS.将序列和结构生物学与DAS相结合。
BMC Bioinformatics. 2007 Sep 12;8:333. doi: 10.1186/1471-2105-8-333.
4
Ensembl 2004.Ensembl 2004。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D468-70. doi: 10.1093/nar/gkh038.
5
Enabling high-throughput data management for systems biology: the Bioinformatics Resource Manager.为系统生物学实现高通量数据管理:生物信息学资源管理器
Bioinformatics. 2007 Apr 1;23(7):906-9. doi: 10.1093/bioinformatics/btm031. Epub 2007 Feb 25.
6
Statistical Viewer: a tool to upload and integrate linkage and association data as plots displayed within the Ensembl genome browser.统计查看器:一种用于上传和整合连锁与关联数据并将其作为图谱显示在Ensembl基因组浏览器中的工具。
BMC Bioinformatics. 2005 Apr 12;6:95. doi: 10.1186/1471-2105-6-95.
7
A Chado case study: an ontology-based modular schema for representing genome-associated biological information.一个Chado案例研究:用于表示基因组相关生物信息的基于本体的模块化模式。
Bioinformatics. 2007 Jul 1;23(13):i337-46. doi: 10.1093/bioinformatics/btm189.
8
Using the Ensembl genome server to browse genomic sequence data.使用Ensembl基因组服务器浏览基因组序列数据。
Curr Protoc Bioinformatics. 2007 Jan;Chapter 1:Unit 1.15. doi: 10.1002/0471250953.bi0115s16.
9
SEED servers: high-performance access to the SEED genomes, annotations, and metabolic models.SEED 服务器:高性能访问 SEED 基因组、注释和代谢模型。
PLoS One. 2012;7(10):e48053. doi: 10.1371/journal.pone.0048053. Epub 2012 Oct 24.
10
Interoperability with Moby 1.0--it's better than sharing your toothbrush!与Moby 1.0的互操作性——这可比共用牙刷好多了!
Brief Bioinform. 2008 May;9(3):220-31. doi: 10.1093/bib/bbn003. Epub 2008 Jan 31.

引用本文的文献

1
SKINOMICS: Transcriptional Profiling in Dermatology and Skin Biology.皮肤组学:皮肤病学和皮肤生物学中的转录组学分析。
Curr Genomics. 2012 Aug;13(5):363-8. doi: 10.2174/138920212801619241.
2
Statistical Viewer: a tool to upload and integrate linkage and association data as plots displayed within the Ensembl genome browser.统计查看器:一种用于上传和整合连锁与关联数据并将其作为图谱显示在Ensembl基因组浏览器中的工具。
BMC Bioinformatics. 2005 Apr 12;6:95. doi: 10.1186/1471-2105-6-95.
3
Systematic recovery and analysis of full-ORF human cDNA clones.
全长开放阅读框人源cDNA克隆的系统回收与分析
Genome Res. 2004 Oct;14(10B):2083-92. doi: 10.1101/gr.2473704.
4
Web-based physician order entry: an open source solution with broad physician involvement.基于网络的医生医嘱录入:一项有广泛医生参与的开源解决方案。
AMIA Annu Symp Proc. 2003;2003:724-7.