• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GenBank与PubMed:它们的联系有多紧密?

GenBank and PubMed: How connected are they?

作者信息

Miller Holly, Norton Catherine N, Sarkar Indra Neil

机构信息

MBLWHOI Library, Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA 02543, USA.

出版信息

BMC Res Notes. 2009 Jun 9;2:101. doi: 10.1186/1756-0500-2-101.

DOI:10.1186/1756-0500-2-101
PMID:19508734
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2704225/
Abstract

BACKGROUND

GenBank is a public repository of all publicly available molecular sequence data from a range of sources. In addition to relevant metadata (e.g., sequence description, source organism and taxonomy), publication information is recorded in the GenBank data file. The identification of literature associated with a given molecular sequence may be an essential first step in developing research hypotheses. Although many of the publications associated with GenBank records may not be linked into or part of complementary literature databases (e.g., PubMed), GenBank records associated with literature indexed in Medline are identifiable as they contain PubMed identifiers (PMIDs).

RESULTS

Here we show that an analysis of 87,116,501 GenBank sequence files reveals that 42% are associated with a publication or patent. Of these, 71% are associated with PMIDs, and can therefore be linked to a citation record in the PubMed database. The remaining (29%) of publication-associated GenBank entries either do not have PMIDs or cite a publication that is not currently indexed by PubMed. We also identify the journal titles that are linked through citations in the GenBank files to the largest number of sequences.

CONCLUSION

Our analysis suggests that GenBank contains molecular sequences from a range of disciplines beyond biomedicine, the initial scope of PubMed. The findings thus suggest opportunities to develop mechanisms for integrating biological knowledge beyond the biomedical field.

摘要

背景

GenBank是一个来自一系列来源的所有公开可用分子序列数据的公共储存库。除了相关的元数据(例如,序列描述、来源生物体和分类学)之外,出版信息也记录在GenBank数据文件中。识别与给定分子序列相关的文献可能是提出研究假设的关键第一步。尽管许多与GenBank记录相关的出版物可能未链接到补充文献数据库(例如,PubMed)中或不是其一部分,但与Medline索引文献相关的GenBank记录是可识别的,因为它们包含PubMed标识符(PMID)。

结果

我们在此表明,对87,116,501个GenBank序列文件的分析显示,42%与出版物或专利相关。其中,71%与PMID相关,因此可以链接到PubMed数据库中的引用记录。其余(29%)与出版物相关的GenBank条目要么没有PMID,要么引用了当前未被PubMed索引的出版物。我们还确定了通过GenBank文件中的引用与最多序列相关联的期刊标题。

结论

我们的分析表明,GenBank包含来自生物医学之外一系列学科的分子序列,而生物医学是PubMed的初始范围。因此,这些发现表明有机会开发整合生物医学领域之外生物知识的机制。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12df/2704225/352dcaf63750/1756-0500-2-101-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12df/2704225/ad55f07fe88a/1756-0500-2-101-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12df/2704225/352dcaf63750/1756-0500-2-101-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12df/2704225/ad55f07fe88a/1756-0500-2-101-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12df/2704225/352dcaf63750/1756-0500-2-101-2.jpg

相似文献

1
GenBank and PubMed: How connected are they?GenBank与PubMed:它们的联系有多紧密?
BMC Res Notes. 2009 Jun 9;2:101. doi: 10.1186/1756-0500-2-101.
2
GenBank.基因银行
Nucleic Acids Res. 2016 Jan 4;44(D1):D67-72. doi: 10.1093/nar/gkv1276. Epub 2015 Nov 20.
3
GenBank.基因银行
Nucleic Acids Res. 2017 Jan 4;45(D1):D37-D42. doi: 10.1093/nar/gkw1070. Epub 2016 Nov 28.
4
GenBank.基因银行
Nucleic Acids Res. 2008 Jan;36(Database issue):D25-30. doi: 10.1093/nar/gkm929. Epub 2007 Dec 11.
5
GenBank.GenBank。
Nucleic Acids Res. 2018 Jan 4;46(D1):D41-D47. doi: 10.1093/nar/gkx1094.
6
GenBank.基因银行
Nucleic Acids Res. 2009 Jan;37(Database issue):D26-31. doi: 10.1093/nar/gkn723. Epub 2008 Oct 21.
7
GenBank.基因银行
Nucleic Acids Res. 2007 Jan;35(Database issue):D21-5. doi: 10.1093/nar/gkl986.
8
GenBank.基因银行
Nucleic Acids Res. 2003 Jan 1;31(1):23-7. doi: 10.1093/nar/gkg057.
9
GenBank.基因银行
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D34-8. doi: 10.1093/nar/gki063.
10
A high-precision rule-based extraction system for expanding geospatial metadata in GenBank records.一种用于扩展GenBank记录中地理空间元数据的基于规则的高精度提取系统。
J Am Med Inform Assoc. 2016 Sep;23(5):934-41. doi: 10.1093/jamia/ocv172. Epub 2016 Jan 17.

引用本文的文献

1
Liberating links between datasets using lightweight data publishing: an example using plant names and the taxonomic literature.使用轻量级数据发布解放数据集之间的链接:以植物名称和分类学文献为例
Biodivers Data J. 2018 Jul 23(6):e27539. doi: 10.3897/BDJ.6.e27539. eCollection 2018.
2
A high-precision rule-based extraction system for expanding geospatial metadata in GenBank records.一种用于扩展GenBank记录中地理空间元数据的基于规则的高精度提取系统。
J Am Med Inform Assoc. 2016 Sep;23(5):934-41. doi: 10.1093/jamia/ocv172. Epub 2016 Jan 17.
3
Knowledge-driven geospatial location resolution for phylogeographic models of virus migration.

本文引用的文献

1
GenBank.基因银行
Nucleic Acids Res. 2009 Jan;37(Database issue):D26-31. doi: 10.1093/nar/gkn723. Epub 2008 Oct 21.
2
Parasite misidentifications in GenBank: how to minimize their number?GenBank 中的寄生虫错误鉴定:如何将其数量降至最低?
Trends Parasitol. 2008 Jun;24(6):247-8. doi: 10.1016/j.pt.2008.03.004. Epub 2008 Apr 25.
3
Preserving accuracy in GenBank.保持GenBank中的准确性。
用于病毒迁移系统发育地理学模型的知识驱动型地理空间定位解析
Bioinformatics. 2015 Jun 15;31(12):i348-56. doi: 10.1093/bioinformatics/btv259.
4
BioNames: linking taxonomy, texts, and trees.生物命名:连接分类法、文本和树。
PeerJ. 2013 Oct 29;1:e190. doi: 10.7717/peerj.190. eCollection 2013.
5
SeedSeq: off-target transcriptome database.SeedSeq:脱靶转录组数据库。
Biomed Res Int. 2013;2013:905429. doi: 10.1155/2013/905429. Epub 2013 Aug 29.
6
Metadata management and semantics in microarray repositories.微阵列数据库中的元数据管理与语义学
Balkan J Med Genet. 2011 Dec;14(2):49-64. doi: 10.2478/v10034-011-0047-7.
7
Detection of horizontal gene transfers from phylogenetic comparisons.通过系统发育比较检测水平基因转移。
Int J Evol Biol. 2012;2012:813015. doi: 10.1155/2012/813015. Epub 2012 May 23.
8
A vector space model approach to identify genetically related diseases.一种基于向量空间模型的方法,用于识别具有遗传关系的疾病。
J Am Med Inform Assoc. 2012 Mar-Apr;19(2):249-54. doi: 10.1136/amiajnl-2011-000480. Epub 2012 Jan 6.
9
pubmed2ensembl: a resource for mining the biological literature on genes.pubmed2ensembl:一个挖掘基因相关生物文献的资源
PLoS One. 2011;6(9):e24716. doi: 10.1371/journal.pone.0024716. Epub 2011 Sep 29.
10
Marine natural products: a new wave of drugs?海洋天然产物:新药浪潮?
Future Med Chem. 2011 Sep;3(12):1475-89. doi: 10.4155/fmc.11.118.
Science. 2008 Mar 21;319(5870):1616. doi: 10.1126/science.319.5870.1616a.
4
DNA data. Proposal to 'Wikify' GenBank meets stiff resistance.DNA数据。将GenBank“维基化”的提议遭遇强烈抵制。
Science. 2008 Mar 21;319(5870):1598-9. doi: 10.1126/science.319.5870.1598.
5
Mining metadata from unidentified ITS sequences in GenBank: a case study in Inocybe (Basidiomycota).从GenBank中未鉴定的ITS序列挖掘元数据:丝盖伞属(担子菌门)的一个案例研究
BMC Evol Biol. 2008 Feb 18;8:50. doi: 10.1186/1471-2148-8-50.
6
The Sorcerer II Global Ocean Sampling expedition: northwest Atlantic through eastern tropical Pacific.“魔法师二号”全球海洋采样探险:从西北大西洋到东热带太平洋
PLoS Biol. 2007 Mar;5(3):e77. doi: 10.1371/journal.pbio.0050077.
7
Structural and functional diversity of the microbial kinome.微生物激酶组的结构与功能多样性
PLoS Biol. 2007 Mar;5(3):e17. doi: 10.1371/journal.pbio.0050017.
8
The Sorcerer II Global Ocean Sampling expedition: expanding the universe of protein families.“魔法师二号”全球海洋采样考察:拓展蛋白质家族的范畴
PLoS Biol. 2007 Mar;5(3):e16. doi: 10.1371/journal.pbio.0050016.
9
Genome re-annotation: a wiki solution?基因组重新注释:一种维基解决方案?
Genome Biol. 2007;8(1):102. doi: 10.1186/gb-2007-8-1-102.
10
Data sharing: how much doesn't get submitted to GenBank?数据共享:有多少未提交至GenBank?
PLoS Biol. 2006 Jul;4(7):e228. doi: 10.1371/journal.pbio.0040228. Epub 2006 Jul 11.