比较毒理学基因组学数据库中用于科学文献人工注释的注释范例和应用工具。

The curation paradigm and application tool used for manual curation of the scientific literature at the Comparative Toxicogenomics Database.

机构信息

Department of Bioinformatics, The Mount Desert Island Biological Laboratory, Salisbury Cove, ME 04672, USA.

出版信息

Database (Oxford). 2011 Sep 20;2011:bar034. doi: 10.1093/database/bar034. Print 2011.

DOI:10.1093/database/bar034

PMID:21933848

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3176677/

Abstract

The Comparative Toxicogenomics Database (CTD) is a public resource that promotes understanding about the effects of environmental chemicals on human health. CTD biocurators read the scientific literature and convert free-text information into a structured format using official nomenclature, integrating third party controlled vocabularies for chemicals, genes, diseases and organisms, and a novel controlled vocabulary for molecular interactions. Manual curation produces a robust, richly annotated dataset of highly accurate and detailed information. Currently, CTD describes over 349,000 molecular interactions between 6800 chemicals, 20,900 genes (for 330 organisms) and 4300 diseases that have been manually curated from over 25,400 peer-reviewed articles. This manually curated data are further integrated with other third party data (e.g. Gene Ontology, KEGG and Reactome annotations) to generate a wealth of toxicogenomic relationships. Here, we describe our approach to manual curation that uses a powerful and efficient paradigm involving mnemonic codes. This strategy allows biocurators to quickly capture detailed information from articles by generating simple statements using codes to represent the relationships between data types. The paradigm is versatile, expandable, and able to accommodate new data challenges that arise. We have incorporated this strategy into a web-based curation tool to further increase efficiency and productivity, implement quality control in real-time and accommodate biocurators working remotely. Database URL: http://ctd.mdibl.org.

摘要

比较毒理学基因组学数据库（CTD）是一个公共资源，旨在增进对环境化学物质对人类健康影响的了解。CTD 生物注释员阅读科学文献，并使用官方命名法将自由文本信息转换为结构化格式，整合化学物质、基因、疾病和生物体的第三方控制词汇表，以及用于分子相互作用的新型控制词汇表。人工注释生成了一个强大、丰富的数据集，其中包含高度准确和详细的信息。目前，CTD 从超过 25400 篇同行评审文章中手动注释了超过 349000 个化学物质、20900 个基因（来自 330 个生物体）和 4300 种疾病之间的分子相互作用。这些手动注释的数据与其他第三方数据（如基因本体论、KEGG 和 Reactome 注释）进一步整合，生成了丰富的毒理基因组学关系。在这里，我们描述了我们使用强大而高效的助记符代码的人工注释方法。这种策略允许生物注释员通过使用代码来表示不同数据类型之间的关系，快速从文章中捕获详细信息，生成简单的语句。该范式具有多功能性、可扩展性，并且能够适应新出现的数据挑战。我们已经将这种策略整合到一个基于网络的注释工具中，以进一步提高效率和生产力，实时实施质量控制，并容纳远程工作的生物注释员。数据库 URL：http://ctd.mdibl.org。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5b5/3176677/6a6afc32be28/bar034f1.jpg

相似文献

The curation paradigm and application tool used for manual curation of the scientific literature at the Comparative Toxicogenomics Database.比较毒理学基因组学数据库中用于科学文献人工注释的注释范例和应用工具。

Database (Oxford). 2011 Sep 20;2011:bar034. doi: 10.1093/database/bar034. Print 2011.

MEDIC: a practical disease vocabulary used at the Comparative Toxicogenomics Database.医学：比较毒理学基因组学数据库中使用的实用疾病词汇。

Database (Oxford). 2012 Mar 20;2012:bar065. doi: 10.1093/database/bar065. Print 2012.

Comparative Toxicogenomics Database: a knowledgebase and discovery tool for chemical-gene-disease networks.比较毒理基因组学数据库：一个关于化学物质-基因-疾病网络的知识库和发现工具。

Nucleic Acids Res. 2009 Jan;37(Database issue):D786-92. doi: 10.1093/nar/gkn580. Epub 2008 Sep 9.

Text mining effectively scores and ranks the literature for improving chemical-gene-disease curation at the comparative toxicogenomics database.文本挖掘有效地对文献进行评分和排序，以提高比较毒理学基因组学数据库中的化学物质-基因-疾病的编纂工作。

PLoS One. 2013 Apr 17;8(4):e58201. doi: 10.1371/journal.pone.0058201. Print 2013.

A CTD-Pfizer collaboration: manual curation of 88,000 scientific articles text mined for drug-disease and drug-phenotype interactions.CTD-Pfizer 合作项目：对 88000 篇经文本挖掘的科学文章进行人工注释，以发现药物-疾病和药物-表型相互作用。

Database (Oxford). 2013 Nov 28;2013:bat080. doi: 10.1093/database/bat080. Print 2013.

Targeted journal curation as a method to improve data currency at the Comparative Toxicogenomics Database.靶向期刊策展作为一种提高比较毒理学基因组学数据库数据时效性的方法。

Database (Oxford). 2012 Dec 6;2012:bas051. doi: 10.1093/database/bas051. Print 2012.

Text mining and manual curation of chemical-gene-disease networks for the comparative toxicogenomics database (CTD).文本挖掘和化学-基因-疾病网络的人工整理用于比较毒理学基因组数据库（CTD）。

BMC Bioinformatics. 2009 Oct 8;10:326. doi: 10.1186/1471-2105-10-326.

The Comparative Toxicogenomics Database: update 2011.比较毒理基因组学数据库：2011年更新版

Nucleic Acids Res. 2011 Jan;39(Database issue):D1067-72. doi: 10.1093/nar/gkq813. Epub 2010 Sep 22.

The comparative toxicogenomics database: a cross-species resource for building chemical-gene interaction networks.比较毒理基因组学数据库：构建化学物质-基因相互作用网络的跨物种资源。

Toxicol Sci. 2006 Aug;92(2):587-95. doi: 10.1093/toxsci/kfl008. Epub 2006 May 4.

Prioritizing PubMed articles for the Comparative Toxicogenomic Database utilizing semantic information.利用语义信息为比较毒理学基因组数据库对 PubMed 文章进行优先级排序。

Database (Oxford). 2012 Nov 17;2012:bas042. doi: 10.1093/database/bas042. Print 2012.

引用本文的文献

Integrating AI-powered text mining from PubTator into the manual curation workflow at the Comparative Toxicogenomics Database.将来自PubTator的人工智能文本挖掘技术整合到比较毒理基因组学数据库的人工编目工作流程中。

Database (Oxford). 2025 Feb 21;2025. doi: 10.1093/database/baaf013.

ZFIN updates to support zebrafish environmental exposure data.ZFIN更新以支持斑马鱼环境暴露数据。

Genetics. 2025 Mar 17;229(3). doi: 10.1093/genetics/iyaf021.

Comparative Toxicogenomics Database's 20th anniversary: update 2025.比较毒理基因组学数据库成立20周年：2025年更新

Nucleic Acids Res. 2025 Jan 6;53(D1):D1328-D1334. doi: 10.1093/nar/gkae883.

Transforming environmental health datasets from the comparative toxicogenomics database into chord diagrams to visualize molecular mechanisms.将来自比较毒理基因组学数据库的环境卫生数据集转换为弦图，以可视化分子机制。

Front Toxicol. 2024 Jul 22;6:1437884. doi: 10.3389/ftox.2024.1437884. eCollection 2024.

CTD tetramers: a new online tool that computationally links curated chemicals, genes, phenotypes, and diseases to inform molecular mechanisms for environmental health.CTD 四聚体：一个新的在线工具，可从计算上链接经策展的化学品、基因、表型和疾病，为环境健康的分子机制提供信息。

Toxicol Sci. 2023 Sep 28;195(2):155-168. doi: 10.1093/toxsci/kfad069.

Comparative Toxicogenomics Database (CTD): update 2023.比较毒理学基因组数据库（CTD）：2023 年更新。

Nucleic Acids Res. 2023 Jan 6;51(D1):D1257-D1262. doi: 10.1093/nar/gkac833.

Construction of Mode of Action for Cadmium-Induced Renal Tubular Dysfunction Based on a Toxicity Pathway-Oriented Approach.基于毒性途径导向方法构建镉诱导肾小管功能障碍的作用模式

Front Genet. 2021 Jul 23;12:696892. doi: 10.3389/fgene.2021.696892. eCollection 2021.

CTD Anatomy: analyzing chemical-induced phenotypes and exposures from an anatomical perspective, with implications for environmental health studies.CTD解剖学：从解剖学角度分析化学物质诱导的表型和暴露情况，对环境卫生研究具有启示意义。

Curr Res Toxicol. 2021;2:128-139. doi: 10.1016/j.crtox.2021.03.001. Epub 2021 Mar 5.

Public data sources to support systems toxicology applications.支持系统毒理学应用的公共数据源。

Curr Opin Toxicol. 2019 Aug;16:17-24. doi: 10.1016/j.cotox.2019.03.002. Epub 2019 Mar 11.

Comparative Toxicogenomics Database (CTD): update 2021.比较毒理学基因组学数据库（CTD）：2021 年更新。

Nucleic Acids Res. 2021 Jan 8;49(D1):D1138-D1143. doi: 10.1093/nar/gkaa891.

本文引用的文献

Preferential regulation of miRNA targets by environmental chemicals in the human genome.环境化学物质在人类基因组中对 miRNA 靶标的优先调控。

BMC Genomics. 2011 May 18;12:244. doi: 10.1186/1471-2164-12-244.

Methods and strategies for gene structure curation in WormBase.在 WormBase 中进行基因结构整理的方法和策略。

Database (Oxford). 2011 May 3;2011:baq039. doi: 10.1093/database/baq039. Print 2011.

A new face and new challenges for Online Mendelian Inheritance in Man (OMIM®).在线孟德尔遗传数据库（OMIM®）迎来新面貌与新挑战。

Hum Mutat. 2011 May;32(5):564-7. doi: 10.1002/humu.21466. Epub 2011 Apr 5.

The Rat Genome Database curation tool suite: a set of optimized software tools enabling efficient acquisition, organization, and presentation of biological data.大鼠基因组数据库注释工具套件：一组优化的软件工具，可实现生物数据的高效获取、组织和呈现。

Database (Oxford). 2011 Feb 14;2011:bar002. doi: 10.1093/database/bar002. Print 2011.

Activity profiles of 309 ToxCast™ chemicals evaluated across 292 biochemical targets.309 种 ToxCastTM 化学物质在 292 个生化靶标上的活性谱图。

Toxicology. 2011 Mar 28;282(1-2):1-15. doi: 10.1016/j.tox.2010.12.010. Epub 2011 Jan 18.

Database resources of the National Center for Biotechnology Information.美国国立生物技术信息中心的数据库资源。

Nucleic Acids Res. 2011 Jan;39(Database issue):D38-51. doi: 10.1093/nar/gkq1172. Epub 2010 Nov 21.

Reactome: a database of reactions, pathways and biological processes.Reactome：一个关于反应、通路和生物过程的数据库。

Nucleic Acids Res. 2011 Jan;39(Database issue):D691-7. doi: 10.1093/nar/gkq1018. Epub 2010 Nov 9.

The Comparative Toxicogenomics Database: update 2011.比较毒理基因组学数据库：2011年更新版

Nucleic Acids Res. 2011 Jan;39(Database issue):D1067-72. doi: 10.1093/nar/gkq813. Epub 2010 Sep 22.

GeneComps and ChemComps: a new CTD metric to identify genes and chemicals with shared toxicogenomic profiles.基因组分与化学组分：一种用于识别具有共享毒理基因组学特征的基因和化学物质的新CTD指标。

Bioinformation. 2009 Oct 15;4(4):173-4. doi: 10.6026/97320630004173.

Integrating text mining into the MGI biocuration workflow.将文本挖掘整合到MGI生物编目工作流程中。

Database (Oxford). 2009;2009:bap019. doi: 10.1093/database/bap019. Epub 2009 Nov 21.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

比较毒理学基因组学数据库中用于科学文献人工注释的注释范例和应用工具。

The curation paradigm and application tool used for manual curation of the scientific literature at the Comparative Toxicogenomics Database.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献