• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在大数据时代,我们需要从基因到功能保留事实、预测和假设的可重现轨迹。

We need to keep a reproducible trace of facts, predictions, and hypotheses from gene to function in the era of big data.

机构信息

Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America.

Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America.

出版信息

PLoS Biol. 2020 Nov 30;18(11):e3000999. doi: 10.1371/journal.pbio.3000999. eCollection 2020 Nov.

DOI:10.1371/journal.pbio.3000999
PMID:33253151
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7728211/
Abstract

How do we scale biological science to the demand of next generation biology and medicine to keep track of the facts, predictions, and hypotheses? These days, enormous amounts of DNA sequence and other omics data are generated. Since these data contain the blueprint for life, it is imperative that we interpret it accurately. The abundance of DNA is only one part of the challenge. Artificial Intelligence (AI) and network methods routinely build on large screens, single cell technologies, proteomics, and other modalities to infer or predict biological functions and phenotypes associated with proteins, pathways, and organisms. As a first step, how do we systematically trace the provenance of knowledge from experimental ground truth to gene function predictions and annotations? Here, we review the main challenges in tracking the evolution of biological knowledge and propose several specific solutions to provenance and computational tracing of evidence in functional linkage networks.

摘要

我们如何将生物学科学扩展到下一代生物学和医学的需求,以跟踪事实、预测和假设?如今,大量的 DNA 序列和其他组学数据正在生成。由于这些数据包含生命的蓝图,因此我们必须准确地解释它。DNA 的丰富度只是挑战的一部分。人工智能 (AI) 和网络方法通常基于大屏幕、单细胞技术、蛋白质组学和其他模式来推断或预测与蛋白质、途径和生物体相关的生物功能和表型。作为第一步,我们如何系统地跟踪从实验事实到基因功能预测和注释的知识来源?在这里,我们回顾了跟踪生物知识演变的主要挑战,并提出了几种用于功能链接网络中证据的来源和计算跟踪的具体解决方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2079/7728211/96151a18685e/pbio.3000999.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2079/7728211/295610b95f86/pbio.3000999.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2079/7728211/96151a18685e/pbio.3000999.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2079/7728211/295610b95f86/pbio.3000999.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2079/7728211/96151a18685e/pbio.3000999.g002.jpg

相似文献

1
We need to keep a reproducible trace of facts, predictions, and hypotheses from gene to function in the era of big data.在大数据时代,我们需要从基因到功能保留事实、预测和假设的可重现轨迹。
PLoS Biol. 2020 Nov 30;18(11):e3000999. doi: 10.1371/journal.pbio.3000999. eCollection 2020 Nov.
2
3
Emerging Trends in Big Data Analysis in Computational Biology and Bioinformatics in Health Informatics: A Case Study on Epilepsy and Seizures.大数据分析在健康信息学中的计算生物学和生物信息学中的新兴趋势:以癫痫和发作为例。
Methods Mol Biol. 2024;2719:99-119. doi: 10.1007/978-1-0716-3461-5_6.
4
Introduction: Cancer Gene Networks.引言:癌症基因网络
Methods Mol Biol. 2017;1513:1-9. doi: 10.1007/978-1-4939-6539-7_1.
5
Inference of dynamic networks using time-course data.使用时间序列数据推断动态网络。
Brief Bioinform. 2014 Mar;15(2):212-28. doi: 10.1093/bib/bbt028. Epub 2013 May 21.
6
Improving computational predictions of cis-regulatory binding sites.提高顺式调控结合位点的计算预测能力。
Pac Symp Biocomput. 2006:391-402.
7
Gene expression complex networks: synthesis, identification, and analysis.基因表达复杂网络:合成、识别与分析。
J Comput Biol. 2011 Oct;18(10):1353-67. doi: 10.1089/cmb.2010.0118. Epub 2011 May 6.
8
Where are we in genomics?我们在基因组学领域处于什么位置?
J Physiol Pharmacol. 2005 Jun;56 Suppl 3:37-70.
9
Using protein-protein interactions for refining gene networks estimated from microarray data by Bayesian networks.利用蛋白质-蛋白质相互作用来优化通过贝叶斯网络从微阵列数据估计的基因网络。
Pac Symp Biocomput. 2004:336-47. doi: 10.1142/9789812704856_0032.
10
Cancer omics: from regulatory networks to clinical outcomes.癌症组学:从调控网络到临床结局。
Cancer Lett. 2013 Nov 1;340(2):277-83. doi: 10.1016/j.canlet.2012.11.033. Epub 2012 Nov 29.

引用本文的文献

1
Beyond blast: enabling microbiologists to better extract literature, taxonomic distributions and gene neighbourhood information for protein families.超越 blast:使微生物学家能够更好地提取文献、分类分布和基因邻域信息的蛋白质家族。
Microb Genom. 2024 Feb;10(2). doi: 10.1099/mgen.0.001183.
2
Targeted characterization of fusion transcripts in tumor and normal tissues via FusionInspector.通过 FusionInspector 对肿瘤和正常组织中的融合转录本进行靶向特征分析。
Cell Rep Methods. 2023 May 8;3(5):100467. doi: 10.1016/j.crmeth.2023.100467. eCollection 2023 May 22.
3
Beyond Blast: Enabling Microbiologists to Better Extract Literature, Taxonomic Distributions and Gene Neighborhood Information for Protein Families.

本文引用的文献

1
Machine-learning-guided directed evolution for protein engineering.基于机器学习的定向进化蛋白质工程。
Nat Methods. 2019 Aug;16(8):687-694. doi: 10.1038/s41592-019-0496-6. Epub 2019 Jul 15.
2
One for all and all for One: Improving replication of genetic studies through network diffusion.众志成城:通过网络传播提高遗传研究的可重复性。
PLoS Genet. 2018 Apr 23;14(4):e1007306. doi: 10.1371/journal.pgen.1007306. eCollection 2018 Apr.
3
Perturb-Seq: Dissecting Molecular Circuits with Scalable Single-Cell RNA Profiling of Pooled Genetic Screens.
超越Blast:助力微生物学家更好地提取蛋白质家族的文献、分类分布和基因邻域信息。
bioRxiv. 2024 Jan 2:2023.05.03.539116. doi: 10.1101/2023.05.03.539116.
4
Seminars in immunology special issue: Nutrition, microbiota and immunity The unexplored microbes in health and disease.研讨会免疫学特刊:营养、微生物组和免疫——健康和疾病中未被探索的微生物。
Semin Immunol. 2023 Mar;66:101735. doi: 10.1016/j.smim.2023.101735. Epub 2023 Feb 27.
5
Data Incompleteness May form a Hard-to-Overcome Barrier to Decoding Life's Mechanism.数据不完整性可能构成解码生命机制难以逾越的障碍。
Biology (Basel). 2022 Aug 12;11(8):1208. doi: 10.3390/biology11081208.
6
Interpretable network propagation with application to expanding the repertoire of human proteins that interact with SARS-CoV-2.可解释的网络传播及其在扩展与 SARS-CoV-2 相互作用的人类蛋白质组中的应用。
Gigascience. 2021 Dec 29;10(12). doi: 10.1093/gigascience/giab082.
Perturb-Seq:通过对汇集基因筛选进行可扩展的单细胞RNA分析来剖析分子回路。
Cell. 2016 Dec 15;167(7):1853-1866.e17. doi: 10.1016/j.cell.2016.11.038.
4
GENOME ENGINEERING. The Genome Project-Write.基因组工程。基因组编写计划。
Science. 2016 Jul 8;353(6295):126-7. doi: 10.1126/science.aaf6850. Epub 2016 Jun 2.
5
Design and synthesis of a minimal bacterial genome.最小细菌基因组的设计与合成。
Science. 2016 Mar 25;351(6280):aad6253. doi: 10.1126/science.aad6253.
6
COMBREX-DB: an experiment centered database of protein function: knowledge, predictions and knowledge gaps.COMBREX-DB:一个以实验为中心的蛋白质功能数据库:知识、预测与知识空白。
Nucleic Acids Res. 2016 Jan 4;44(D1):D330-5. doi: 10.1093/nar/gkv1324. Epub 2015 Dec 3.
7
The COMBREX project: design, methodology, and initial results.COMBREX 项目:设计、方法学和初步结果。
PLoS Biol. 2013;11(8):e1001638. doi: 10.1371/journal.pbio.1001638. Epub 2013 Aug 27.
8
STRING v9.1: protein-protein interaction networks, with increased coverage and integration.STRING v9.1:蛋白质-蛋白质相互作用网络,具有更高的覆盖度和集成度。
Nucleic Acids Res. 2013 Jan;41(Database issue):D808-15. doi: 10.1093/nar/gks1094. Epub 2012 Nov 29.
9
An introduction to causal inference.因果推断导论。
Int J Biostat. 2010 Feb 26;6(2):Article 7. doi: 10.2202/1557-4679.1203.
10
Nature of the protein universe.蛋白质世界的本质。
Proc Natl Acad Sci U S A. 2009 Jul 7;106(27):11079-84. doi: 10.1073/pnas.0905029106. Epub 2009 Jun 18.