• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大数据视角下的生物数据综述。

A Survey of Biological Data in a Big Data Perspective.

机构信息

Computational Biology and Bioinformatics Laboratory, Biotechnology Institute, Department of Life Sciences, University of Caxias do Sul, Caxias do Sul, Brazil.

Genome Science and Technology Program, Faculty of Science, The University of British Columbia, Vancouver, Canada.

出版信息

Big Data. 2022 Aug;10(4):279-297. doi: 10.1089/big.2020.0383. Epub 2022 Apr 7.

DOI:10.1089/big.2020.0383
PMID:35394342
Abstract

The amount of available data is continuously growing. This phenomenon promotes a new concept, named big data. The highlight technologies related to big data are cloud computing (infrastructure) and Not Only SQL (NoSQL; data storage). In addition, for data analysis, machine learning algorithms such as decision trees, support vector machines, artificial neural networks, and clustering techniques present promising results. In a biological context, big data has many applications due to the large number of biological databases available. Some limitations of biological big data are related to the inherent features of these data, such as high degrees of complexity and heterogeneity, since biological systems provide information from an atomic level to interactions between organisms or their environment. Such characteristics make most bioinformatic-based applications difficult to build, configure, and maintain. Although the rise of big data is relatively recent, it has contributed to a better understanding of the underlying mechanisms of life. The main goal of this article is to provide a concise and reliable survey of the application of big data-related technologies in biology. As such, some fundamental concepts of information technology, including storage resources, analysis, and data sharing, are described along with their relation to biological data.

摘要

可用数据量不断增加。这一现象催生了一个新概念,称为大数据。与大数据相关的重点技术包括云计算(基础设施)和非关系型数据库(NoSQL;数据存储)。此外,对于数据分析,决策树、支持向量机、人工神经网络和聚类技术等机器学习算法提供了有前景的结果。在生物学背景下,由于有大量的生物学数据库,大数据有许多应用。生物大数据的一些限制与这些数据固有的特征有关,例如高度的复杂性和异质性,因为生物系统提供的信息从原子水平到生物体之间或它们与环境的相互作用。这些特征使得大多数基于生物信息学的应用程序难以构建、配置和维护。尽管大数据的兴起相对较晚,但它有助于更好地理解生命的潜在机制。本文的主要目的是提供一个简洁可靠的大数据相关技术在生物学中应用的综述。因此,描述了信息技术的一些基本概念,包括存储资源、分析和数据共享,并说明了它们与生物数据的关系。

相似文献

1
A Survey of Biological Data in a Big Data Perspective.大数据视角下的生物数据综述。
Big Data. 2022 Aug;10(4):279-297. doi: 10.1089/big.2020.0383. Epub 2022 Apr 7.
2
Big Data Precision Marketing Approach under IoT Cloud Platform Information Mining.物联网云平台信息挖掘下的大数据精准营销方法。
Comput Intell Neurosci. 2022 Jan 12;2022:4828108. doi: 10.1155/2022/4828108. eCollection 2022.
3
Big data handling mechanisms in the healthcare applications: A comprehensive and systematic literature review.医疗应用中的大数据处理机制:全面而系统的文献综述。
J Biomed Inform. 2018 Jun;82:47-62. doi: 10.1016/j.jbi.2018.03.014. Epub 2018 Apr 12.
4
Advances in Machine Learning Processing of Big Data from Disease Diagnosis Sensors.大数据在疾病诊断传感器中机器学习处理的进展。
ACS Sens. 2024 Mar 22;9(3):1134-1148. doi: 10.1021/acssensors.3c02670. Epub 2024 Feb 16.
5
The Construction of Big Data Computational Intelligence System for E-Government in Cloud Computing Environment and Its Development Impact.云计算环境下电子政务大数据计算智能系统构建及其发展影响。
Comput Intell Neurosci. 2022 Mar 24;2022:7295060. doi: 10.1155/2022/7295060. eCollection 2022.
6
Modified Immune Evolutionary Algorithm for Medical Data Clustering and Feature Extraction under Cloud Computing Environment.云计算环境下医学数据聚类和特征提取的改进免疫进化算法。
J Healthc Eng. 2020 Jan 20;2020:1051394. doi: 10.1155/2020/1051394. eCollection 2020.
7
Artificial Intelligence and Big Data Science in Neurocritical Care.人工智能与神经危重症大数据科学
Crit Care Clin. 2023 Jan;39(1):235-242. doi: 10.1016/j.ccc.2022.07.008. Epub 2022 Oct 9.
8
Application and Exploration of Big Data Mining in Clinical Medicine.大数据挖掘在临床医学中的应用与探索
Chin Med J (Engl). 2016 Mar 20;129(6):731-8. doi: 10.4103/0366-6999.178019.
9
A review of Cloud computing technologies for comprehensive microRNA analyses.云计算技术在全面 miRNA 分析中的应用综述。
Comput Biol Chem. 2020 Oct;88:107365. doi: 10.1016/j.compbiolchem.2020.107365. Epub 2020 Aug 29.
10
Machine Learning for Knowledge Extraction from PHR Big Data.用于从个人健康记录大数据中提取知识的机器学习
Stud Health Technol Inform. 2014;202:36-9.

引用本文的文献

1
Deciphering the proteome of K-12: Integrating transcriptomics and machine learning to annotate hypothetical proteins.解析K-12的蛋白质组:整合转录组学与机器学习以注释假设蛋白质。
Comput Struct Biotechnol J. 2025 Jul 24;27:3565-3578. doi: 10.1016/j.csbj.2025.07.036. eCollection 2025.
2
Machine learning identifies potential diagnostic biomarkers associated with ferroptosis in obstructive sleep apnea.机器学习识别出与阻塞性睡眠呼吸暂停中铁死亡相关的潜在诊断生物标志物。
Exp Ther Med. 2025 Mar 13;29(5):95. doi: 10.3892/etm.2025.12845. eCollection 2025 May.
3
Development and evaluation of a training curriculum to engage researchers on accessing and analyzing the All of Us data.
开发并评估一项培训课程,以使研究人员能够获取和分析“我们所有人”计划的数据。
J Am Med Inform Assoc. 2024 Dec 1;31(12):2857-2868. doi: 10.1093/jamia/ocae240.
4
Identification of biomarkers in multiple myeloma: A comprehensive study combining microarray analysis and Mendelian randomization.多发性骨髓瘤的生物标志物鉴定:结合微阵列分析和孟德尔随机化的综合研究。
J Cell Mol Med. 2024 Jun;28(12):e18504. doi: 10.1111/jcmm.18504.
5
CREDO: a friendly Customizable, REproducible, DOcker file generator for bioinformatics applications.CREDO:一个用于生物信息学应用的友好的可定制、可重复、Docker 文件生成器。
BMC Bioinformatics. 2024 Mar 12;25(1):110. doi: 10.1186/s12859-024-05695-9.
6
Conceptual breakthroughs of the long noncoding RNA functional system and its endogenous regulatory role in the cancerous regime.长链非编码RNA功能系统的概念性突破及其在癌症状态下的内源性调控作用
Explor Target Antitumor Ther. 2024;5(1):170-186. doi: 10.37349/etat.2024.00211. Epub 2024 Feb 27.
7
Y chromosome sequence and epigenomic reconstruction across human populations.人类群体中的 Y 染色体序列和表观基因组重建。
Commun Biol. 2023 Jun 9;6(1):623. doi: 10.1038/s42003-023-05004-9.