• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Big data bioinformatics.大数据生物信息学。
J Cell Physiol. 2014 Dec;229(12):1896-900. doi: 10.1002/jcp.24662.
2
Big biological data: challenges and opportunities.大数据:挑战与机遇。
Genomics Proteomics Bioinformatics. 2014 Oct;12(5):187-9. doi: 10.1016/j.gpb.2014.10.001. Epub 2014 Oct 14.
3
Feature selection methods for big data bioinformatics: A survey from the search perspective.大数据生物信息学中的特征选择方法:基于搜索视角的综述
Methods. 2016 Dec 1;111:21-31. doi: 10.1016/j.ymeth.2016.08.014. Epub 2016 Aug 31.
4
SVS: data and knowledge integration in computational biology.SVS:计算生物学中的数据与知识整合
Annu Int Conf IEEE Eng Med Biol Soc. 2011;2011:6474-8. doi: 10.1109/IEMBS.2011.6091598.
5
Introducing Machine Learning Concepts with WEKA.使用WEKA介绍机器学习概念。
Methods Mol Biol. 2016;1418:353-78. doi: 10.1007/978-1-4939-3578-9_17.
6
Advances in Machine Learning Processing of Big Data from Disease Diagnosis Sensors.大数据在疾病诊断传感器中机器学习处理的进展。
ACS Sens. 2024 Mar 22;9(3):1134-1148. doi: 10.1021/acssensors.3c02670. Epub 2024 Feb 16.
7
Chapter 16: text mining for translational bioinformatics.第十六章:转化生物信息学中的文本挖掘。
PLoS Comput Biol. 2013 Apr;9(4):e1003044. doi: 10.1371/journal.pcbi.1003044. Epub 2013 Apr 25.
8
Chemical screening: thinking big with big data.化学筛选:借助大数据进行宏观思考。
Comb Chem High Throughput Screen. 2014;17(6):483-4. doi: 10.2174/138620731706140626145336.
9
Machine learning for Big Data analytics in plants.植物大数据分析的机器学习。
Trends Plant Sci. 2014 Dec;19(12):798-808. doi: 10.1016/j.tplants.2014.08.004. Epub 2014 Sep 12.
10
Revisit of Machine Learning Supported Biological and Biomedical Studies.机器学习支持的生物学和生物医学研究回顾
Methods Mol Biol. 2018;1754:183-204. doi: 10.1007/978-1-4939-7717-8_11.

引用本文的文献

1
Diversity and Seasonal Abundance of (Diptera: Ceratopogonidae) in Tengchong County of Yunnan, China.中国云南省腾冲县蠓科(双翅目:蠓科)的多样性与季节丰度
Insects. 2025 Jul 30;16(8):780. doi: 10.3390/insects16080780.
2
Application of Bioinformatics and Machine Learning Tools in Food Safety.生物信息学和机器学习工具在食品安全中的应用。
Curr Nutr Rep. 2025 May 19;14(1):67. doi: 10.1007/s13668-025-00657-w.
3
PmiR-Select - a computational approach to plant pre-miRNA identification in genomes.PmiR-Select——一种在基因组中识别植物前体微小RNA的计算方法。
Mol Genet Genomics. 2025 Jan 3;300(1):12. doi: 10.1007/s00438-024-02221-7.
4
Single-Molecule Bioelectronic Sensors with AI-Aided Data Analysis: Convergence and Challenges.具有人工智能辅助数据分析的单分子生物电子传感器:融合与挑战
Precis Chem. 2024 Sep 16;2(10):518-538. doi: 10.1021/prechem.4c00048. eCollection 2024 Oct 28.
5
FAIR privacy-preserving operation of large genomic variant calling format (VCF) data without download or installation.大型基因组变异调用格式(VCF)数据的公平隐私保护操作,无需下载或安装。
AMIA Jt Summits Transl Sci Proc. 2024 May 31;2024:65-74. eCollection 2024.
6
Robust identification of interactions between heat-stress responsive genes in the chicken brain using Bayesian networks and augmented expression data.使用贝叶斯网络和扩充表达数据稳健识别鸡脑热应激响应基因之间的相互作用。
Sci Rep. 2024 Apr 19;14(1):9019. doi: 10.1038/s41598-024-58679-3.
7
A Practical Guide to Evaluating and Using Big Data in Digestive Disease Research.消化系统疾病研究中评估和使用大数据实用指南
Gastroenterology. 2024 Feb;166(2):240-247. doi: 10.1053/j.gastro.2023.11.292. Epub 2023 Dec 3.
8
Integrative approaches based on genomic techniques in the functional studies on enhancers.基于基因组技术的增强子功能研究的综合方法。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad442.
9
Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks.基因组对象检测:一种使用卷积神经网络改进的转座元件检测和分类方法。
PLoS One. 2023 Sep 21;18(9):e0291925. doi: 10.1371/journal.pone.0291925. eCollection 2023.
10
Recent Advances in DNA Nanomaterials.DNA纳米材料的最新进展
Nanomaterials (Basel). 2023 Aug 29;13(17):2449. doi: 10.3390/nano13172449.

本文引用的文献

1
Principles and methods of integrative genomic analyses in cancer.癌症综合基因组分析的原则和方法。
Nat Rev Cancer. 2014 May;14(5):299-313. doi: 10.1038/nrc3721.
2
Defining cell-type specificity at the transcriptional level in human disease.在人类疾病中定义转录水平的细胞类型特异性。
Genome Res. 2013 Nov;23(11):1862-73. doi: 10.1101/gr.155697.113. Epub 2013 Aug 15.
3
Biology: The big challenges of big data.生物学:大数据的巨大挑战。
Nature. 2013 Jun 13;498(7453):255-60. doi: 10.1038/498255a.
4
ArrayExpress update--trends in database growth and links to data analysis tools.ArrayExpress 更新——数据库增长趋势及与数据分析工具的链接。
Nucleic Acids Res. 2013 Jan;41(Database issue):D987-90. doi: 10.1093/nar/gks1174. Epub 2012 Nov 27.
5
Comprehensive molecular portraits of human breast tumours.人类乳腺肿瘤的全面分子特征图谱。
Nature. 2012 Oct 4;490(7418):61-70. doi: 10.1038/nature11412. Epub 2012 Sep 23.
6
Understanding transcriptional regulation by integrative analysis of transcription factor binding data.综合分析转录因子结合数据来理解转录调控。
Genome Res. 2012 Sep;22(9):1658-67. doi: 10.1101/gr.136838.111.
7
An integrated encyclopedia of DNA elements in the human genome.人类基因组中 DNA 元件的综合百科全书。
Nature. 2012 Sep 6;489(7414):57-74. doi: 10.1038/nature11247.
8
Classification of human genomic regions based on experimentally determined binding sites of more than 100 transcription-related factors.基于超过 100 个与转录相关的因子的实验确定的结合位点对人类基因组区域进行分类。
Genome Biol. 2012 Sep 26;13(9):R48. doi: 10.1186/gb-2012-13-9-r48.
9
The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups.2000 个乳腺肿瘤的基因组和转录组结构揭示了新的亚群。
Nature. 2012 Apr 18;486(7403):346-52. doi: 10.1038/nature10983.
10
ChromHMM: automating chromatin-state discovery and characterization.ChromHMM:染色质状态发现与特征描述自动化
Nat Methods. 2012 Feb 28;9(3):215-6. doi: 10.1038/nmeth.1906.

大数据生物信息学。

Big data bioinformatics.

机构信息

Department of Genetics, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire; Institute for Quantitative Biomedical Sciences, Geisel School of Medicine at Dartmouth, Lebanon, New Hampshire; Norris Cotton Cancer Center, Geisel School of Medicine at Dartmouth, Lebanon, New Hampshire.

出版信息

J Cell Physiol. 2014 Dec;229(12):1896-900. doi: 10.1002/jcp.24662.

DOI:10.1002/jcp.24662
PMID:24799088
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5604462/
Abstract

Recent technological advances allow for high throughput profiling of biological systems in a cost-efficient manner. The low cost of data generation is leading us to the "big data" era. The availability of big data provides unprecedented opportunities but also raises new challenges for data mining and analysis. In this review, we introduce key concepts in the analysis of big data, including both "machine learning" algorithms as well as "unsupervised" and "supervised" examples of each. We note packages for the R programming language that are available to perform machine learning analyses. In addition to programming based solutions, we review webservers that allow users with limited or no programming background to perform these analyses on large data compendia.

摘要

最近的技术进步使得以经济高效的方式对生物系统进行高通量分析成为可能。数据生成成本的降低使我们进入了“大数据”时代。大数据的可用性为数据挖掘和分析提供了前所未有的机遇,但也带来了新的挑战。在这篇综述中,我们介绍了大数据分析中的关键概念,包括“机器学习”算法以及每种算法的“无监督”和“监督”示例。我们注意到了可用于执行机器学习分析的 R 编程语言包。除了基于编程的解决方案外,我们还审查了允许具有有限或没有编程背景的用户对大型数据汇编执行这些分析的网络服务器。