• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

组学中的小数据方法:以一当十。

Small data methods in omics: the power of one.

机构信息

Department of Mathematics, University of California, Irvine, Irvine, CA, USA.

Department of Anatomy and Neurobiology, School of Medicine, University of California, Irvine, Irvine, CA, USA.

出版信息

Nat Methods. 2024 Sep;21(9):1597-1602. doi: 10.1038/s41592-024-02390-8. Epub 2024 Aug 22.

DOI:10.1038/s41592-024-02390-8
PMID:39174710
Abstract

Over the last decade, biology has begun utilizing 'big data' approaches, resulting in large, comprehensive atlases in modalities ranging from transcriptomics to neural connectomics. However, these approaches must be complemented and integrated with 'small data' approaches to efficiently utilize data from individual labs. Integration of smaller datasets with major reference atlases is critical to provide context to individual experiments, and approaches toward integration of large and small data have been a major focus in many fields in recent years. Here we discuss progress in integration of small data with consortium-sized atlases across multiple modalities, and its potential applications. We then examine promising future directions for utilizing the power of small data to maximize the information garnered from small-scale experiments. We envision that, in the near future, international consortia comprising many laboratories will work together to collaboratively build reference atlases and foundation models using small data methods.

摘要

在过去的十年中,生物学开始利用“大数据”方法,从而在从转录组学到神经连接组学等多种模式中产生了大型综合图谱。然而,这些方法必须与“小数据”方法相结合并进行整合,以便有效地利用各个实验室的数据。将较小的数据集与主要参考图谱集成对于为单个实验提供上下文至关重要,并且近年来,将大数据和小数据集成的方法一直是许多领域的主要关注点。在这里,我们讨论了在多个模式下将小数据与联盟规模的图谱集成的进展及其潜在应用。然后,我们研究了利用小数据的强大功能来最大程度地从小规模实验中获取信息的有前途的未来方向。我们设想,在不久的将来,由许多实验室组成的国际联盟将共同努力,使用小数据方法来协作构建参考图谱和基础模型。

相似文献

1
Small data methods in omics: the power of one.组学中的小数据方法:以一当十。
Nat Methods. 2024 Sep;21(9):1597-1602. doi: 10.1038/s41592-024-02390-8. Epub 2024 Aug 22.
2
Integrative Analysis of Omics Big Data.组学大数据的综合分析
Methods Mol Biol. 2018;1754:109-135. doi: 10.1007/978-1-4939-7717-8_7.
3
Multi -omics and metabolic modelling pipelines: challenges and tools for systems microbiology.多组学和代谢建模管道:系统微生物学的挑战和工具。
Microbiol Res. 2015 Feb;171:52-64. doi: 10.1016/j.micres.2015.01.003. Epub 2015 Jan 7.
4
Big science and big data in nephrology.肾脏病学中的大科学和大数据。
Kidney Int. 2019 Jun;95(6):1326-1337. doi: 10.1016/j.kint.2018.11.048. Epub 2019 Mar 5.
5
Towards multi-omics synthetic data integration.迈向多组学综合数据集成。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae213.
6
Biomolecular Databases and Subnetwork Identification Approaches of Interest to Big Data Community: An Expert Review.生物分子数据库和子网络识别方法:大数据社区的兴趣所在——专家综述
OMICS. 2019 Mar;23(3):138-151. doi: 10.1089/omi.2018.0205.
7
Machine learning meets omics: applications and perspectives.机器学习与组学的融合:应用与展望。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab460.
8
Machine learning: its challenges and opportunities in plant system biology.机器学习:在植物系统生物学中的挑战与机遇。
Appl Microbiol Biotechnol. 2022 May;106(9-10):3507-3530. doi: 10.1007/s00253-022-11963-6. Epub 2022 May 16.
9
Handling High-Throughput Omics Data for Systems Genetics Analysis.处理高通量组学数据进行系统遗传学分析。
Methods Mol Biol. 2021;2325:183-190. doi: 10.1007/978-1-0716-1507-2_12.
10
The big data challenges of connectomics.连接组学的大数据挑战。
Nat Neurosci. 2014 Nov;17(11):1448-54. doi: 10.1038/nn.3837. Epub 2014 Oct 28.

引用本文的文献

1
Integrated Patient Digital and Biomimetic Twins for Precision Medicine: A Perspective.用于精准医学的综合患者数字双胞胎和仿生双胞胎:一种观点
Semin Liver Dis. 2025 Jul 23. doi: 10.1055/a-2649-1560.
2
Development and validation of a small-sample machine learning model to predict 5-year overall survival in patients with hepatocellular carcinoma.用于预测肝细胞癌患者5年总生存率的小样本机器学习模型的开发与验证
BMC Cancer. 2025 Jul 1;25(1):1040. doi: 10.1186/s12885-025-14425-0.
3
scValue: value-based subsampling of large-scale single-cell transcriptomic data for machine and deep learning tasks.

本文引用的文献

1
CZ CELLxGENE Discover: a single-cell data platform for scalable exploration, analysis and modeling of aggregated data.CZ CELLxGENE发现平台:一个用于对聚合数据进行可扩展探索、分析和建模的单细胞数据平台。
Nucleic Acids Res. 2025 Jan 6;53(D1):D886-D900. doi: 10.1093/nar/gkae1142.
2
Transformers in single-cell omics: a review and new perspectives.单细胞组学中的转换器:综述与新视角。
Nat Methods. 2024 Aug;21(8):1430-1443. doi: 10.1038/s41592-024-02353-z. Epub 2024 Aug 9.
3
scGPT: toward building a foundation model for single-cell multi-omics using generative AI.
scValue:用于机器学习和深度学习任务的大规模单细胞转录组数据的基于值的二次采样。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf279.
4
A Dataset for Investigations of Amine-Impregnated Solid Adsorbent for Direct Air Capture.用于胺浸渍固体吸附剂直接空气捕获研究的数据集。
Sci Data. 2025 May 1;12(1):724. doi: 10.1038/s41597-025-05037-1.
5
A highly resolved integrated transcriptomic atlas of human breast cancers.一份高分辨率的人类乳腺癌综合转录组图谱。
bioRxiv. 2025 May 3:2025.03.13.643025. doi: 10.1101/2025.03.13.643025.
6
A highly resolved integrated single-cell atlas of HPV-negative head and neck cancer.一份高分辨率的HPV阴性头颈癌综合单细胞图谱。
bioRxiv. 2025 Mar 4:2025.03.02.640812. doi: 10.1101/2025.03.02.640812.
scGPT:迈向使用生成式人工智能构建单细胞多组学基础模型
Nat Methods. 2024 Aug;21(8):1470-1480. doi: 10.1038/s41592-024-02201-0. Epub 2024 Feb 26.
4
scDisInFact: disentangled learning for integration and prediction of multi-batch multi-condition single-cell RNA-sequencing data.scDisInFact:用于多批次多条件单细胞 RNA-seq 数据的集成和预测的解缠学习。
Nat Commun. 2024 Jan 30;15(1):912. doi: 10.1038/s41467-024-45227-w.
5
scPerturb: harmonized single-cell perturbation data.scPerturb:协调的单细胞扰动数据。
Nat Methods. 2024 Mar;21(3):531-540. doi: 10.1038/s41592-023-02144-y. Epub 2024 Jan 26.
6
A fast, scalable and versatile tool for analysis of single-cell omics data.一种快速、可扩展且功能多样的单细胞组学数据分析工具。
Nat Methods. 2024 Feb;21(2):217-227. doi: 10.1038/s41592-023-02139-9. Epub 2024 Jan 8.
7
Single-cell analysis of chromatin accessibility in the adult mouse brain.成年鼠脑染色质可及性的单细胞分析。
Nature. 2023 Dec;624(7991):378-389. doi: 10.1038/s41586-023-06824-9. Epub 2023 Dec 13.
8
A high-resolution transcriptomic and spatial atlas of cell types in the whole mouse brain.全脑细胞类型的高分辨率转录组学和空间图谱
Nature. 2023 Dec;624(7991):317-332. doi: 10.1038/s41586-023-06812-z. Epub 2023 Dec 13.
9
Molecularly defined and spatially resolved cell atlas of the whole mouse brain.分子定义和空间分辨的全鼠脑细胞图谱。
Nature. 2023 Dec;624(7991):343-354. doi: 10.1038/s41586-023-06808-9. Epub 2023 Dec 13.
10
Inferring gene regulatory network from single-cell transcriptomes with graph autoencoder model.基于图自动编码器模型从单细胞转录组数据推断基因调控网络。
PLoS Genet. 2023 Sep 13;19(9):e1010942. doi: 10.1371/journal.pgen.1010942. eCollection 2023 Sep.