Suppr超能文献

SPLASH:一种无参考基因组统计算法统一了生物学发现。

SPLASH: a statistical, reference-free genomic algorithm unifies biological discovery.

作者信息

Chaung Kaitlin, Baharav Tavor Z, Henderson George, Zheludev Ivan N, Wang Peter L, Salzman Julia

机构信息

Department of Biomedical Data Science, Stanford University, Stanford, 94305, USA.

Department of Biochemistry, Stanford University, Stanford, 94305, USA.

出版信息

bioRxiv. 2023 Jul 31:2022.06.24.497555. doi: 10.1101/2022.06.24.497555.

Abstract

Today's genomics workflows typically require alignment to a reference sequence, which limits discovery. We introduce a new unifying paradigm, SPLASH (Statistically Primary aLignment Agnostic Sequence Homing), an approach that directly analyzes raw sequencing data to detect a signature of regulation: sample-specific sequence variation. The approach, which includes a new statistical test, is computationally efficient and can be run at scale. SPLASH unifies detection of myriad forms of sequence variation. We demonstrate that SPLASH identifies complex mutation patterns in SARS-CoV-2 strains, discovers regulated RNA isoforms at the single cell level, documents the vast sequence diversity of adaptive immune receptors, and uncovers biology in non-model organisms undocumented in their reference genomes: geographic and seasonal variation and diatom association in eelgrass, an oceanic plant impacted by climate change, and tissue-specific transcripts in octopus. SPLASH is a new unifying approach to genomic analysis that enables an expansive scope of discovery without metadata or references.

摘要

当今的基因组学工作流程通常需要与参考序列进行比对,这限制了新发现。我们引入了一种新的统一范式——SPLASH(统计上主要的无比对序列归巢),这是一种直接分析原始测序数据以检测调控特征的方法:样本特异性序列变异。该方法包括一种新的统计检验,计算效率高且可大规模运行。SPLASH统一了多种形式序列变异的检测。我们证明,SPLASH可识别SARS-CoV-2毒株中的复杂突变模式,在单细胞水平发现受调控的RNA异构体,记录适应性免疫受体的巨大序列多样性,并揭示参考基因组中未记录的非模式生物的生物学特征:气候变化影响的海洋植物鳗草中的地理和季节变异以及硅藻关联,以及章鱼中的组织特异性转录本。SPLASH是一种新的统一基因组分析方法,无需元数据或参考即可实现广泛的发现范围。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da94/10395249/923863bb5ee2/nihpp-2022.06.24.497555v4-f0001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验