Integration of transcriptomics and long-read genomics prioritizes structural variants in rare disease.

Authors

Jensen Tanner D, Ni Bohan, Reuter Chloe M, Gorzynski John E, Fazal Sarah, Bonner Devon, Ungar Rachel A, Goddard Pagé C, Raja Archana, Ashley Euan A, Bernstein Jonathan A, Zuchner Stephan, Greicius Michael D, Montgomery Stephen B, Schatz Michael C, Wheeler Matthew T, Battle Alexis

Affiliations

Department of Genetics, Stanford University, Stanford, California 94305, USA.

Department of Computer Science, Johns Hopkins University, Baltimore, Maryland 21218, USA.

Publication

Genome Res. 2025 Apr 14;35(4):914-928. doi: 10.1101/gr.279323.124.

DOI: 10.1101/gr.279323.124
PMID: 40113264
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12047269/

Abstract

Rare structural variants (SVs), that is, insertions, deletions, and complex rearrangements, can cause Mendelian disease, yet they remain difficult to detect and interpret accurately. We sequenced and analyzed Oxford Nanopore Technologies long-read genomes of 68 individuals from the Undiagnosed Diseases Network (UDN) with no previously identified diagnostic mutations from short-read sequencing. Using our optimized SV detection pipelines and 571 control long-read genomes, we detected 716 long-read rare (MAF < 0.01) SV alleles per genome on average, achieving a 2.4× increase from short reads. To characterize the functional effects of rare SVs, we assessed their relationship with gene expression from blood or fibroblasts from the same individuals and found that rare SVs overlapping enhancers were enriched (LOR = 0.46) near expression outliers. We also evaluated tandem repeat expansions (TREs) and found 14 rare TREs per genome; notably, these TREs were also enriched near overexpression outliers. To prioritize candidate functional SVs, we developed Watershed-SV, a probabilistic model that integrates expression data with SV-specific genomic annotations, which significantly outperforms baseline models that do not incorporate expression data. Watershed-SV identified a median of eight high-confidence functional SVs per UDN genome. Notably, this included compound heterozygous deletions in FAM177A1 shared by two siblings, which were likely causal for a rare neurodevelopmental disorder. Our observations demonstrate the promise of integrating long-read sequencing with gene expression toward improving the prioritization of functional SVs and TREs in rare disease patients.
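
The abstract relies on two quantities that are easy to misread: the rarity cutoff (MAF < 0.01) and the enrichment statistic (LOR, a log odds ratio). The Python sketch below illustrates how each could be computed. It is not the authors' pipeline. It assumes that allele frequency is estimated from diploid control genomes only and that the LOR uses the natural logarithm; the function names and all counts are hypothetical.

```python
import math


def is_rare(alt_allele_count: int, n_genomes: int, maf_threshold: float = 0.01) -> bool:
    """Return True if the minor allele frequency is below the threshold.

    Assumption: genomes are diploid, so the allele total is 2 * n_genomes.
    """
    total_alleles = 2 * n_genomes
    freq = alt_allele_count / total_alleles
    maf = min(freq, 1.0 - freq)  # the minor allele is the less common one
    return maf < maf_threshold


def log_odds_ratio(outlier_with_sv: float, outlier_without_sv: float,
                   other_with_sv: float, other_without_sv: float,
                   pseudocount: float = 0.0) -> float:
    """Natural-log odds ratio from a 2x2 table.

    Rows: genes that are expression outliers vs. all other genes.
    Columns: a rare SV is nearby vs. no rare SV is nearby.
    A positive value means rare SVs are enriched near outliers.
    Set pseudocount to 0.5 to avoid division by zero in sparse tables.
    """
    a = outlier_with_sv + pseudocount
    b = outlier_without_sv + pseudocount
    c = other_with_sv + pseudocount
    d = other_without_sv + pseudocount
    return math.log((a * d) / (b * c))


# With 571 diploid controls there are 1142 alleles, so under these
# assumptions an SV seen on 11 alleles is rare and one seen on 12 is not.
print(is_rare(11, 571), is_rare(12, 571))

# Hypothetical counts, chosen only to show the calculation.
print(round(log_odds_ratio(30, 70, 200, 800), 3))
```

Under the natural-log assumption, the reported LOR of 0.46 would correspond to an odds ratio of roughly exp(0.46) ≈ 1.58.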
