Suppr超能文献

STRavinsky STR 数据库和 PGTailor PGT 工具表明,在基于 STR 的应用中,CHM13-T2T 优于 hg38 和 hg19。

STRavinsky STR database and PGTailor PGT tool demonstrate superiority of CHM13-T2T over hg38 and hg19 for STR-based applications.

机构信息

Morris Kahn Laboratory of Human Genetics, NIBN and Faculty of Health Sciences, Ben Gurion University of the Negev, Beer Sheva, Israel.

Genetics Institute, Soroka Medical Center, Beer Sheva, Israel.

出版信息

Eur J Hum Genet. 2023 Jul;31(7):738-743. doi: 10.1038/s41431-023-01352-6. Epub 2023 Apr 13.

Abstract

Short-Tandem-Repeats (STRs) have long been studied for possible roles in biological phenomena, and are utilized in multiple applications such as forensics, evolutionary studies and pre-implantation-genetic-testing (PGT). The two reference genomes most used by clinicians and researchers are GRCh37/hg19 and GRCh38/hg38, both constructed using mainly short-read-sequencing (SRS) in which all-STR-containing-reads cannot be assembled to the reference genome. With the introduction of long-read-sequencing (LRS) methods and the generation of the CHM13 reference genome, also known as T2T, many previously unmapped STRs were finally localized within the human genome. We generated STRavinsky, a compact STR database for three reference genomes, including T2T. We proceeded to demonstrate the advantages of T2T over hg19 and hg38, identifying nearly double the number of STRs throughout all chromosomes. Through STRavinsky, providing a resolution down to a specific genomic coordinate, we demonstrated extreme propensity of TGGAA repeats in p arms of acrocentric chromosomes, substantially corroborating early molecular studies suggesting a possible role in formation of Robertsonian translocations. Moreover, we delineated unique propensity of TGGAA repeats specifically in chromosome 16q11.2 and in 9q12. Finally, we harness the superior capabilities of T2T and STRavinsky to generate PGTailor, a novel web application dramatically facilitating design of STR-based PGT tests in mere minutes.

摘要

短串联重复序列 (STRs) 长期以来一直被研究其在生物学现象中的可能作用,并被广泛应用于法医鉴定、进化研究和胚胎植入前基因检测 (PGT) 等多个领域。临床医生和研究人员最常使用的两个参考基因组是 GRCh37/hg19 和 GRCh38/hg38,它们都是使用主要基于短读长测序 (SRS) 构建的,所有包含 STR 的读段都无法组装到参考基因组上。随着长读长测序 (LRS) 方法的引入和 CHM13 参考基因组的生成,即 T2T,许多以前未映射的 STR 最终被定位在人类基因组中。我们生成了 STRavinsky,这是一个用于三个参考基因组的紧凑型 STR 数据库,包括 T2T。我们接着展示了 T2T 相对于 hg19 和 hg38 的优势,在所有染色体上鉴定到的 STR 数量几乎翻了一番。通过 STRavinsky,我们可以提供精确到特定基因组坐标的分辨率,我们展示了在近端着丝粒染色体的 p 臂中 TGGAA 重复的极端倾向,这与早期的分子研究结果非常吻合,这些研究结果表明它们可能在罗伯逊易位的形成中发挥作用。此外,我们还确定了 TGGAA 重复在 16q11.2 和 9q12 上的独特倾向性。最后,我们利用 T2T 和 STRavinsky 的卓越能力生成了 PGTailor,这是一种新型的网络应用程序,仅在几分钟内就可以极大地简化基于 STR 的 PGT 测试的设计。

相似文献

10

本文引用的文献

1
Utility of long-read sequencing for All of Us.长读测序在“所有人”研究中的应用。
Nat Commun. 2024 Jan 29;15(1):837. doi: 10.1038/s41467-024-44804-3.
4
The complete sequence of a human genome.人类基因组的完整序列。
Science. 2022 Apr;376(6588):44-53. doi: 10.1126/science.abj6987. Epub 2022 Mar 31.
5
Familial long-read sequencing increases yield of de novo mutations.家系长读测序提高从头突变的检出率。
Am J Hum Genet. 2022 Apr 7;109(4):631-646. doi: 10.1016/j.ajhg.2022.02.014. Epub 2022 Mar 14.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验