Suppr超能文献

靶向短读测序和重排及候选基因座的组装提供了兆碱基的二倍体型。

Targeted short read sequencing and assembly of re-arrangements and candidate gene loci provide megabase diplotypes.

机构信息

Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA 94305, USA.

Sage Science, Inc., Beverly, MA 01915, USA.

出版信息

Nucleic Acids Res. 2019 Nov 4;47(19):e115. doi: 10.1093/nar/gkz661.

Abstract

The human genome is composed of two haplotypes, otherwise called diplotypes, which denote phased polymorphisms and structural variations (SVs) that are derived from both parents. Diplotypes place genetic variants in the context of cis-related variants from a diploid genome. As a result, they provide valuable information about hereditary transmission, context of SV, regulation of gene expression and other features which are informative for understanding human genetics. Successful diplotyping with short read whole genome sequencing generally requires either a large population or parent-child trio samples. To overcome these limitations, we developed a targeted sequencing method for generating megabase (Mb)-scale haplotypes with short reads. One selects specific 0.1-0.2 Mb high molecular weight DNA targets with custom-designed Cas9-guide RNA complexes followed by sequencing with barcoded linked reads. To test this approach, we designed three assays, targeting the BRCA1 gene, the entire 4-Mb major histocompatibility complex locus and 18 well-characterized SVs, respectively. Using an integrated alignment- and assembly-based approach, we generated comprehensive variant diplotypes spanning the entirety of the targeted loci and characterized SVs with exact breakpoints. Our results were comparable in quality to long read sequencing.

摘要

人类基因组由两个单倍型组成,也称为二倍型,它们表示来自父母双方的阶段性多态性和结构变异(SV)。二倍型将遗传变异置于来自二倍体基因组的顺式相关变异的背景下。因此,它们提供了有关遗传传递、SV 上下文、基因表达调控和其他有助于理解人类遗传学的特征的有价值信息。使用短读全基因组测序进行成功的二倍型分析通常需要大量人群或父母-子女三人样本。为了克服这些限制,我们开发了一种靶向测序方法,可使用短读长生成兆碱基(Mb)规模的单倍型。该方法选择具有定制 Cas9-guide RNA 复合物的特定 0.1-0.2 Mb 高分子量 DNA 靶标,然后用带有条形码的连接读进行测序。为了测试这种方法,我们设计了三个检测,分别针对 BRCA1 基因、整个 4-Mb 主要组织相容性复合物基因座和 18 个特征明确的 SV。我们使用基于整合比对和组装的方法,生成了跨越靶向基因座的完整变异二倍型,并对具有确切断点的 SV 进行了特征描述。我们的结果与长读测序的质量相当。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21c5/6821272/af4ed9c5907d/gkz661fig1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验