Assembly of a pan-genome from deep sequencing of 910 humans of African descent.

Affiliations

Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, MD, USA.

Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA.

Publication information

Nat Genet. 2019 Jan;51(1):30-35. doi: 10.1038/s41588-018-0273-y. Epub 2018 Nov 19.

DOI:10.1038/s41588-018-0273-y
PMID:30455414
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6309586/

Abstract

We used a deeply sequenced dataset of 910 individuals, all of African descent, to construct a set of DNA sequences that is present in these individuals but missing from the reference human genome. We aligned 1.19 trillion reads from the 910 individuals to the reference genome (GRCh38), collected all reads that failed to align, and assembled these reads into contiguous sequences (contigs). We then compared all contigs to one another to identify a set of unique sequences representing regions of the African pan-genome missing from the reference genome. Our analysis revealed 296,485,284 bp in 125,715 distinct contigs present in the populations of African descent, demonstrating that the African pan-genome contains ~10% more DNA than the current human reference genome. Although the functional significance of nearly all of this sequence is unknown, 387 of the novel contigs fall within 315 distinct protein-coding genes, and the rest appear to be intergenic.
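
The figures in the abstract can be cross-checked with a little arithmetic. The sketch below derives the mean number of reads per individual, the mean length of a novel contig, and the size of the novel sequence relative to the reference. The reference size used here (`ASSUMED_REFERENCE_BP`, roughly 3.1 Gbp for GRCh38) is an assumption for illustration and is not stated in the abstract; the function name `summarize` is likewise hypothetical.

```python
# Figures quoted directly in the abstract.
TOTAL_READS = 1.19e12        # reads aligned to GRCh38
INDIVIDUALS = 910            # sequenced individuals
NOVEL_BP = 296_485_284       # total novel sequence, in base pairs
NOVEL_CONTIGS = 125_715      # number of distinct novel contigs

# ASSUMPTION: approximate total length of the GRCh38 reference.
# This value is not given in the abstract; adjust it as needed.
ASSUMED_REFERENCE_BP = 3.1e9


def summarize(reference_bp: float = ASSUMED_REFERENCE_BP) -> dict:
    """Derive simple summary statistics from the abstract's numbers."""
    return {
        "reads_per_individual": TOTAL_READS / INDIVIDUALS,
        "mean_contig_bp": NOVEL_BP / NOVEL_CONTIGS,
        "extra_fraction": NOVEL_BP / reference_bp,
    }


stats = summarize()
print(f"Mean reads per individual: {stats['reads_per_individual']:.3e}")
print(f"Mean novel contig length:  {stats['mean_contig_bp']:.0f} bp")
print(f"Novel sequence vs. assumed reference: {stats['extra_fraction']:.1%}")
```

Under the assumed reference size, the ratio comes out slightly below 10%, which is consistent with the "~10% more DNA" stated in the abstract. The mean contig length is only an average; the abstract does not describe the length distribution.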

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d5c/6309586/e1579f99158f/nihms-1509230-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d5c/6309586/21405849f30a/nihms-1509230-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d5c/6309586/048acd079384/nihms-1509230-f0002.jpg

Similar articles

1
Assembly of a pan-genome from deep sequencing of 910 humans of African descent.
Nat Genet. 2019 Jan;51(1):30-35. doi: 10.1038/s41588-018-0273-y. Epub 2018 Nov 19.
2
HUPAN: a pan-genome analysis pipeline for human genomes.
Genome Biol. 2019 Jul 31;20(1):149. doi: 10.1186/s13059-019-1751-y.
3
Nanopore sequencing and assembly of a human genome with ultra-long reads.
Nat Biotechnol. 2018 Apr;36(4):338-345. doi: 10.1038/nbt.4060. Epub 2018 Jan 29.
4
Anchored pseudo-de novo assembly of human genomes identifies extensive sequence variation from unmapped sequence reads.
Hum Genet. 2016 Jul;135(7):727-40. doi: 10.1007/s00439-016-1667-5. Epub 2016 Apr 9.
5
The first complete genome sequence of the African swine fever virus genotype X and serogroup 7 isolated in domestic pigs from the Democratic Republic of Congo.
Virol J. 2021 Jan 21;18(1):23. doi: 10.1186/s12985-021-01497-0.
6
Discovery of Novel Sequences in 1,000 Swedish Genomes.
Mol Biol Evol. 2020 Jan 1;37(1):18-30. doi: 10.1093/molbev/msz176.
7
AlignGraph2: similar genome-assisted reassembly pipeline for PacBio long reads.
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab022.
8
Demonstrating the utility of flexible sequence queries against indexed short reads with FlexTyper.
PLoS Comput Biol. 2021 Mar 22;17(3):e1008815. doi: 10.1371/journal.pcbi.1008815. eCollection 2021 Mar.
9
GapPredict - A Language Model for Resolving Gaps in Draft Genome Assemblies.
IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2802-2808. doi: 10.1109/TCBB.2021.3109557. Epub 2021 Dec 8.
10
Assembly and Analysis of Unmapped Genome Sequence Reads Reveal Novel Sequence and Variation in Dogs.
Sci Rep. 2018 Jul 18;8(1):10862. doi: 10.1038/s41598-018-29190-3.

Cited by

1
The highly dynamic pangenome of basal chordates is enriched in defence and immunity genes and is inherited following the Mendelian law.
PLoS Genet. 2025 Aug 18;21(8):e1011833. doi: 10.1371/journal.pgen.1011833. eCollection 2025 Aug.
2
Mango pangenome reveals dramatic impacts of reference bias on population genomic analyses.
Hortic Res. 2025 Jul 1;12(9):uhaf166. doi: 10.1093/hr/uhaf166. eCollection 2025 Sep.
3
Human-specific gene expansions contribute to brain evolution.
Cell. 2025 Jul 18. doi: 10.1016/j.cell.2025.06.037.
4
Integrating parental genomes to reduce reference bias and identify intramuscular fat genes in Qinchuan Black pigs.
J Anim Sci Biotechnol. 2025 Jul 20;16(1):104. doi: 10.1186/s40104-025-01236-3.
5
Construction of the graph genomes of Takifugu provides novel insights into the genomic mechanisms of population structure and migratory traits.
BMC Biol. 2025 Jul 1;23(1):195. doi: 10.1186/s12915-025-02296-7.
6
Nephrogenomics, precision medicine and the role of genetic testing in adult kidney disease management.
Nat Rev Nephrol. 2025 Jun 16. doi: 10.1038/s41581-025-00970-1.
7
Ancestry-linked stromal variations impact breast epithelial cell invasion.
iScience. 2025 May 16;28(6):112686. doi: 10.1016/j.isci.2025.112686. eCollection 2025 Jun 20.
8
Assembling unmapped reads reveals hidden variation in South Asian genomes.
bioRxiv. 2025 May 14:2025.05.14.653340. doi: 10.1101/2025.05.14.653340.
9
Lossless Pangenome Indexing Using Tag Arrays.
bioRxiv. 2025 May 15:2025.05.12.653561. doi: 10.1101/2025.05.12.653561.
10
Genome Sequence of a Marine Threespine Stickleback (Gasterosteus aculeatus) from Rabbit Slough in the Cook Inlet.
G3 (Bethesda). 2025 May 23. doi: 10.1093/g3journal/jkaf114.