• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用泛基因图探索基因内容。

Exploring gene content with pangene graphs.

作者信息

Li Heng, Marin Maximillian, Farhat Maha Reda

机构信息

Dana-Farber Cancer Institute, Boston, MA 02215, USA.

Harvard Medical School, Boston, MA 02215, USA.

出版信息

Bioinformatics. 2024 Jul 23;40(7). doi: 10.1093/bioinformatics/btae456.

DOI:10.1093/bioinformatics/btae456
PMID:39041615
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11286280/
Abstract

MOTIVATION

The gene content regulates the biology of an organism. It varies between species and between individuals of the same species. Although tools have been developed to identify gene content changes in bacterial genomes, none is applicable to collections of large eukaryotic genomes such as the human pangenome.

RESULTS

We developed pangene, a computational tool to identify gene orientation, gene order and gene copy-number changes in a collection of genomes. Pangene aligns a set of input protein sequences to the genomes, resolves redundancies between protein sequences and constructs a gene graph with each genome represented as a walk in the graph. It additionally finds subgraphs, which we call bibubbles, that capture gene content changes. Applied to the human pangenome, pangene identifies known gene-level variations and reveals complex haplotypes that are not well studied before. Pangene also works with high-quality bacterial pangenome and reports similar numbers of core and accessory genes in comparison to existing tools.

AVAILABILITY AND IMPLEMENTATION

Source code at https://github.com/lh3/pangene; pre-built pangene graphs can be downloaded from https://zenodo.org/records/8118576 and visualized at https://pangene.bioinweb.org.

摘要

动机

基因组成分调控着生物体的生物学特性。它在不同物种之间以及同一物种的不同个体之间存在差异。尽管已经开发出了一些工具来识别细菌基因组中的基因组成分变化,但没有一种工具适用于人类泛基因组等大型真核生物基因组集合。

结果

我们开发了pangene,这是一种计算工具,用于识别基因组集合中的基因方向、基因顺序和基因拷贝数变化。pangene将一组输入的蛋白质序列与基因组进行比对,解决蛋白质序列之间的冗余问题,并构建一个基因图,每个基因组在图中表示为一条路径。它还能找到子图,我们称之为双气泡图,这些子图捕捉了基因组成分的变化。应用于人类泛基因组时,pangene识别出已知的基因水平变异,并揭示了以前研究较少的复杂单倍型。pangene也适用于高质量的细菌泛基因组,与现有工具相比,它报告的核心基因和辅助基因数量相似。

可用性和实现方式

源代码位于https://github.com/lh3/pangene;预构建的pangene图可从https://zenodo.org/records/8118576下载,并在https://pangene.bioinweb.org上进行可视化。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f52/11286280/122f8c9383bb/btae456f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f52/11286280/0dccc37885ee/btae456f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f52/11286280/654870cf62eb/btae456f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f52/11286280/a0ffb5c6d01f/btae456f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f52/11286280/c8153281df85/btae456f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f52/11286280/d34a63b49319/btae456f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f52/11286280/5bd4ded70303/btae456f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f52/11286280/122f8c9383bb/btae456f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f52/11286280/0dccc37885ee/btae456f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f52/11286280/654870cf62eb/btae456f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f52/11286280/a0ffb5c6d01f/btae456f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f52/11286280/c8153281df85/btae456f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f52/11286280/d34a63b49319/btae456f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f52/11286280/5bd4ded70303/btae456f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f52/11286280/122f8c9383bb/btae456f7.jpg

相似文献

1
Exploring gene content with pangene graphs.利用泛基因图探索基因内容。
Bioinformatics. 2024 Jul 23;40(7). doi: 10.1093/bioinformatics/btae456.
2
Exploring gene content with pangene graphs.利用泛基因图探索基因内容。
ArXiv. 2024 May 29:arXiv:2402.16185v3.
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
Short-Term Memory Impairment短期记忆障碍
5
Pairwise graph edit distance characterizes the impact of the construction method on pangenome graphs.成对图编辑距离表征了构建方法对泛基因组图的影响。
Bioinformatics. 2025 May 9. doi: 10.1093/bioinformatics/btaf291.
6
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
7
Can a Liquid Biopsy Detect Circulating Tumor DNA With Low-passage Whole-genome Sequencing in Patients With a Sarcoma? A Pilot Evaluation.液体活检能否通过低深度全基因组测序检测肉瘤患者的循环肿瘤DNA?一项初步评估。
Clin Orthop Relat Res. 2025 Jan 1;483(1):39-48. doi: 10.1097/CORR.0000000000003161. Epub 2024 Jun 21.
8
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.
9
Antidepressants for pain management in adults with chronic pain: a network meta-analysis.抗抑郁药治疗成人慢性疼痛的疼痛管理:一项网络荟萃分析。
Health Technol Assess. 2024 Oct;28(62):1-155. doi: 10.3310/MKRT2948.
10
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

引用本文的文献

1
Transposable elements drive evolution and perturb gene expression in Brassica rapa and B. oleracea.转座元件推动甘蓝型油菜和甘蓝的进化并扰乱其基因表达。
Plant J. 2025 Sep;123(5):e70452. doi: 10.1111/tpj.70452.
2
Phased genome assemblies and pangenome graphs of human populations of Japan and Saudi Arabia.日本和沙特阿拉伯人群的阶段性基因组组装和泛基因组图谱。
Sci Data. 2025 Aug 12;12(1):1316. doi: 10.1038/s41597-025-05652-y.
3
A survey of sequence-to-graph mapping algorithms in the pangenome era.泛基因组时代序列到图谱映射算法综述。

本文引用的文献

1
Full-resolution HLA and KIR gene annotations for human genome assemblies.人类基因组组装的全分辨率 HLA 和 KIR 基因注释。
Genome Res. 2024 Nov 20;34(11):1931-1941. doi: 10.1101/gr.278985.124.
2
The complete sequence and comparative analysis of ape sex chromosomes.猿类性染色体的完整序列与比较分析。
Nature. 2024 Jun;630(8016):401-411. doi: 10.1038/s41586-024-07473-2. Epub 2024 May 29.
3
T2T-YAO: A Telomere-to-telomere Assembled Diploid Reference Genome for Han Chinese.T2T-YAO:一个端粒到端粒组装的中国汉族二倍体参考基因组。
Genome Biol. 2025 May 22;26(1):138. doi: 10.1186/s13059-025-03606-6.
4
Pitfalls of bacterial pan-genome analysis approaches: a case study of Mycobacterium tuberculosis and two less clonal bacterial species.细菌泛基因组分析方法的陷阱:以结核分枝杆菌和另外两种克隆性较低的细菌物种为例的研究
Bioinformatics. 2025 May 6;41(5). doi: 10.1093/bioinformatics/btaf219.
5
Comparative population pangenomes reveal unexpected complexity and fitness effects of structural variants.比较群体泛基因组揭示了结构变异出人意料的复杂性和适应性效应。
bioRxiv. 2025 Feb 13:2025.02.11.637762. doi: 10.1101/2025.02.11.637762.
6
Small variant benchmark from a complete assembly of X and Y chromosomes.来自X和Y染色体完整组装的小变异基准。
Nat Commun. 2025 Jan 8;16(1):497. doi: 10.1038/s41467-024-55710-z.
7
BWT construction and search at the terabase scale.万亿碱基规模下的BWT构建与搜索。
Bioinformatics. 2024 Nov 28;40(12). doi: 10.1093/bioinformatics/btae717.
8
Fitness consequences of structural variation inferred from a House Finch pangenome.从白头翁基因组中推断出的结构变异对其健康状况的影响。
Proc Natl Acad Sci U S A. 2024 Nov 19;121(47):e2409943121. doi: 10.1073/pnas.2409943121. Epub 2024 Nov 12.
9
Haplotype-aware sequence alignment to pangenome graphs.基于单倍型感知的序列比对到泛基因组图谱。
Genome Res. 2024 Oct 11;34(9):1265-1275. doi: 10.1101/gr.279143.124.
Genomics Proteomics Bioinformatics. 2023 Dec;21(6):1085-1100. doi: 10.1016/j.gpb.2023.08.001. Epub 2023 Aug 16.
4
The complete and fully-phased diploid genome of a male Han Chinese.一位男性汉族个体的完整、全面二倍体基因组。
Cell Res. 2023 Oct;33(10):745-761. doi: 10.1038/s41422-023-00849-5. Epub 2023 Jul 14.
5
Phase variation as a major mechanism of adaptation in complex.相变异作为复杂适应的主要机制。
Proc Natl Acad Sci U S A. 2023 Jul 11;120(28):e2301394120. doi: 10.1073/pnas.2301394120. Epub 2023 Jul 3.
6
Multiscale analysis of pangenomes enables improved representation of genomic diversity for repetitive and clinically relevant genes.泛基因组的多尺度分析能够改善对重复和临床相关基因的基因组多样性的表示。
Nat Methods. 2023 Aug;20(8):1213-1221. doi: 10.1038/s41592-023-01914-y. Epub 2023 Jun 26.
7
Challenges in prokaryote pangenomics.原核生物泛基因组学的挑战。
Microb Genom. 2023 May;9(5). doi: 10.1099/mgen.0.001021.
8
A draft human pangenome reference.人类泛基因组参考草图。
Nature. 2023 May;617(7960):312-324. doi: 10.1038/s41586-023-05896-x. Epub 2023 May 10.
9
Increased mutation and gene conversion within human segmental duplications.人类片段重复序列中突变和基因转换的增加。
Nature. 2023 May;617(7960):325-334. doi: 10.1038/s41586-023-05895-y. Epub 2023 May 10.
10
Pangenome graph construction from genome alignments with Minigraph-Cactus.基于 Minigraph-Cactus 的基因组比对构建泛基因组图谱。
Nat Biotechnol. 2024 Apr;42(4):663-673. doi: 10.1038/s41587-023-01793-w. Epub 2023 May 10.