• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类泛基因组中基因家族的结构变异、选择与多样化

Structural variation, selection, and diversification of the gene family from the human pangenome.

作者信息

Dishuck Philip C, Munson Katherine M, Lewis Alexandra P, Dougherty Max L, Underwood Jason G, Harvey William T, Hsieh PingHsun, Pastinen Tomi, Eichler Evan E

机构信息

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Present address: Tisch Cancer Institute, Division of Hematology and Medical Oncology, The Icahn School of Medicine at Mount Sinai, New York, NY, USA.

出版信息

bioRxiv. 2025 Feb 5:2025.02.04.636496. doi: 10.1101/2025.02.04.636496.

DOI:10.1101/2025.02.04.636496
PMID:39975192
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11838601/
Abstract

The (nuclear pore interacting protein) gene family has expanded to high copy number in humans and African apes where it has been subject to an excess of amino acid replacement consistent with positive selection (1). Due to the limitations of short-read sequencing, human genetic diversity has been poorly understood. Using highly accurate assemblies generated from long-read sequencing as part of the human pangenome, we completely characterize 169 human haplotypes (4,665 paralogs and alleles). Of the 28 paralogs, just three (, , and ) are fixed at a single copy, and only a single locus, , shows no structural variation. Four paralogs map to large segmental duplication blocks that mediate polymorphic inversions (355 kbp-1.6 Mbp) corresponding to microdeletions associated with developmental delay and autism. Haplotype-based tests of positive selection and selective sweeps identify two paralogs, and , within the top percentile for both tests. Using full-length cDNA data from 101 tissue/cell types, we construct paralog-specific gene models and show that 56% (31/55 most abundant isoforms) have not been previously described in RefSeq. We define six distinct translation start sites and other protein structural features that distinguish paralogs, including a variable number tandem repeat that encodes a beta helix of variable size that emerged ~3.1 million years ago in human evolution. Among the 28 paralogs, we identify distinct tissue and developmental patterns of expression with only a few maintaining the ancestral testis-enriched expression. A subset of paralogs (, , , , and ) show increased brain expression. Our results suggest ongoing positive selection in the human population and rapid diversification of gene models.

摘要

(核孔相互作用蛋白)基因家族在人类和非洲猿中扩增至高拷贝数,在那里它经历了与正选择一致的过量氨基酸替换(1)。由于短读长测序的局限性,人类遗传多样性一直未得到充分了解。利用作为人类泛基因组一部分的长读长测序生成的高精度组装序列,我们完整地表征了169个人类单倍型(4665个旁系同源基因和等位基因)。在28个旁系同源基因中,只有三个(、和)以单拷贝固定,并且只有一个位点()没有结构变异。四个旁系同源基因映射到介导多态性倒位(355 kbp - 1.6 Mbp)的大片段重复区域,这些倒位与发育迟缓及自闭症相关的微缺失相对应。基于单倍型的正选择和选择性清除测试确定了两个旁系同源基因(和)在两项测试中均处于前百分之一。利用来自101种组织/细胞类型的全长cDNA数据,我们构建了旁系同源基因特异性基因模型,并表明56%(55种最丰富的异构体中的31种)在RefSeq中未曾被描述过。我们定义了六个不同的翻译起始位点和其他区分旁系同源基因的蛋白质结构特征,包括一个可变数量串联重复序列,它编码一个大小可变的β螺旋,该螺旋在人类进化过程中大约310万年前出现。在28个旁系同源基因中,我们确定了不同的组织和发育表达模式,只有少数保持了祖先在睾丸中富集的表达。一部分旁系同源基因(、、、和)在大脑中的表达增加。我们的结果表明人类群体中正在进行正选择,并且基因模型快速多样化。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd89/11838601/4dc9df180b47/nihpp-2025.02.04.636496v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd89/11838601/74ce46ab7df4/nihpp-2025.02.04.636496v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd89/11838601/4bd17aa7b3ea/nihpp-2025.02.04.636496v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd89/11838601/259d3d4e7b50/nihpp-2025.02.04.636496v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd89/11838601/294f89b2b611/nihpp-2025.02.04.636496v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd89/11838601/4bd14b4baf9c/nihpp-2025.02.04.636496v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd89/11838601/4dc9df180b47/nihpp-2025.02.04.636496v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd89/11838601/74ce46ab7df4/nihpp-2025.02.04.636496v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd89/11838601/4bd17aa7b3ea/nihpp-2025.02.04.636496v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd89/11838601/259d3d4e7b50/nihpp-2025.02.04.636496v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd89/11838601/294f89b2b611/nihpp-2025.02.04.636496v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd89/11838601/4bd14b4baf9c/nihpp-2025.02.04.636496v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd89/11838601/4dc9df180b47/nihpp-2025.02.04.636496v1-f0006.jpg

相似文献

1
Structural variation, selection, and diversification of the gene family from the human pangenome.人类泛基因组中基因家族的结构变异、选择与多样化
bioRxiv. 2025 Feb 5:2025.02.04.636496. doi: 10.1101/2025.02.04.636496.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Structural variation, selection, and diversification of the NPIP gene family from the human pangenome.人类泛基因组中NPIP基因家族的结构变异、选择与多样化
Cell Genom. 2025 Aug 19:100977. doi: 10.1016/j.xgen.2025.100977.
4
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
5
Population differences of chromosome 22q11.2 duplication structure predispose differentially to microdeletion and inversion.22号染色体q11.2区域重复结构的人群差异对微缺失和倒位具有不同的易感性。
bioRxiv. 2025 Jul 7:2025.07.04.662981. doi: 10.1101/2025.07.04.662981.
6
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
7
Interventions for central serous chorioretinopathy: a network meta-analysis.中心性浆液性脉络膜视网膜病变的干预措施:一项网状Meta分析
Cochrane Database Syst Rev. 2025 Jun 16;6(6):CD011841. doi: 10.1002/14651858.CD011841.pub3.
8
Can a Liquid Biopsy Detect Circulating Tumor DNA With Low-passage Whole-genome Sequencing in Patients With a Sarcoma? A Pilot Evaluation.液体活检能否通过低深度全基因组测序检测肉瘤患者的循环肿瘤DNA?一项初步评估。
Clin Orthop Relat Res. 2025 Jan 1;483(1):39-48. doi: 10.1097/CORR.0000000000003161. Epub 2024 Jun 21.
9
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.
10
Polaris: Polarization of ancestral and derived polymorphic alleles for inferences of extended haplotype homozygosity in human populations.北极星:通过祖先和衍生多态性等位基因的极化来推断人类群体中扩展单倍型纯合性。
Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf171.

本文引用的文献

1
Cas13d-mediated isoform-specific RNA knockdown with a unified computational and experimental toolbox.利用统一的计算和实验工具箱实现Cas13d介导的异构体特异性RNA敲低。
Nat Commun. 2025 Jul 29;16(1):6948. doi: 10.1038/s41467-025-62066-5.
2
Complex genetic variation in nearly complete human genomes.近乎完整的人类基因组中的复杂遗传变异。
Nature. 2025 Jul 23. doi: 10.1038/s41586-025-09140-6.
3
SVbyEye: A visual tool to characterize structural variation among whole-genome assemblies.SVbyEye:一种用于表征全基因组组装之间结构变异的可视化工具。
Bioinformatics. 2025 Jun 6. doi: 10.1093/bioinformatics/btaf332.
4
The GIAB genomic stratifications resource for human reference genomes.GIAB 基因组分层资源用于人类参考基因组。
Nat Commun. 2024 Oct 19;15(1):9029. doi: 10.1038/s41467-024-53260-y.
5
Independent expansion, selection, and hypervariability of the gene family in humans.人类基因家族的独立扩张、选择和高变异性。
Genome Res. 2024 Nov 20;34(11):1798-1810. doi: 10.1101/gr.279299.124.
6
Long-read proteogenomics to connect disease-associated sQTLs to the protein isoform effectors of disease.长读蛋白质组学将疾病相关的 sQTL 与疾病的蛋白质同工型效应物联系起来。
Am J Hum Genet. 2024 Sep 5;111(9):1914-1931. doi: 10.1016/j.ajhg.2024.07.003. Epub 2024 Jul 29.
7
SQANTI3: curation of long-read transcriptomes for accurate identification of known and novel isoforms.SQANTI3:长读转录组的编目,用于准确识别已知和新的异构体。
Nat Methods. 2024 May;21(5):793-797. doi: 10.1038/s41592-024-02229-2. Epub 2024 Mar 20.
8
Recombinations, chains and caps: resolving problems with the DCJ-indel model.重组、链与端粒帽:用DCJ-插入缺失模型解决问题
Algorithms Mol Biol. 2024 Feb 27;19(1):8. doi: 10.1186/s13015-024-00253-7.
9
Identification of region-specific gene isoforms in the human brain using long-read transcriptome sequencing.利用长读转录组测序鉴定人脑的区域特异性基因亚型。
Sci Adv. 2024 Jan 26;10(4):eadj5279. doi: 10.1126/sciadv.adj5279. Epub 2024 Jan 24.
10
selscan 2.0: scanning for sweeps in unphased data.selscan 2.0:在非相位数据中扫描扫描。
Bioinformatics. 2024 Jan 2;40(1). doi: 10.1093/bioinformatics/btae006.