• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

T2T-CHM13参考基因组组装揭示了重要的WASH1和GPRIN2旁系同源基因。

The T2T-CHM13 reference assembly uncovers essential WASH1 and GPRIN2 paralogues.

作者信息

Cerdán-Vélez Daniel, Tress Michael Liam

机构信息

Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Madrid 28029, Spain.

出版信息

Bioinform Adv. 2024 Feb 28;4(1):vbae029. doi: 10.1093/bioadv/vbae029. eCollection 2024.

DOI:10.1093/bioadv/vbae029
PMID:38464973
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10924726/
Abstract

SUMMARY

The recently published T2T-CHM13 reference assembly completed the annotation of the final 8% of the human genome. It introduced 1956 genes, close to 100 of which are predicted to be coding because they have a protein coding parent gene. Here, we confirm the coding status and functional relevance of two of these genes, paralogues of and . We find that , one of four novel subtelomeric WASH1 genes uncovered in the new assembly, produces the WASH1 protein that forms part of the vital actin-regulatory WASH complex. Its coding status is supported by abundant proteomics, conservation, and cDNA evidence. It was previously assumed that gene produced the functional WASH1 protein, but new evidence shows that is a human-derived duplication and likely to be one of 12 WASH1 pseudogenes in the human gene set. We also find that the T2T-CHM13 assembly has added a functionally important copy of to the human gene set. We demonstrate that uniquely mapping peptides from proteomics databases support the novel rather than the GRCh38 assembly gene. These new additions to the set of human coding genes underlines the importance of the new T2T-CHM13 assembly.

AVAILABILITY AND IMPLEMENTATION

None.

摘要

摘要

最近发布的T2T-CHM13参考基因组完成了人类基因组最后8%的注释。它引入了1956个基因,其中近100个预计为编码基因,因为它们有一个蛋白质编码亲本基因。在此,我们证实了其中两个基因( 和 的旁系同源基因)的编码状态和功能相关性。我们发现,新组装中发现的四个新型亚端粒WASH1基因之一的 ,产生形成重要肌动蛋白调节WASH复合物一部分的WASH1蛋白。其编码状态得到了丰富的蛋白质组学、保守性和cDNA证据的支持。以前认为 基因产生功能性WASH1蛋白,但新证据表明 是一个人类衍生的重复基因,可能是人类基因集中12个WASH1假基因之一。我们还发现,T2T-CHM13组装向人类基因集添加了一个功能重要的 拷贝。我们证明,来自蛋白质组学数据库的唯一映射肽支持新型的 而不是GRCh38组装的 基因。人类编码基因集的这些新增加强调了新的T2T-CHM13组装的重要性。

可用性和实施

无。

相似文献

1
The T2T-CHM13 reference assembly uncovers essential WASH1 and GPRIN2 paralogues.T2T-CHM13参考基因组组装揭示了重要的WASH1和GPRIN2旁系同源基因。
Bioinform Adv. 2024 Feb 28;4(1):vbae029. doi: 10.1093/bioadv/vbae029. eCollection 2024.
2
Lost in the WASH. The functional human WASH complex 1 gene is on chromosome 20.迷失在WASH中。功能性人类WASH复合蛋白1基因位于20号染色体上。
bioRxiv. 2023 Jun 17:2023.06.14.544951. doi: 10.1101/2023.06.14.544951.
3
Nanopore sequencing with T2T-CHM13 for accurate detection and preventing the transmission of structural rearrangements in highly repetitive heterochromatin regions in human embryos.利用 T2T-CHM13 的纳米孔测序技术,准确检测和预防人类胚胎中高度重复异染色质区域结构重排的传播。
Clin Transl Med. 2024 Mar;14(3):e1612. doi: 10.1002/ctm2.1612.
4
Genome-wide maps of highly-similar intrachromosomal repeats that mediate ectopic recombination in three human genome assemblies.在三个人类基因组组装体中,介导异位重组的高度相似的染色体内重复序列的全基因组图谱。
bioRxiv. 2024 Jan 31:2024.01.29.577884. doi: 10.1101/2024.01.29.577884.
5
Inversion polymorphism in a complete human genome assembly.人类基因组完整组装中的倒位多态性。
Genome Biol. 2023 Apr 30;24(1):100. doi: 10.1186/s13059-023-02919-8.
6
Mind the gap: the relevance of the genome reference to resolve rare and pathogenic inversions.注意差距:基因组参考对于解析罕见和致病性倒位的相关性。
medRxiv. 2024 Apr 24:2024.04.22.24305780. doi: 10.1101/2024.04.22.24305780.
7
Localizing unmapped sequences with families to validate the Telomere-to-Telomere assembly and identify new hotspots for genetic diversity.利用家族将未映射的序列本地化,以验证端粒到端粒组装并确定新的遗传多样性热点。
Genome Res. 2023 Oct;33(10):1734-1746. doi: 10.1101/gr.277175.122. Epub 2023 Oct 25.
8
The benefit of a complete reference genome for cancer structural variant analysis.完整参考基因组在癌症结构变异分析中的益处。
medRxiv. 2024 Mar 18:2024.03.15.24304369. doi: 10.1101/2024.03.15.24304369.
9
Enhancing Variant Calling in Whole-exome Sequencing Data Using Population-matched Reference Genomes.使用群体匹配参考基因组增强全外显子组测序数据中的变异检测
Genomics Proteomics Bioinformatics. 2024 Dec 3;22(5). doi: 10.1093/gpbjnl/qzae070.
10
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.

引用本文的文献

1
More than 2,500 coding genes in the human reference gene set still have unsettled status.人类参考基因集中超过2500个编码基因的状态仍未确定。
bioRxiv. 2024 Dec 9:2024.12.05.626965. doi: 10.1101/2024.12.05.626965.
2
A deep audit of the PeptideAtlas database uncovers evidence for unannotated coding genes and aberrant translation.对肽图谱数据库的深入审查发现了未注释编码基因和异常翻译的证据。
bioRxiv. 2024 Nov 15:2024.11.14.623419. doi: 10.1101/2024.11.14.623419.
3
GENCODE 2025: reference gene annotation for human and mouse.GENCODE 2025:人类和小鼠的参考基因注释

本文引用的文献

1
A draft human pangenome reference.人类泛基因组参考草图。
Nature. 2023 May;617(7960):312-324. doi: 10.1038/s41586-023-05896-x. Epub 2023 May 10.
2
Trans-Proteomic Pipeline: Robust Mass Spectrometry-Based Proteomics Data Analysis Suite.跨蛋白质组学分析流程:基于质谱的稳健蛋白质组学数据分析套件。
J Proteome Res. 2023 Feb 3;22(2):615-624. doi: 10.1021/acs.jproteome.2c00624. Epub 2023 Jan 17.
3
GENCODE: reference annotation for the human and mouse genomes in 2023.GENCODE:2023 年人类和小鼠基因组的参考注释。
Nucleic Acids Res. 2025 Jan 6;53(D1):D966-D975. doi: 10.1093/nar/gkae1078.
4
Evidence for widespread translation of 5' untranslated regions.广泛存在 5' 非翻译区翻译的证据。
Nucleic Acids Res. 2024 Aug 12;52(14):8112-8126. doi: 10.1093/nar/gkae571.
Nucleic Acids Res. 2023 Jan 6;51(D1):D942-D949. doi: 10.1093/nar/gkac1071.
4
UniProt: the Universal Protein Knowledgebase in 2023.UniProt:2023 年的通用蛋白质知识库。
Nucleic Acids Res. 2023 Jan 6;51(D1):D523-D531. doi: 10.1093/nar/gkac1052.
5
Database resources of the National Center for Biotechnology Information in 2023.2023 年国立生物技术信息中心的数据库资源。
Nucleic Acids Res. 2023 Jan 6;51(D1):D29-D38. doi: 10.1093/nar/gkac1032.
6
GenBank 2023 update.GenBank 2023 更新。
Nucleic Acids Res. 2023 Jan 6;51(D1):D141-D144. doi: 10.1093/nar/gkac1012.
7
Ensembl 2023.Ensembl 2023.
Nucleic Acids Res. 2023 Jan 6;51(D1):D933-D941. doi: 10.1093/nar/gkac958.
8
A complete reference genome improves analysis of human genetic variation.完整的参考基因组提高了人类遗传变异分析的能力。
Science. 2022 Apr;376(6588):eabl3533. doi: 10.1126/science.abl3533. Epub 2022 Apr 1.
9
The complete sequence of a human genome.人类基因组的完整序列。
Science. 2022 Apr;376(6588):44-53. doi: 10.1126/science.abj6987. Epub 2022 Mar 31.
10
Segmental duplications and their variation in a complete human genome.人类全基因组中的串联重复序列及其变异。
Science. 2022 Apr;376(6588):eabj6965. doi: 10.1126/science.abj6965. Epub 2022 Apr 1.