• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于470个基因组比对的人类剪接位点注释的保守性评估。

Conservation assessment of human splice site annotation based on a 470-genome alignment.

作者信息

Minkin Ilia, Salzberg Steven L

机构信息

Department of Biomedical Engineering, Johns Hopkins University, 3400 N. Charles Street, Baltimore, MD 21218, United States.

Center for Computational Biology, Johns Hopkins University, 3100 Wyman Park Drive, Baltimore, MD 21211, United States.

出版信息

Nucleic Acids Res. 2025 Mar 20;53(6). doi: 10.1093/nar/gkaf184.

DOI:10.1093/nar/gkaf184
PMID:40119728
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11928937/
Abstract

Despite many improvements over the years, the annotation of the human genome remains imperfect. The use of evolutionarily conserved sequences provides a strategy for selecting a high-confidence subset of the annotation. Using the latest whole-genome alignment, we found that splice sites from protein-coding genes in the high-quality MANE annotation are consistently conserved across >350 species. We also studied splice sites from the RefSeq, GENCODE, and CHESS databases not present in MANE. In addition, we analyzed the completeness of the alignment with respect to the human genome annotations and described a method that would allow us to fix up to 60% of the missing alignments of the protein-coding exons. We trained a logistic regression classifier to distinguish between the conservation exhibited by sites from MANE versus sites chosen randomly from neutrally evolving sequences. We found that splice sites classified by our model as well-supported have lower single nucleotide polymorphism rates and better transcriptomic evidence. We then computed a subset of transcripts using only "well-supported" splice sites or ones from MANE. This subset is enriched in high-confidence transcripts of the major gene catalogs that appear to be under purifying selection and are more likely to be correct and functionally relevant.

摘要

尽管多年来有了许多改进,但人类基因组注释仍不完善。利用进化保守序列为选择高可信度注释子集提供了一种策略。使用最新的全基因组比对,我们发现高质量MANE注释中蛋白质编码基因的剪接位点在超过350个物种中始终保守。我们还研究了MANE中不存在的RefSeq、GENCODE和CHESS数据库中的剪接位点。此外,我们分析了与人类基因组注释相关的比对完整性,并描述了一种方法,该方法可修复高达60%的蛋白质编码外显子缺失比对。我们训练了一个逻辑回归分类器,以区分MANE位点与从中性进化序列中随机选择的位点所表现出的保守性。我们发现,被我们的模型分类为有充分支持的剪接位点具有较低的单核苷酸多态性率和更好的转录组学证据。然后,我们仅使用“有充分支持的”剪接位点或MANE中的剪接位点计算了一个转录本子集。该子集富含主要基因目录中的高可信度转录本,这些转录本似乎受到纯化选择,更有可能是正确的且与功能相关。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/11928937/8a0d82d72de3/gkaf184fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/11928937/c9ba1e69ba4c/gkaf184figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/11928937/6ddfb1681df9/gkaf184fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/11928937/271a96f79168/gkaf184fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/11928937/e722a01cbb34/gkaf184fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/11928937/822aad15ee1e/gkaf184fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/11928937/8a0d82d72de3/gkaf184fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/11928937/c9ba1e69ba4c/gkaf184figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/11928937/6ddfb1681df9/gkaf184fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/11928937/271a96f79168/gkaf184fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/11928937/e722a01cbb34/gkaf184fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/11928937/822aad15ee1e/gkaf184fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/11928937/8a0d82d72de3/gkaf184fig5.jpg

相似文献

1
Conservation assessment of human splice site annotation based on a 470-genome alignment.基于470个基因组比对的人类剪接位点注释的保守性评估。
Nucleic Acids Res. 2025 Mar 20;53(6). doi: 10.1093/nar/gkaf184.
2
Conservation assessment of human splice site annotation based on a 470-genome alignment.基于470个基因组比对的人类剪接位点注释的保守性评估
bioRxiv. 2025 Mar 15:2023.12.01.569581. doi: 10.1101/2023.12.01.569581.
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
5
A New Measure of Quantified Social Health Is Associated With Levels of Discomfort, Capability, and Mental and General Health Among Patients Seeking Musculoskeletal Specialty Care.一种新的量化社会健康指标与寻求肌肉骨骼专科护理的患者的不适程度、能力以及心理和总体健康水平相关。
Clin Orthop Relat Res. 2025 Apr 1;483(4):647-663. doi: 10.1097/CORR.0000000000003394. Epub 2025 Feb 5.
6
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
7
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
8
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
9
Omega-3 fatty acids for depression in adults.成人抑郁症的ω-3脂肪酸治疗
Cochrane Database Syst Rev. 2015 Nov 5;2015(11):CD004692. doi: 10.1002/14651858.CD004692.pub4.
10
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.

引用本文的文献

1
Element-specific estimation of background mutation rates in whole cancer genomes through transfer learning.通过迁移学习对全癌基因组中背景突变率进行元素特异性估计。
NPJ Precis Oncol. 2025 Mar 29;9(1):92. doi: 10.1038/s41698-025-00871-3.

本文引用的文献

1
Neurodevelopmental Disorder Caused by Deletion of , a lncRNA Gene.由 lncRNA 基因缺失引起的神经发育障碍。
N Engl J Med. 2024 Oct 24;391(16):1511-1518. doi: 10.1056/NEJMoa2400718.
2
Random genetic drift sets an upper limit on mRNA splicing accuracy in metazoans.随机遗传漂变对后生动物mRNA剪接准确性设置了上限。
Elife. 2024 Mar 12;13:RP93629. doi: 10.7554/eLife.93629.
3
A genomic mutational constraint map using variation in 76,156 human genomes.基于 76156 个人类基因组的变异,绘制出基因组突变约束图谱。
Nature. 2024 Jan;625(7993):92-100. doi: 10.1038/s41586-023-06045-0. Epub 2023 Dec 6.
4
The UCSC Genome Browser database: 2024 update.UCSC 基因组浏览器数据库:2024 年更新。
Nucleic Acids Res. 2024 Jan 5;52(D1):D1082-D1088. doi: 10.1093/nar/gkad987.
5
CHESS 3: an improved, comprehensive catalog of human genes and transcripts based on large-scale expression data, phylogenetic analysis, and protein structure.CHESS 3:基于大规模表达数据、系统发育分析和蛋白质结构,改进和综合的人类基因和转录本目录。
Genome Biol. 2023 Oct 30;24(1):249. doi: 10.1186/s13059-023-03088-4.
6
The status of the human gene catalogue.人类基因目录的现状。
Nature. 2023 Oct;622(7981):41-47. doi: 10.1038/s41586-023-06490-x. Epub 2023 Oct 4.
7
Branch point strength controls species-specific alternative splicing and regulates LTP.分支点强度控制物种特异性的选择性剪接,并调节 LTP。
Life Sci Alliance. 2022 Dec 21;6(3). doi: 10.26508/lsa.202201826. Print 2023 Mar.
8
Clinical variant interpretation and biologically relevant reference transcripts.临床变异解读与生物学相关参考转录本
NPJ Genom Med. 2022 Oct 18;7(1):59. doi: 10.1038/s41525-022-00329-6.
9
APPRIS principal isoforms and MANE Select transcripts define reference splice variants.APPRIS 主要异构体和 MANE Select 转录本定义参考剪接变体。
Bioinformatics. 2022 Sep 16;38(Suppl_2):ii89-ii94. doi: 10.1093/bioinformatics/btac473.
10
Multiple genome alignment in the telomere-to-telomere assembly era.端粒到端粒组装时代的多基因组比对。
Genome Biol. 2022 Aug 29;23(1):182. doi: 10.1186/s13059-022-02735-6.