长读蛋白质组学将疾病相关的 sQTL 与疾病的蛋白质同工型效应物联系起来。

Long-read proteogenomics to connect disease-associated sQTLs to the protein isoform effectors of disease.

机构信息

Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA; Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA.

Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA; Department of Public Health Sciences, University of Virginia, Charlottesville, VA 22908, USA.

出版信息

Am J Hum Genet. 2024 Sep 5;111(9):1914-1931. doi: 10.1016/j.ajhg.2024.07.003. Epub 2024 Jul 29.

DOI:10.1016/j.ajhg.2024.07.003

PMID:39079539

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11393689/

Abstract

A major fraction of loci identified by genome-wide association studies (GWASs) mediate alternative splicing, but mechanistic interpretation is hindered by the technical limitations of short-read RNA sequencing (RNA-seq), which cannot directly link splicing events to full-length protein isoforms. Long-read RNA-seq represents a powerful tool to characterize transcript isoforms, and recently, infer protein isoform existence. Here, we present an approach that integrates information from GWASs, splicing quantitative trait loci (sQTLs), and PacBio long-read RNA-seq in a disease-relevant model to infer the effects of sQTLs on the ultimate protein isoform products they encode. We demonstrate the utility of our approach using bone mineral density (BMD) GWAS data. We identified 1,863 sQTLs from the Genotype-Tissue Expression (GTEx) project in 732 protein-coding genes that colocalized with BMD associations (H4PP ≥ 0.75). We generated PacBio Iso-Seq data (N = ∼22 million full-length reads) on human osteoblasts, identifying 68,326 protein-coding isoforms, of which 17,375 (25%) were unannotated. By casting the sQTLs onto protein isoforms, we connected 809 sQTLs to 2,029 protein isoforms from 441 genes expressed in osteoblasts. Overall, we found that 74 sQTLs influenced isoforms likely impacted by nonsense-mediated decay and 190 that potentially resulted in the expression of unannotated protein isoforms. Finally, we functionally validated colocalizing sQTLs in TPM2, in which siRNA-mediated knockdown in osteoblasts showed two TPM2 isoforms with opposing effects on mineralization but exhibited no effect upon knockdown of the entire gene. Our approach should be to generalize across diverse clinical traits and to provide insights into protein isoform activities modulated by GWAS loci.

摘要

全基因组关联研究 (GWAS) 鉴定的大多数基因座介导可变剪接，但由于短读长 RNA 测序 (RNA-seq) 的技术限制，机制解释受到阻碍，因为短读长 RNA-seq 无法直接将剪接事件与全长蛋白质亚型联系起来。长读长 RNA-seq 是一种强大的工具，可用于表征转录物亚型，并最近推断蛋白质亚型的存在。在这里，我们提出了一种方法，该方法将 GWAS、剪接数量性状基因座 (sQTL) 和 PacBio 长读长 RNA-seq 的信息整合到疾病相关模型中，以推断 sQTL 对其编码的最终蛋白质亚型产物的影响。我们使用骨密度 (BMD) GWAS 数据证明了我们方法的实用性。我们从 GTEx 项目中鉴定了 732 个蛋白质编码基因中的 1,863 个 sQTL，这些基因座与 BMD 关联 (H4PP ≥ 0.75) 共定位。我们在人类成骨细胞上生成了 PacBio Iso-Seq 数据 (N = ∼2200 万全长读数)，鉴定了 68,326 个蛋白质编码亚型，其中 17,375(25%)未注释。通过将 sQTL 映射到蛋白质亚型上，我们将 441 个基因中表达的 809 个 sQTL 与 2,029 个蛋白质亚型联系起来。总的来说，我们发现 74 个 sQTL 影响了可能受无意义介导的衰变影响的亚型，而 190 个 sQTL 可能导致未注释蛋白质亚型的表达。最后，我们在 TPM2 中对共定位的 sQTL 进行了功能验证，其中 siRNA 介导的成骨细胞敲低显示出两种 TPM2 亚型对矿化有相反的影响，但敲低整个基因则没有影响。我们的方法应该推广到不同的临床特征，并深入了解 GWAS 基因座调节的蛋白质亚型活性。

相似文献

Long-read proteogenomics to connect disease-associated sQTLs to the protein isoform effectors of disease.长读蛋白质组学将疾病相关的 sQTL 与疾病的蛋白质同工型效应物联系起来。

Am J Hum Genet. 2024 Sep 5;111(9):1914-1931. doi: 10.1016/j.ajhg.2024.07.003. Epub 2024 Jul 29.

Long-read proteogenomics to connect disease-associated sQTLs to the protein isoform effectors of disease.长读长片段蛋白质基因组学将疾病相关的剪接定量性状位点与疾病的蛋白质异构体效应器联系起来。

bioRxiv. 2023 Mar 21:2023.03.17.531557. doi: 10.1101/2023.03.17.531557.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Identification of new candidate genes affecting drip loss in pigs based on genomics and transcriptomics data.基于基因组学和转录组学数据鉴定影响猪滴水损失的新候选基因。

J Anim Sci. 2025 Jan 4;103. doi: 10.1093/jas/skaf177.

Systematic analysis of the effects of splicing on the diversity of post-translational modifications in protein isoforms using PTM-POSE.使用PTM-POSE对剪接对蛋白质异构体翻译后修饰多样性的影响进行系统分析。

bioRxiv. 2025 Mar 27:2024.01.10.575062. doi: 10.1101/2024.01.10.575062.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病：网络荟萃分析。

Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

Antidepressants for pain management in adults with chronic pain: a network meta-analysis.抗抑郁药治疗成人慢性疼痛的疼痛管理：一项网络荟萃分析。

Health Technol Assess. 2024 Oct;28(62):1-155. doi: 10.3310/MKRT2948.

Comparison of self-administered survey questionnaire responses collected using mobile apps versus other methods.使用移动应用程序与其他方法收集的自我管理调查问卷回复的比较。

Cochrane Database Syst Rev. 2015 Jul 27;2015(7):MR000042. doi: 10.1002/14651858.MR000042.pub2.

Automated devices for identifying peripheral arterial disease in people with leg ulceration: an evidence synthesis and cost-effectiveness analysis.用于识别下肢溃疡患者外周动脉疾病的自动化设备：证据综合和成本效益分析。

Health Technol Assess. 2024 Aug;28(37):1-158. doi: 10.3310/TWCG3912.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗：一项网状Meta分析。

Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.

引用本文的文献

Tropomyosin isoforms encoded by TPM2 control the actin-bundling activity of fascin-1.由TPM2编码的原肌球蛋白同工型控制丝束蛋白-1的肌动蛋白成束活性。

Biol Res. 2025 Aug 31;58(1):60. doi: 10.1186/s40659-025-00640-3.

Protein Sequencing with Single Amino Acid Resolution Discerns Peptides That Discriminate Tropomyosin Proteoforms.具有单氨基酸分辨率的蛋白质测序可识别区分原肌球蛋白蛋白变体的肽段。

J Proteome Res. 2025 Jun 10. doi: 10.1021/acs.jproteome.4c00978.

Biosurfer for systematic tracking of regulatory mechanisms leading to protein isoform diversity.用于系统追踪导致蛋白质异构体多样性的调控机制的生物冲浪者。（备注：这里“Biosurfer”可能是一个特定的专业术语或新造词，直接按字面翻译，具体含义可能需结合相关领域知识进一步理解）

Genome Res. 2025 Apr 14;35(4):1012-1024. doi: 10.1101/gr.279317.124.

Long-read RNA sequencing atlas of human microglia isoforms elucidates disease-associated genetic regulation of splicing.人类小胶质细胞异构体的长读长RNA测序图谱阐明了与疾病相关的剪接基因调控。

Nat Genet. 2025 Mar;57(3):604-615. doi: 10.1038/s41588-025-02099-0. Epub 2025 Mar 3.

Structural variation, selection, and diversification of the gene family from the human pangenome.人类泛基因组中基因家族的结构变异、选择与多样化

bioRxiv. 2025 Feb 5:2025.02.04.636496. doi: 10.1101/2025.02.04.636496.

Full-length transcriptome sequencing of seven tissues of GuShi chickens.固始鸡七个组织的全长转录组测序

Poult Sci. 2025 Feb;104(2):104697. doi: 10.1016/j.psj.2024.104697. Epub 2024 Dec 19.

IS-PRM-Based Peptide Targeting Informed by Long-Read Sequencing for Alternative Proteome Detection.基于 IS-PRM 的肽靶向策略，结合长读测序，用于发现替代蛋白质组。

J Am Soc Mass Spectrom. 2024 Nov 6;35(11):2614-2630. doi: 10.1021/jasms.4c00119. Epub 2024 Jul 16.

Cell type-specific network analysis in Diversity Outbred mice identifies genes potentially responsible for human bone mineral density GWAS associations.多样性远交系小鼠的细胞类型特异性网络分析确定了可能与人类骨密度全基因组关联研究（GWAS）关联相关的基因。

bioRxiv. 2024 May 21:2024.05.20.594981. doi: 10.1101/2024.05.20.594981.

IS-PRM-based peptide targeting informed by long-read sequencing for alternative proteome detection.基于长读长测序的IS-PRM肽靶向技术用于替代蛋白质组检测。

bioRxiv. 2024 Apr 1:2024.04.01.587549. doi: 10.1101/2024.04.01.587549.

Biosurfer for systematic tracking of regulatory mechanisms leading to protein isoform diversity.用于系统追踪导致蛋白质异构体多样性的调控机制的生物冲浪者。（备注：此翻译可能需结合更专业背景知识理解，原英文表述可能不太符合常规准确的医学专业文献表达规范，推测“Biosurfer”可能是某特定系统或工具名称。）

bioRxiv. 2024 Mar 17:2024.03.15.585320. doi: 10.1101/2024.03.15.585320.

本文引用的文献

eQTL Catalogue 2023: New datasets, X chromosome QTLs, and improved detection and visualisation of transcript-level QTLs.eQTL 目录 2023：新数据集、X 染色体 QTL 以及转录水平 QTL 的检测和可视化能力提升。

PLoS Genet. 2023 Sep 18;19(9):e1010932. doi: 10.1371/journal.pgen.1010932. eCollection 2023 Sep.

The variables on RNA molecules: concert or cacophony? Answers in long-read sequencing.RNA分子上的变量：和谐还是杂音？长读长测序给出答案。

Nat Methods. 2023 Jan;20(1):20-24. doi: 10.1038/s41592-022-01715-9.

Integrative transcriptomic analysis of the amyotrophic lateral sclerosis spinal cord implicates glial activation and suggests new risk genes.肌萎缩侧索硬化症脊髓的综合转录组分析表明存在神经胶质激活并提示新的风险基因。

Nat Neurosci. 2023 Jan;26(1):150-162. doi: 10.1038/s41593-022-01205-3. Epub 2022 Dec 8.

Transcriptome-wide association study and eQTL colocalization identify potentially causal genes responsible for human bone mineral density GWAS associations.全转录组关联研究和 eQTL 共定位鉴定了与人类骨密度 GWAS 关联相关的潜在因果基因。

Elife. 2022 Nov 23;11:e77285. doi: 10.7554/eLife.77285.

The next-generation Open Targets Platform: reimagined, redesigned, rebuilt.下一代开放靶点平台：重新构想、重新设计、重新构建。

Nucleic Acids Res. 2023 Jan 6;51(D1):D1353-D1359. doi: 10.1093/nar/gkac1046.

The International Mouse Phenotyping Consortium: comprehensive knockout phenotyping underpinning the study of human disease.国际小鼠表型分析联盟：全面的基因敲除表型分析为人类疾病研究提供支撑。

Nucleic Acids Res. 2023 Jan 6;51(D1):D1038-D1045. doi: 10.1093/nar/gkac972.

Genetic control of RNA splicing and its distinct role in complex trait variation.RNA 剪接的遗传控制及其在复杂性状变异中的独特作用。

Nat Genet. 2022 Sep;54(9):1355-1363. doi: 10.1038/s41588-022-01154-4. Epub 2022 Aug 18.

Bridging the splicing gap in human genetics with long-read RNA sequencing: finding the protein isoform drivers of disease.利用长读 RNA 测序弥合人类遗传学中的剪接缺口：寻找疾病的蛋白质同工型驱动因子。

Hum Mol Genet. 2022 Oct 20;31(R1):R123-R136. doi: 10.1093/hmg/ddac196.

Transcriptome variation in human tissues revealed by long-read sequencing.长读测序揭示人类组织中的转录组变异。

Nature. 2022 Aug;608(7922):353-359. doi: 10.1038/s41586-022-05035-y. Epub 2022 Aug 3.

Splice factor polypyrimidine tract-binding protein 1 (Ptbp1) primes endothelial inflammation in atherogenic disturbed flow conditions.拼接因子多嘧啶 tract 结合蛋白 1（Ptbp1）在动脉粥样硬化性紊乱流条件下启动内皮炎症。

Proc Natl Acad Sci U S A. 2022 Jul 26;119(30):e2122227119. doi: 10.1073/pnas.2122227119. Epub 2022 Jul 18.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验