• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用随机森林优先选择拟南芥中的候选 eQTL 因果基因。

Prioritizing candidate eQTL causal genes in Arabidopsis using RANDOM FORESTS.

机构信息

Bioinformatics Group, Wageningen University and Research, 6708 PB Wageningen, The Netherlands.

出版信息

G3 (Bethesda). 2022 Nov 4;12(11). doi: 10.1093/g3journal/jkac255.

DOI:10.1093/g3journal/jkac255
PMID:36149290
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9635658/
Abstract

Expression quantitative trait locus mapping has been widely used to study the genetic regulation of gene expression in Arabidopsis thaliana. As a result, a large amount of expression quantitative trait locus data has been generated for this model plant; however, only a few causal expression quantitative trait locus genes have been identified, and experimental validation is costly and laborious. A prioritization method could help speed up the identification of causal expression quantitative trait locus genes. This study extends the machine-learning-based QTG-Finder2 method for prioritizing candidate causal genes in phenotype quantitative trait loci to be used for expression quantitative trait loci by adding gene structure, protein interaction, and gene expression. Independent validation shows that the new algorithm can prioritize 16 out of 25 potential expression quantitative trait locus causal genes within the top 20% rank. Several new features are important in prioritizing causal expression quantitative trait locus genes, including the number of protein-protein interactions, unique domains, and introns. Overall, this study provides a foundation for developing computational methods to prioritize candidate expression quantitative trait locus causal genes. The prediction of all genes is available in the AraQTL workbench (https://www.bioinformatics.nl/AraQTL/) to support the identification of gene expression regulators in Arabidopsis.

摘要

表达数量性状基因座(eQTL)图谱分析已广泛应用于研究拟南芥基因表达的遗传调控。作为结果,大量的 eQTL 数据已在该模式植物中生成;然而,仅鉴定出少数因果 eQTL 基因,并且实验验证既昂贵又费力。优先级排序方法可以帮助加快因果 eQTL 基因的鉴定。本研究通过添加基因结构、蛋白质相互作用和基因表达,将基于机器学习的 QTG-Finder2 方法扩展用于表型数量性状基因座中候选因果基因的优先级排序,以用于表达数量性状基因座。独立验证表明,新算法可以在前 20%的排名中优先考虑 25 个潜在的 eQTL 因果基因中的 16 个。在优先考虑因果 eQTL 基因方面,几个新特征很重要,包括蛋白质-蛋白质相互作用的数量、独特的结构域和内含子。总体而言,本研究为开发用于优先考虑候选 eQTL 因果基因的计算方法提供了基础。所有基因的预测都可在 AraQTL 工作平台(https://www.bioinformatics.nl/AraQTL/)中获得,以支持在拟南芥中鉴定基因表达调控因子。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2355/9635658/b3ef9b0440c6/jkac255f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2355/9635658/2e6519690b66/jkac255f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2355/9635658/438a977a6916/jkac255f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2355/9635658/82d3fc75ab20/jkac255f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2355/9635658/830dded2d3eb/jkac255f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2355/9635658/b3ef9b0440c6/jkac255f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2355/9635658/2e6519690b66/jkac255f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2355/9635658/438a977a6916/jkac255f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2355/9635658/82d3fc75ab20/jkac255f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2355/9635658/830dded2d3eb/jkac255f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2355/9635658/b3ef9b0440c6/jkac255f5.jpg

相似文献

1
Prioritizing candidate eQTL causal genes in Arabidopsis using RANDOM FORESTS.利用随机森林优先选择拟南芥中的候选 eQTL 因果基因。
G3 (Bethesda). 2022 Nov 4;12(11). doi: 10.1093/g3journal/jkac255.
2
QTG-Finder2: A Generalized Machine-Learning Algorithm for Prioritizing QTL Causal Genes in Plants.QTG-Finder2:一种用于对植物数量性状基因座(QTL)因果基因进行优先级排序的广义机器学习算法。
G3 (Bethesda). 2020 Jul 7;10(7):2411-2421. doi: 10.1534/g3.120.401122.
3
QTG-Finder: A Machine-Learning Based Algorithm To Prioritize Causal Genes of Quantitative Trait Loci in Arabidopsis and Rice.QTG-Finder:一种基于机器学习的算法,用于优先考虑拟南芥和水稻中数量性状位点的因果基因。
G3 (Bethesda). 2019 Oct 7;9(10):3129-3138. doi: 10.1534/g3.119.400319.
4
AraQTL - workbench and archive for systems genetics in Arabidopsis thaliana.拟南芥系统遗传学的AraQTL工作台与存档库。
Plant J. 2017 Mar;89(6):1225-1235. doi: 10.1111/tpj.13457. Epub 2017 Feb 13.
5
Network Analysis Prioritizes and as the Candidate Genes for Major eQTL Hotspots in Seed Germination of .网络分析将 和 优先列为 种子萌发中主要eQTL热点的候选基因。
G3 (Bethesda). 2020 Nov 5;10(11):4215-4226. doi: 10.1534/g3.120.401477.
6
Expression quantitative trait locus mapping across water availability environments reveals contrasting associations with genomic features in Arabidopsis.水分可利用性环境下的表达数量性状基因座定位揭示了拟南芥中与基因组特征的对比关联。
Plant Cell. 2013 Sep;25(9):3266-79. doi: 10.1105/tpc.113.115352. Epub 2013 Sep 17.
7
Quantitative Trait Locus and Genetical Genomics Analysis Identifies Putatively Causal Genes for Fecundity and Brooding in the Chicken.数量性状基因座与遗传基因组学分析鉴定出鸡繁殖力和育雏行为的潜在因果基因。
G3 (Bethesda). 2015 Dec 4;6(2):311-9. doi: 10.1534/g3.115.024299.
8
Identification of enzymatic and regulatory genes of plant metabolism through QTL analysis in Arabidopsis.通过拟南芥的 QTL 分析鉴定植物代谢的酶和调控基因。
J Plant Physiol. 2011 Aug 15;168(12):1387-94. doi: 10.1016/j.jplph.2011.03.008. Epub 2011 May 4.
9
Regulatory network construction in Arabidopsis by using genome-wide gene expression quantitative trait loci.利用全基因组基因表达数量性状位点构建拟南芥调控网络
Proc Natl Acad Sci U S A. 2007 Jan 30;104(5):1708-13. doi: 10.1073/pnas.0610429104. Epub 2007 Jan 19.
10
Quantitative trait loci and candidate genes underlying genotype by environment interaction in the response of Arabidopsis thaliana to drought.拟南芥响应干旱的基因型-环境互作的数量性状位点和候选基因。
Plant Cell Environ. 2015 Mar;38(3):585-99. doi: 10.1111/pce.12418. Epub 2014 Sep 3.

引用本文的文献

1
Plasticity of maternal environment-dependent expression-QTLs of tomato seeds.母体环境依赖型表达-QTL 对番茄种子的可塑性。
Theor Appl Genet. 2023 Feb 22;136(2):28. doi: 10.1007/s00122-023-04322-0.

本文引用的文献

1
Opening the Black Box: The Promise and Limitations of Explainable Machine Learning in Cardiology.揭开黑箱:可解释机器学习在心脏病学中的前景与局限。
Can J Cardiol. 2022 Feb;38(2):204-213. doi: 10.1016/j.cjca.2021.09.004. Epub 2021 Sep 14.
2
Comparative transcriptomic analysis reveals conserved programmes underpinning organogenesis and reproduction in land plants.比较转录组学分析揭示了陆地植物器官发生和生殖的保守程序。
Nat Plants. 2021 Aug;7(8):1143-1159. doi: 10.1038/s41477-021-00958-2. Epub 2021 Jul 12.
3
Network Analysis Prioritizes and as the Candidate Genes for Major eQTL Hotspots in Seed Germination of .
网络分析将 和 优先列为 种子萌发中主要eQTL热点的候选基因。
G3 (Bethesda). 2020 Nov 5;10(11):4215-4226. doi: 10.1534/g3.120.401477.
4
A Comprehensive Online Database for Exploring ∼20,000 Public Arabidopsis RNA-Seq Libraries.一个用于探索约20,000个公开拟南芥RNA测序文库的综合在线数据库。
Mol Plant. 2020 Sep 7;13(9):1231-1233. doi: 10.1016/j.molp.2020.08.001. Epub 2020 Aug 5.
5
QTG-Finder2: A Generalized Machine-Learning Algorithm for Prioritizing QTL Causal Genes in Plants.QTG-Finder2:一种用于对植物数量性状基因座(QTL)因果基因进行优先级排序的广义机器学习算法。
G3 (Bethesda). 2020 Jul 7;10(7):2411-2421. doi: 10.1534/g3.120.401122.
6
The Gene Underlies Variation in Chemotherapeutic Responses.该基因是化疗反应差异的基础。
G3 (Bethesda). 2020 Jul 7;10(7):2353-2364. doi: 10.1534/g3.120.401310.
7
Gene Ontology Semantic Similarity Analysis Using GOSemSim.使用 GOSemSim 进行基因本体论语义相似性分析。
Methods Mol Biol. 2020;2117:207-215. doi: 10.1007/978-1-0716-0301-7_11.
8
PlantRegMap: charting functional regulatory maps in plants.植物调控图谱绘制:绘制植物中的功能调控图谱。
Nucleic Acids Res. 2020 Jan 8;48(D1):D1104-D1113. doi: 10.1093/nar/gkz1020.
9
QTG-Finder: A Machine-Learning Based Algorithm To Prioritize Causal Genes of Quantitative Trait Loci in Arabidopsis and Rice.QTG-Finder:一种基于机器学习的算法,用于优先考虑拟南芥和水稻中数量性状位点的因果基因。
G3 (Bethesda). 2019 Oct 7;9(10):3129-3138. doi: 10.1534/g3.119.400319.
10
STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets.STRING v11:具有增强覆盖范围的蛋白质-蛋白质相互作用网络,支持在全基因组实验数据集的功能发现。
Nucleic Acids Res. 2019 Jan 8;47(D1):D607-D613. doi: 10.1093/nar/gky1131.