• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度多基因神经网络预测和鉴定印度尼西亚水稻品种中的产量相关基因。

Deep polygenic neural network for predicting and identifying yield-associated genes in Indonesian rice accessions.

机构信息

BINUS Graduate Program, Bina Nusantara University, Jakarta, 11480, Indonesia.

School of Computer Science, Bina Nusantara University, Jakarta, 11480, Indonesia.

出版信息

Sci Rep. 2022 Aug 15;12(1):13823. doi: 10.1038/s41598-022-16075-9.

DOI:10.1038/s41598-022-16075-9
PMID:35970979
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9378700/
Abstract

As the fourth most populous country in the world, Indonesia must increase the annual rice production rate to achieve national food security by 2050. One possible solution comes from the nanoscopic level: a genetic variant called Single Nucleotide Polymorphism (SNP), which can express significant yield-associated genes. The prior benchmark of this study utilized a statistical genetics model where no SNP position information and attention mechanism were involved. Hence, we developed a novel deep polygenic neural network, named the NucleoNet model, to address these obstacles. The NucleoNets were constructed with the combination of prominent components that include positional SNP encoding, the context vector, wide models, Elastic Net, and Shannon's entropy loss. This polygenic modeling obtained up to 2.779 of Mean Squared Error (MSE) with 47.156% of Symmetric Mean Absolute Percentage Error (SMAPE), while revealing 15 new important SNPs. Furthermore, the NucleoNets reduced the MSE score up to 32.28% compared to the Ordinary Least Squares (OLS) model. Through the ablation study, we learned that the combination of Xavier distribution for weights initialization and Normal distribution for biases initialization sparked more various important SNPs throughout 12 chromosomes. Our findings confirmed that the NucleoNet model was successfully outperformed the OLS model and identified important SNPs to Indonesian rice yields.

摘要

作为世界第四大人口大国,印度尼西亚必须提高每年的水稻产量,才能在 2050 年实现国家粮食安全。一种可能的解决方案来自纳米级水平:一种名为单核苷酸多态性(SNP)的遗传变异,它可以表达与产量显著相关的基因。本研究的先前基准利用了一个统计遗传学模型,其中不涉及 SNP 位置信息和注意力机制。因此,我们开发了一种新的深度多基因神经网络,命名为 NucleoNet 模型,以解决这些障碍。NucleoNets 是通过结合突出的组件构建的,包括位置 SNP 编码、上下文向量、宽模型、弹性网络和香农熵损失。这种多基因建模获得了高达 2.779 的均方误差(MSE)和 47.156%的对称平均绝对百分比误差(SMAPE),同时揭示了 15 个新的重要 SNP。此外,与普通最小二乘法(OLS)模型相比,NucleoNets 将 MSE 评分降低了 32.28%。通过消融研究,我们了解到,对于权重初始化的 Xavier 分布和对于偏差初始化的正态分布的组合,激发了 12 条染色体中更多不同的重要 SNP。我们的研究结果证实,NucleoNet 模型成功地优于 OLS 模型,并确定了对印度尼西亚水稻产量重要的 SNP。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1868/9378700/24ba32145169/41598_2022_16075_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1868/9378700/8ebcf9555806/41598_2022_16075_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1868/9378700/31fcdd1619e3/41598_2022_16075_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1868/9378700/28b199b2cae0/41598_2022_16075_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1868/9378700/76189cc6feb2/41598_2022_16075_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1868/9378700/26eaf18bac1a/41598_2022_16075_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1868/9378700/a30e4f9ed6c7/41598_2022_16075_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1868/9378700/24ba32145169/41598_2022_16075_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1868/9378700/8ebcf9555806/41598_2022_16075_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1868/9378700/31fcdd1619e3/41598_2022_16075_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1868/9378700/28b199b2cae0/41598_2022_16075_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1868/9378700/76189cc6feb2/41598_2022_16075_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1868/9378700/26eaf18bac1a/41598_2022_16075_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1868/9378700/a30e4f9ed6c7/41598_2022_16075_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1868/9378700/24ba32145169/41598_2022_16075_Fig7_HTML.jpg

相似文献

1
Deep polygenic neural network for predicting and identifying yield-associated genes in Indonesian rice accessions.深度多基因神经网络预测和鉴定印度尼西亚水稻品种中的产量相关基因。
Sci Rep. 2022 Aug 15;12(1):13823. doi: 10.1038/s41598-022-16075-9.
2
Genome-wide association mapping of salinity tolerance in rice (Oryza sativa).水稻(Oryza sativa)耐盐性的全基因组关联图谱分析
DNA Res. 2015 Apr;22(2):133-45. doi: 10.1093/dnares/dsu046. Epub 2015 Jan 27.
3
An improved 7K SNP array, the C7AIR, provides a wealth of validated SNP markers for rice breeding and genetics studies.改良后的 7K SNP 芯片 C7AIR 为水稻育种和遗传学研究提供了丰富的经验证的 SNP 标记。
PLoS One. 2020 May 14;15(5):e0232479. doi: 10.1371/journal.pone.0232479. eCollection 2020.
4
Phenotypic variation and genome-wide association studies of main culm panicle node number, maximum node production rate, and degree-days to heading in rice.水稻主茎穗颈节数、最大穗颈节间伸长速率和生育天数的表型变异及全基因组关联研究。
BMC Genomics. 2022 May 23;23(1):390. doi: 10.1186/s12864-022-08629-y.
5
The genetic diversity and structure of indica rice in China as detected by single nucleotide polymorphism analysis.通过单核苷酸多态性分析检测中国籼稻的遗传多样性和结构
BMC Genet. 2016 Mar 15;17:53. doi: 10.1186/s12863-016-0361-x.
6
[Single nucleotide polymorphism (SNP) and its application in rice].[单核苷酸多态性(SNP)及其在水稻中的应用]
Yi Chuan. 2006 Jun;28(6):737-44.
7
Genomic regions involved in yield potential detected by genome-wide association analysis in Japanese high-yielding rice cultivars.通过全基因组关联分析在日本高产水稻品种中检测到的与产量潜力相关的基因组区域。
BMC Genomics. 2014 May 8;15(1):346. doi: 10.1186/1471-2164-15-346.
8
Genome-wide and SNP network analyses reveal genetic control of spikelet sterility and yield-related traits in wheat.全基因组和 SNP 网络分析揭示了小麦小穗不育和产量相关性状的遗传控制。
Sci Rep. 2020 Feb 7;10(1):2098. doi: 10.1038/s41598-020-59004-4.
9
A Comprehensive Image-based Phenomic Analysis Reveals the Complex Genetic Architecture of Shoot Growth Dynamics in Rice ().基于图像的综合性表型分析揭示了水稻 Shoot 生长动态的复杂遗传结构()。
Plant Genome. 2017 Jul;10(2). doi: 10.3835/plantgenome2016.07.0064.
10
Genetic dissection of ozone tolerance in rice (Oryza sativa L.) by a genome-wide association study.通过全基因组关联研究对水稻(Oryza sativa L.)耐臭氧性进行遗传剖析。
J Exp Bot. 2015 Jan;66(1):293-306. doi: 10.1093/jxb/eru419. Epub 2014 Nov 4.

引用本文的文献

1
Transformer Architecture and Attention Mechanisms in Genome Data Analysis: A Comprehensive Review.基因组数据分析中的Transformer架构与注意力机制:全面综述
Biology (Basel). 2023 Jul 22;12(7):1033. doi: 10.3390/biology12071033.

本文引用的文献

1
Carbon Starved Anther modulates sugar and ABA metabolism to protect rice seed germination and seedling fitness.缺碳花药通过调节糖和 ABA 代谢来保护水稻种子萌发和幼苗活力。
Plant Physiol. 2021 Dec 4;187(4):2405-2418. doi: 10.1093/plphys/kiab391.
2
Genetic risk factors for colorectal cancer in multiethnic Indonesians.多民族印度尼西亚人群结直肠癌的遗传风险因素。
Sci Rep. 2021 May 11;11(1):9988. doi: 10.1038/s41598-021-88805-4.
3
Recent Insights into Anthocyanin Pigmentation, Synthesis, Trafficking, and Regulatory Mechanisms in Rice ( L.) Caryopsis.
水稻颖果中花青素色素沉着、合成、运输及调控机制的最新见解
Biomolecules. 2021 Mar 7;11(3):394. doi: 10.3390/biom11030394.
4
Identification of Novel Genomic Associations and Gene Candidates for Grain Starch Content in Sorghum.鉴定高粱淀粉含量的新型基因组关联和候选基因。
Genes (Basel). 2020 Dec 2;11(12):1448. doi: 10.3390/genes11121448.
5
A genome-wide association study reveals the quantitative trait locus and candidate genes that regulate phosphate efficiency in a Vietnamese rice collection.一项全基因组关联研究揭示了越南水稻品种中调控磷效率的数量性状位点和候选基因。
Physiol Mol Biol Plants. 2020 Nov;26(11):2267-2281. doi: 10.1007/s12298-020-00902-2. Epub 2020 Oct 30.
6
Identification of a candidate gene associated with isoflavone content in soybean seeds using genome-wide association and linkage mapping.利用全基因组关联和连锁作图鉴定与大豆种子异黄酮含量相关的候选基因。
Plant J. 2020 Nov;104(4):950-963. doi: 10.1111/tpj.14972. Epub 2020 Sep 22.
7
Genome-wide association study reveals the genetic basis of fiber quality traits in upland cotton (Gossypium hirsutum L.).全基因组关联研究揭示了陆地棉(Gossypium hirsutum L.)纤维品质性状的遗传基础。
BMC Plant Biol. 2020 Aug 27;20(1):395. doi: 10.1186/s12870-020-02611-0.
8
Phenotypic and genetic variation in phosphorus-deficiency-tolerance traits in Chinese wheat landraces.中国小麦地方品种耐磷特性的表型和遗传变异。
BMC Plant Biol. 2020 Jul 13;20(1):330. doi: 10.1186/s12870-020-02492-3.
9
New quantitative trait locus (QTLs) and candidate genes associated with the grape berry color trait identified based on a high-density genetic map.基于高密度遗传图谱鉴定与葡萄浆果颜色性状相关的新数量性状位点(QTLs)和候选基因。
BMC Plant Biol. 2020 Jun 30;20(1):302. doi: 10.1186/s12870-020-02517-x.
10
Natural Variation Uncovers Candidate Genes for Barley Spikelet Number and Grain Yield under Drought Stress.自然变异揭示了大麦小穗数和耐旱性下籽粒产量的候选基因。
Genes (Basel). 2020 May 11;11(5):533. doi: 10.3390/genes11050533.