• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

玉米RNA测序估计的转录本丰度受读段比对偏差的强烈影响。

Zea mays RNA-seq estimated transcript abundances are strongly affected by read mapping bias.

作者信息

Zhan Shuhua, Griswold Cortland, Lukens Lewis

机构信息

Department of Plant Agriculture, University of Guelph, Guelph, Ontario, Canada.

Department of Integrative Biology, University of Guelph, Guelph, Ontario, Canada.

出版信息

BMC Genomics. 2021 Apr 20;22(1):285. doi: 10.1186/s12864-021-07577-3.

DOI:10.1186/s12864-021-07577-3
PMID:33874908
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8056621/
Abstract

BACKGROUND

Genetic variation for gene expression is a source of phenotypic variation for natural and agricultural species. The common approach to map and to quantify gene expression from genetically distinct individuals is to assign their RNA-seq reads to a single reference genome. However, RNA-seq reads from alleles dissimilar to this reference genome may fail to map correctly, causing transcript levels to be underestimated. Presently, the extent of this mapping problem is not clear, particularly in highly diverse species. We investigated if mapping bias occurred and if chromosomal features associated with mapping bias. Zea mays presents a model species to assess these questions, given it has genotypically distinct and well-studied genetic lines.

RESULTS

In Zea mays, the inbred B73 genome is the standard reference genome and template for RNA-seq read assignments. In the absence of mapping bias, B73 and a second inbred line, Mo17, would each have an approximately equal number of regulatory alleles that increase gene expression. Remarkably, Mo17 had 2-4 times fewer such positively acting alleles than did B73 when RNA-seq reads were aligned to the B73 reference genome. Reciprocally, over one-half of the B73 alleles that increased gene expression were not detected when reads were aligned to the Mo17 genome template. Genes at dissimilar chromosomal ends were strongly affected by mapping bias, and genes at more similar pericentromeric regions were less affected. Biased transcript estimates were higher in untranslated regions and lower in splice junctions. Bias occurred across software and alignment parameters.

CONCLUSIONS

Mapping bias very strongly affects gene transcript abundance estimates in maize, and bias varies across chromosomal features. Individual genome or transcriptome templates are likely necessary for accurate transcript estimation across genetically variable individuals in maize and other species.

摘要

背景

基因表达的遗传变异是自然物种和农业物种表型变异的一个来源。将来自基因不同个体的RNA测序读数定位并定量基因表达的常用方法是将它们的RNA测序读数分配到单个参考基因组。然而,与该参考基因组不同的等位基因的RNA测序读数可能无法正确定位,导致转录水平被低估。目前,这种定位问题的程度尚不清楚,尤其是在高度多样化的物种中。鉴于玉米具有基因型不同且经过充分研究的遗传系,我们研究了是否存在定位偏差以及是否存在与定位偏差相关的染色体特征。

结果

在玉米中,自交系B73基因组是RNA测序读数分配的标准参考基因组和模板。在不存在定位偏差的情况下,B73和另一个自交系Mo17各自具有数量大致相等的增加基因表达的调控等位基因。值得注意的是,当RNA测序读数与B73参考基因组比对时,Mo17中此类正向作用等位基因的数量比B73少2至4倍。相反,当读数与Mo17基因组模板比对时,超过一半的增加基因表达的B73等位基因未被检测到。位于不同染色体末端的基因受定位偏差的影响很大,而位于更相似的着丝粒周围区域的基因受影响较小。有偏差的转录本估计在非翻译区域较高,在剪接连接处较低。偏差在不同软件和比对参数中均存在。

结论

定位偏差对玉米基因转录本丰度估计有非常强烈的影响,并且偏差因染色体特征而异。对于准确估计玉米和其他物种中基因可变个体的转录本,可能需要个体基因组或转录组模板。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f50f/8056621/61baa0449bf4/12864_2021_7577_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f50f/8056621/759befe02cf2/12864_2021_7577_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f50f/8056621/34d0a71ce200/12864_2021_7577_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f50f/8056621/de8dda3f820e/12864_2021_7577_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f50f/8056621/61baa0449bf4/12864_2021_7577_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f50f/8056621/759befe02cf2/12864_2021_7577_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f50f/8056621/34d0a71ce200/12864_2021_7577_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f50f/8056621/de8dda3f820e/12864_2021_7577_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f50f/8056621/61baa0449bf4/12864_2021_7577_Fig4_HTML.jpg

相似文献

1
Zea mays RNA-seq estimated transcript abundances are strongly affected by read mapping bias.玉米RNA测序估计的转录本丰度受读段比对偏差的强烈影响。
BMC Genomics. 2021 Apr 20;22(1):285. doi: 10.1186/s12864-021-07577-3.
2
Variation in leaf transcriptome responses to elevated ozone corresponds with physiological sensitivity to ozone across maize inbred lines.叶片转录组对臭氧升高的反应变化与玉米自交系对臭氧的生理敏感性相对应。
Genetics. 2022 Jul 30;221(4). doi: 10.1093/genetics/iyac080.
3
Maize (Zea mays L.) genome diversity as revealed by RNA-sequencing.利用 RNA 测序揭示玉米(Zea mays L.)基因组多样性。
PLoS One. 2012;7(3):e33071. doi: 10.1371/journal.pone.0033071. Epub 2012 Mar 16.
4
RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.RSEM:有或无参考基因组的 RNA-Seq 数据的准确转录本定量。
BMC Bioinformatics. 2011 Aug 4;12:323. doi: 10.1186/1471-2105-12-323.
5
Genome-wide analysis of alternative splicing in Zea mays: landscape and genetic regulation.玉米中可变剪接的全基因组分析:格局与遗传调控
Plant Cell. 2014 Sep;26(9):3472-87. doi: 10.1105/tpc.114.130773. Epub 2014 Sep 23.
6
Natural variation for alleles under epigenetic control by the maize chromomethylase zmet2.玉米染色体甲基转移酶zmet2表观遗传控制下的等位基因自然变异。
Genetics. 2007 Oct;177(2):749-60. doi: 10.1534/genetics.107.072702. Epub 2007 Jul 29.
7
RNA-Seq gene expression estimation with read mapping uncertainty.基于读段比对不确定性的 RNA-Seq 基因表达估计。
Bioinformatics. 2010 Feb 15;26(4):493-500. doi: 10.1093/bioinformatics/btp692. Epub 2009 Dec 18.
8
Ornaments for efficient allele-specific expression estimation with bias correction.用于高效等位基因特异性表达估计的偏倚校正修饰。
Am J Hum Genet. 2024 Aug 8;111(8):1770-1781. doi: 10.1016/j.ajhg.2024.06.014. Epub 2024 Jul 23.
9
Mendelian and non-Mendelian regulation of gene expression in maize.玉米中基因表达的孟德尔和非孟德尔调控。
PLoS Genet. 2013;9(1):e1003202. doi: 10.1371/journal.pgen.1003202. Epub 2013 Jan 17.
10
Complementation contributes to transcriptome complexity in maize (Zea mays L.) hybrids relative to their inbred parents.互补作用导致玉米(Zea mays L.)杂种相对于其自交系亲本的转录组复杂性增加。
Genome Res. 2012 Dec;22(12):2445-54. doi: 10.1101/gr.138461.112. Epub 2012 Oct 19.

引用本文的文献

1
Introgressions lead to reference bias in wheat RNA-seq analysis.基因渗入导致小麦 RNA-seq 分析中的参考偏倚。
BMC Biol. 2024 Mar 7;22(1):56. doi: 10.1186/s12915-024-01853-w.
2
Divergence of cochlear transcriptomics between reference‑based and reference‑free transcriptome analyses among Rhinolophus ferrumequinum populations.基于参考和无参考转录组分析的菊头蝠属不同种群耳蜗转录组的差异。
PLoS One. 2023 Jul 11;18(7):e0288404. doi: 10.1371/journal.pone.0288404. eCollection 2023.

本文引用的文献

1
Evidence for the Accumulation of Nonsynonymous Mutations and Favorable Pleiotropic Alleles During Wheat Breeding.小麦育种过程中非同义突变和有利多效性等位基因积累的证据
G3 (Bethesda). 2020 Nov 5;10(11):4001-4011. doi: 10.1534/g3.120.401269.
2
European maize genomes highlight intraspecies variation in repeat and gene content.欧洲玉米基因组突出了种内重复序列和基因组成的变异。
Nat Genet. 2020 Sep;52(9):950-957. doi: 10.1038/s41588-020-0671-9. Epub 2020 Jul 27.
3
Major Impacts of Widespread Structural Variation on Gene Expression and Crop Improvement in Tomato.
广泛的结构变异对番茄基因表达和作物改良的主要影响。
Cell. 2020 Jul 9;182(1):145-161.e23. doi: 10.1016/j.cell.2020.05.021. Epub 2020 Jun 17.
4
Tools and Strategies for Long-Read Sequencing and De Novo Assembly of Plant Genomes.长读测序和植物基因组从头组装的工具和策略。
Trends Plant Sci. 2019 Aug;24(8):700-724. doi: 10.1016/j.tplants.2019.05.003. Epub 2019 Jun 14.
5
Connecting genome structural variation with complex traits in crop plants.将作物基因组结构变异与复杂性状联系起来。
Theor Appl Genet. 2019 Mar;132(3):733-750. doi: 10.1007/s00122-018-3233-0. Epub 2018 Nov 17.
6
Shared and genetically distinct Zea mays transcriptome responses to ongoing and past low temperature exposure.持续和过去低温暴露下玉米共享且具有遗传差异的转录组响应。
BMC Genomics. 2018 Oct 20;19(1):761. doi: 10.1186/s12864-018-5134-7.
7
Extensive intraspecific gene order and gene structural variations between Mo17 and other maize genomes.Mo17 与其他玉米基因组之间广泛的种内基因顺序和基因结构变异。
Nat Genet. 2018 Sep;50(9):1289-1295. doi: 10.1038/s41588-018-0182-0. Epub 2018 Jul 30.
8
Long reads: their purpose and place.长读序列:它们的用途和位置。
Hum Mol Genet. 2018 Aug 1;27(R2):R234-R241. doi: 10.1093/hmg/ddy177.
9
INDEL variation in the regulatory region of the major flowering time gene LanFTc1 is associated with vernalization response and flowering time in narrow-leafed lupin (Lupinus angustifolius L.).INDEL 变异在主要开花时间基因 LanFTc1 的调控区域中与冬化反应和窄叶羽扇豆( Lupinus angustifolius L. )的开花时间有关。
Plant Cell Environ. 2019 Jan;42(1):174-187. doi: 10.1111/pce.13320. Epub 2018 May 23.
10
Construction of the third-generation Zea mays haplotype map.第三代玉米单倍型图谱的构建。
Gigascience. 2018 Apr 1;7(4):1-12. doi: 10.1093/gigascience/gix134.