• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 RNA-seq 数据比对到单个参考基因组得到的等位基因表达特异性测量中的偏倚来源。

Sources of bias in measures of allele-specific expression derived from RNA-sequence data aligned to a single reference genome.

机构信息

Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.

出版信息

BMC Genomics. 2013 Aug 7;14:536. doi: 10.1186/1471-2164-14-536.

DOI:10.1186/1471-2164-14-536
PMID:23919664
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3751238/
Abstract

BACKGROUND

RNA-seq can be used to measure allele-specific expression (ASE) by assigning sequence reads to individual alleles; however, relative ASE is systematically biased when sequence reads are aligned to a single reference genome. Aligning sequence reads to both parental genomes can eliminate this bias, but this approach is not always practical, especially for non-model organisms. To improve accuracy of ASE measured using a single reference genome, we identified properties of differentiating sites responsible for biased measures of relative ASE.

RESULTS

We found that clusters of differentiating sites prevented sequence reads from an alternate allele from aligning to the reference genome, causing a bias in relative ASE favoring the reference allele. This bias increased with greater sequence divergence between alleles. Increasing the number of mismatches allowed when aligning sequence reads to the reference genome and restricting analysis to genomic regions with fewer differentiating sites than the number of mismatches allowed almost completely eliminated this systematic bias. Accuracy of allelic abundance was increased further by excluding differentiating sites within sequence reads that could not be aligned uniquely within the genome (imperfect mappability) and reads that overlapped one or more insertions or deletions (indels) between alleles.

CONCLUSIONS

After aligning sequence reads to a single reference genome, excluding differentiating sites with at least as many neighboring differentiating sites as the number of mismatches allowed, imperfect mappability, and/or an indel(s) nearby resulted in measures of allelic abundance comparable to those derived from aligning sequence reads to both parental genomes.

摘要

背景

RNA-seq 可通过将序列读取分配给个体等位基因来测量等位基因特异性表达(ASE);然而,当序列读取与单个参考基因组对齐时,相对 ASE 会受到系统偏差的影响。将序列读取对齐到两个亲本基因组可以消除这种偏差,但这种方法并不总是实用的,尤其是对于非模式生物。为了提高使用单个参考基因组测量的 ASE 的准确性,我们确定了导致相对 ASE 偏倚的区分位点的特性。

结果

我们发现区分位点簇阻止了交替等位基因的序列读取与参考基因组对齐,从而导致参考等位基因的相对 ASE 产生偏差。这种偏差随着等位基因之间的序列差异增大而增加。增加允许在参考基因组上对齐序列读取时的错配数,并将分析限制在允许错配数少于区分位点数的基因组区域,几乎可以完全消除这种系统偏差。通过排除无法在基因组内唯一对齐的序列读取内的区分位点(不完全可映射性)以及与等位基因之间的一个或多个插入或缺失(indel)重叠的读取,进一步提高了等位基因丰度的准确性。

结论

在将序列读取与单个参考基因组对齐后,排除具有与允许的错配数一样多的相邻区分位点、不完全可映射性和/或附近的 indel(s) 的区分位点,可得到与从将序列读取对齐到两个亲本基因组相比可比的等位基因丰度测量值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f61/3751238/51effffb5874/1471-2164-14-536-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f61/3751238/6dbdbd8de7f4/1471-2164-14-536-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f61/3751238/ae9774c389ac/1471-2164-14-536-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f61/3751238/fda8d1595433/1471-2164-14-536-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f61/3751238/0289980a23a4/1471-2164-14-536-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f61/3751238/c867fadd9e6d/1471-2164-14-536-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f61/3751238/51effffb5874/1471-2164-14-536-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f61/3751238/6dbdbd8de7f4/1471-2164-14-536-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f61/3751238/ae9774c389ac/1471-2164-14-536-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f61/3751238/fda8d1595433/1471-2164-14-536-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f61/3751238/0289980a23a4/1471-2164-14-536-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f61/3751238/c867fadd9e6d/1471-2164-14-536-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f61/3751238/51effffb5874/1471-2164-14-536-6.jpg

相似文献

1
Sources of bias in measures of allele-specific expression derived from RNA-sequence data aligned to a single reference genome.基于 RNA-seq 数据比对到单个参考基因组得到的等位基因表达特异性测量中的偏倚来源。
BMC Genomics. 2013 Aug 7;14:536. doi: 10.1186/1471-2164-14-536.
2
Hierarchical analysis of RNA-seq reads improves the accuracy of allele-specific expression.基于层次分析的 RNA-seq 测序reads 提高了等位基因特异性表达的准确性。
Bioinformatics. 2018 Jul 1;34(13):2177-2184. doi: 10.1093/bioinformatics/bty078.
3
Comparing computational methods for identification of allele-specific expression based on next generation sequencing data.基于下一代测序数据比较用于鉴定等位基因特异性表达的计算方法。
Genet Epidemiol. 2014 Nov;38(7):591-8. doi: 10.1002/gepi.21846. Epub 2014 Sep 2.
4
Extensive variation between tissues in allele specific expression in an outbred mammal.远交哺乳动物中各组织间等位基因特异性表达存在广泛差异。
BMC Genomics. 2015 Nov 23;16:993. doi: 10.1186/s12864-015-2174-0.
5
A new strategy to reduce allelic bias in RNA-Seq readmapping.一种减少 RNA-Seq 读段比对中等位基因偏倚的新策略。
Nucleic Acids Res. 2012 Sep;40(16):e127. doi: 10.1093/nar/gks425. Epub 2012 May 14.
6
Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data.RNA-seq 数据中检测等位基因特异性表达的读映射偏倚效应。
Bioinformatics. 2009 Dec 15;25(24):3207-12. doi: 10.1093/bioinformatics/btp579. Epub 2009 Oct 6.
7
Estimates of allele-specific expression in Drosophila with a single genome sequence and RNA-seq data.使用单个基因组序列和 RNA-seq 数据估计果蝇中的等位基因特异性表达。
Bioinformatics. 2014 Sep 15;30(18):2603-10. doi: 10.1093/bioinformatics/btu342. Epub 2014 May 19.
8
Zea mays RNA-seq estimated transcript abundances are strongly affected by read mapping bias.玉米RNA测序估计的转录本丰度受读段比对偏差的强烈影响。
BMC Genomics. 2021 Apr 20;22(1):285. doi: 10.1186/s12864-021-07577-3.
9
ASElux: an ultra-fast and accurate allelic reads counter.ASElux:一种超快速且准确的等位基因读取计数器。
Bioinformatics. 2018 Apr 15;34(8):1313-1320. doi: 10.1093/bioinformatics/btx762.
10
AlleleSeq: analysis of allele-specific expression and binding in a network framework.AlleleSeq:在网络框架中分析等位基因特异性表达和结合。
Mol Syst Biol. 2011 Aug 2;7:522. doi: 10.1038/msb.2011.54.

引用本文的文献

1
A mouse organoid platform for modeling cerebral cortex development and cis-regulatory evolution in vitro.一种用于在体外模拟大脑皮质发育和顺式调控进化的小鼠类器官平台。
Dev Cell. 2025 Aug 25. doi: 10.1016/j.devcel.2025.08.001.
2
Adaptive Evolution and Transcriptomic Specialization of P450 Detoxification Genes in the Colorado Potato Beetle Across Developmental Stages and Tissues.科罗拉多马铃薯甲虫不同发育阶段和组织中P450解毒基因的适应性进化与转录组特化
Insects. 2025 Jun 9;16(6):608. doi: 10.3390/insects16060608.
3
Extreme heterochiasmy and high rates of sex-reversed recombination result in large yet homomorphic sex chromosomes in the Emei moustache toad.

本文引用的文献

1
Alternative strategies for development of a reference transcriptome for quantification of allele specific expression in organisms having sparse genomic resources.用于开发参考转录组的替代策略,以定量具有稀疏基因组资源的生物体中等位基因特异性表达。
Comp Biochem Physiol Part D Genomics Proteomics. 2013 Mar;8(1):11-6. doi: 10.1016/j.cbd.2012.10.006. Epub 2012 Nov 9.
2
Genomic imprinting absent in Drosophila melanogaster adult females.果蝇成年雌体中不存在基因组印记。
Cell Rep. 2012 Jul 26;2(1):69-75. doi: 10.1016/j.celrep.2012.06.013. Epub 2012 Jul 20.
3
A new strategy to reduce allelic bias in RNA-Seq readmapping.
极端的异态性和高频率的性反转重组导致峨眉髭蟾拥有大而形态相似的性染色体。
Genome Res. 2025 Jun 2;35(6):1325-1336. doi: 10.1101/gr.280161.124.
4
Fat-tail allele-specific expression genes may affect fat deposition in tail of sheep.脂尾等位基因特异性表达基因可能影响绵羊尾部脂肪沉积。
PLoS One. 2024 Dec 27;19(12):e0316046. doi: 10.1371/journal.pone.0316046. eCollection 2024.
5
The rapid, mass invasion of New Zealand by North American "".北美“”对新西兰的迅速大规模入侵。 你提供的原文似乎不完整,“北美‘’”这里内容缺失。
Limnol Oceanogr. 2021 Jul;66(7):2672-2683. doi: 10.1002/lno.11780. Epub 2021 Apr 26.
6
Ornaments for efficient allele-specific expression estimation with bias correction.用于高效等位基因特异性表达估计的偏倚校正修饰。
Am J Hum Genet. 2024 Aug 8;111(8):1770-1781. doi: 10.1016/j.ajhg.2024.06.014. Epub 2024 Jul 23.
7
STAR+WASP reduces reference bias in the allele-specific mapping of RNA-seq reads.STAR+WASP可减少RNA测序读数等位基因特异性映射中的参考偏差。
bioRxiv. 2024 Feb 5:2024.01.21.576391. doi: 10.1101/2024.01.21.576391.
8
Allele biased transcription factor binding across human brain regions gives mechanistic insight into eQTLs.全人类大脑区域中偏向性等位基因的转录因子结合为基因表达数量性状位点提供了机制性见解。
bioRxiv. 2023 Oct 9:2023.10.06.561245. doi: 10.1101/2023.10.06.561245.
9
Divergence of cochlear transcriptomics between reference‑based and reference‑free transcriptome analyses among Rhinolophus ferrumequinum populations.基于参考和无参考转录组分析的菊头蝠属不同种群耳蜗转录组的差异。
PLoS One. 2023 Jul 11;18(7):e0288404. doi: 10.1371/journal.pone.0288404. eCollection 2023.
10
Favoring Expression of Yak Alleles in Interspecies F1 Hybrids of Cattle and Yak Under High-Altitude Environments.高海拔环境下牦牛等位基因在牛和牦牛种间F1杂种中的优势表达
Front Vet Sci. 2022 Jun 30;9:892663. doi: 10.3389/fvets.2022.892663. eCollection 2022.
一种减少 RNA-Seq 读段比对中等位基因偏倚的新策略。
Nucleic Acids Res. 2012 Sep;40(16):e127. doi: 10.1093/nar/gks425. Epub 2012 May 14.
4
Critical evaluation of imprinted gene expression by RNA-Seq: a new perspective.通过 RNA-Seq 对印迹基因表达进行批判性评估:一个新视角。
PLoS Genet. 2012;8(3):e1002600. doi: 10.1371/journal.pgen.1002600. Epub 2012 Mar 29.
5
Allelic imbalance in Drosophila hybrid heads: exons, isoforms, and evolution.果蝇杂种头部的等位基因失衡:外显子、异构体和进化。
Mol Biol Evol. 2012 Jun;29(6):1521-32. doi: 10.1093/molbev/msr318. Epub 2012 Jan 7.
6
The Drosophila melanogaster Genetic Reference Panel.黑腹果蝇遗传参考面板。
Nature. 2012 Feb 8;482(7384):173-8. doi: 10.1038/nature10811.
7
Fast computation and applications of genome mappability.基因组可及性的快速计算与应用。
PLoS One. 2012;7(1):e30377. doi: 10.1371/journal.pone.0030377. Epub 2012 Jan 19.
8
The evolution of gene expression levels in mammalian organs.哺乳动物器官中基因表达水平的演变。
Nature. 2011 Oct 19;478(7369):343-8. doi: 10.1038/nature10532.
9
Mapping personal functional data to personal genomes.将个人功能数据映射到个人基因组上。
Bioinformatics. 2011 Dec 15;27(24):3427-9. doi: 10.1093/bioinformatics/btr578. Epub 2011 Oct 17.
10
A powerful and flexible statistical framework for testing hypotheses of allele-specific gene expression from RNA-seq data.用于从 RNA-seq 数据中测试等位基因特异性基因表达假设的强大而灵活的统计框架。
Genome Res. 2011 Oct;21(10):1728-37. doi: 10.1101/gr.119784.110. Epub 2011 Aug 26.