• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

同源基因引导的高度杂合作物组装:创建参考转录组以揭示黑麦草中的遗传多样性。

Orthology Guided Assembly in highly heterozygous crops: creating a reference transcriptome to uncover genetic diversity in Lolium perenne.

机构信息

Plant Sciences Unit--Growth and Development, Institute for Agricultural and Fisheries Research-ILVO, Melle, Belgium.

出版信息

Plant Biotechnol J. 2013 Jun;11(5):605-17. doi: 10.1111/pbi.12051. Epub 2013 Feb 21.

DOI:10.1111/pbi.12051
PMID:23433242
Abstract

Despite current advances in next-generation sequencing data analysis procedures, de novo assembly of a reference sequence required for SNP discovery and expression analysis is still a major challenge in genetically uncharacterized, highly heterozygous species. High levels of polymorphism inherent to outbreeding crop species hamper De Bruijn Graph-based de novo assembly algorithms, causing transcript fragmentation and the redundant assembly of allelic contigs. If multiple genotypes are sequenced to study genetic diversity, primary de novo assembly is best performed per genotype to limit the level of polymorphism and avoid transcript fragmentation. Here, we propose an Orthology Guided Assembly procedure that first uses sequence similarity (tBLASTn) to proteins of a model species to select allelic and fragmented contigs from all genotypes and then performs CAP3 clustering on a gene-by-gene basis. Thus, we simultaneously annotate putative orthologues for each protein of the model species, resolve allelic redundancy and fragmentation and create a de novo transcript sequence representing the consensus of all alleles present in the sequenced genotypes. We demonstrate the procedure using RNA-seq data from 14 genotypes of Lolium perenne to generate a reference transcriptome for gene discovery and translational research, to reveal the transcriptome-wide distribution and density of SNPs in an outbreeding crop and to illustrate the effect of polymorphisms on the assembly procedure. The results presented here illustrate that constructing a non-redundant reference sequence is essential for comparative genomics, orthology-based annotation and candidate gene selection but also for read mapping and subsequent polymorphism discovery and/or read count-based gene expression analysis.

摘要

尽管目前在下一代测序数据分析程序方面取得了进展,但对于 SNP 发现和表达分析而言,从头组装参考序列仍然是一个具有挑战性的问题,尤其是在那些遗传特征尚未明确且高度杂合的物种中。异花授粉作物物种固有的高水平多态性会阻碍基于 De Bruijn 图的从头组装算法,导致转录本碎片化和等位基因序列的冗余组装。如果要对多个基因型进行测序以研究遗传多样性,最好针对每个基因型进行初步从头组装,以限制多态性水平并避免转录本碎片化。在这里,我们提出了一种同源物指导组装程序,该程序首先使用序列相似性(tBLASTn)对模型物种的蛋白质进行搜索,以从所有基因型中选择等位基因和碎片化的序列,然后根据基因进行 CAP3 聚类。这样,我们可以同时为模型物种的每个蛋白质注释假定的同源物,解决等位基因冗余和碎片化问题,并创建一个代表测序基因型中所有等位基因共识的从头转录序列。我们使用 14 个 Lolium perenne 基因型的 RNA-seq 数据来演示该程序,以生成一个参考转录组,用于基因发现和转化研究,揭示异花授粉作物中转录组范围内 SNP 的分布和密度,并说明多态性对组装程序的影响。这里呈现的结果表明,构建一个非冗余的参考序列对于比较基因组学、基于同源物的注释和候选基因选择至关重要,但对于读取映射以及随后的多态性发现和/或基于读取计数的基因表达分析也是如此。

相似文献

1
Orthology Guided Assembly in highly heterozygous crops: creating a reference transcriptome to uncover genetic diversity in Lolium perenne.同源基因引导的高度杂合作物组装:创建参考转录组以揭示黑麦草中的遗传多样性。
Plant Biotechnol J. 2013 Jun;11(5):605-17. doi: 10.1111/pbi.12051. Epub 2013 Feb 21.
2
Coverage-based consensus calling (CbCC) of short sequence reads and comparison of CbCC results to identify SNPs in chickpea (Cicer arietinum; Fabaceae), a crop species without a reference genome.基于覆盖度的短序列读取共识调用(CbCC),并将 CbCC 结果与 SNP 进行比较,以鉴定无参考基因组的作物豌豆(Cicer arietinum;豆科)。
Am J Bot. 2012 Feb;99(2):186-92. doi: 10.3732/ajb.1100419. Epub 2012 Feb 1.
3
De novo assembly of the perennial ryegrass transcriptome using an RNA-Seq strategy.利用RNA测序策略对多年生黑麦草转录组进行从头组装。
PLoS One. 2014 Aug 15;9(8):e103567. doi: 10.1371/journal.pone.0103567. eCollection 2014.
4
Challenges and advances for transcriptome assembly in non-model species.非模式物种转录组组装面临的挑战与进展
PLoS One. 2017 Sep 20;12(9):e0185020. doi: 10.1371/journal.pone.0185020. eCollection 2017.
5
Comparative performance of transcriptome assembly methods for non-model organisms.非模式生物转录组组装方法的比较性能
BMC Genomics. 2016 Jul 27;17:523. doi: 10.1186/s12864-016-2923-8.
6
Large-scale transcriptome characterization and mass discovery of SNPs in globe artichoke and its related taxa.大规模转录组特征分析和大规模 SNP 发现于朝鲜蓟及其相关分类群。
Plant Biotechnol J. 2012 Oct;10(8):956-69. doi: 10.1111/j.1467-7652.2012.00725.x. Epub 2012 Jul 31.
7
Efficient assembly and annotation of the transcriptome of catfish by RNA-Seq analysis of a doubled haploid homozygote.通过对双倍单倍体纯合子的 RNA-Seq 分析,高效组装和注释鲶鱼转录组。
BMC Genomics. 2012 Nov 5;13:595. doi: 10.1186/1471-2164-13-595.
8
De novo transcriptome assembly and polymorphism detection in the flowering plant Silene vulgaris (Caryophyllaceae).在石竹科植物普通矢车菊(Silene vulgaris)中从头转录组组装和多态性检测。
Mol Ecol Resour. 2012 Mar;12(2):333-43. doi: 10.1111/j.1755-0998.2011.03079.x. Epub 2011 Oct 17.
9
Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C(3) and C(4) species.非模式生物 mRNA-Seq 数据组装策略的评价及下一代测序在 C(3)和 C(4)物种比较中的应用。
J Exp Bot. 2011 May;62(9):3093-102. doi: 10.1093/jxb/err029. Epub 2011 Mar 11.
10
Reference-free transcriptome assembly in non-model animals from next-generation sequencing data.无参考转录组组装在非模式动物从下一代测序数据。
Mol Ecol Resour. 2012 Sep;12(5):834-45. doi: 10.1111/j.1755-0998.2012.03148.x. Epub 2012 Apr 30.

引用本文的文献

1
Utilization of Tissue Ploidy Level Variation in Transcriptome Assembly of .利用组织倍性水平变化进行. 的转录组组装。
G3 (Bethesda). 2019 Oct 7;9(10):3409-3421. doi: 10.1534/g3.119.400357.
2
Overcoming challenges in variant calling: exploring sequence diversity in candidate genes for plant development in perennial ryegrass (Lolium perenne).克服变异调用中的挑战:探索多年生黑麦草(Lolium perenne)发育候选基因中的序列多样性。
DNA Res. 2019 Feb 1;26(1):1-12. doi: 10.1093/dnares/dsy033.
3
Single-Copy Genes as Molecular Markers for Phylogenomic Studies in Seed Plants.
单拷贝基因作为种子植物系统基因组学研究的分子标记
Genome Biol Evol. 2017 May 1;9(5):1130-1147. doi: 10.1093/gbe/evx070.
4
Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut.通过使用相关物种的参考基因组改进转录组从头组装:从油棕到椰子的转化基因组学。
PLoS One. 2017 Mar 23;12(3):e0173300. doi: 10.1371/journal.pone.0173300. eCollection 2017.
5
An ultra-high density genetic linkage map of perennial ryegrass (Lolium perenne) using genotyping by sequencing (GBS) based on a reference shotgun genome assembly.基于参考鸟枪法基因组组装,利用简化基因组测序(GBS)构建的多年生黑麦草(Lolium perenne)超高密度遗传连锁图谱。
Ann Bot. 2016 Jul;118(1):71-87. doi: 10.1093/aob/mcw081. Epub 2016 Jun 6.
6
In Silico Identification of Candidate Genes for Fertility Restoration in Cytoplasmic Male Sterile Perennial Ryegrass (Lolium perenne L.).细胞质雄性不育多年生黑麦草(Lolium perenne L.)育性恢复候选基因的计算机鉴定
Genome Biol Evol. 2017 Feb 1;9(2):351-362. doi: 10.1093/gbe/evw047.
7
Genetic-geographic correlation revealed across a broad European ecotypic sample of perennial ryegrass (Lolium perenne) using array-based SNP genotyping.利用基于芯片的单核苷酸多态性(SNP)基因分型技术,在广泛的欧洲多年生黑麦草(Lolium perenne)生态型样本中揭示了遗传-地理相关性。
Theor Appl Genet. 2015 Oct;128(10):1917-32. doi: 10.1007/s00122-015-2556-3. Epub 2015 Jun 21.
8
Using RNA-Seq to assemble a rose transcriptome with more than 13,000 full-length expressed genes and to develop the WagRhSNP 68k Axiom SNP array for rose (Rosa L.).利用RNA测序技术组装一个拥有超过13000个全长表达基因的玫瑰转录组,并开发用于玫瑰(蔷薇属)的WagRhSNP 68k公理单核苷酸多态性阵列。
Front Plant Sci. 2015 Apr 21;6:249. doi: 10.3389/fpls.2015.00249. eCollection 2015.
9
Comparative transcriptome analysis within the Lolium/Festuca species complex reveals high sequence conservation.黑麦草/羊茅物种复合体中的比较转录组分析揭示了高度的序列保守性。
BMC Genomics. 2015 Mar 28;16(1):249. doi: 10.1186/s12864-015-1447-y.
10
De novo assembly of the perennial ryegrass transcriptome using an RNA-Seq strategy.利用RNA测序策略对多年生黑麦草转录组进行从头组装。
PLoS One. 2014 Aug 15;9(8):e103567. doi: 10.1371/journal.pone.0103567. eCollection 2014.