Suppr超能文献

基于转录组的外显子捕获在多个分歧尺度上对蛙类系统基因组学的评估(纲:两栖纲,目:无尾目)。

An evaluation of transcriptome-based exon capture for frog phylogenomics across multiple scales of divergence (Class: Amphibia, Order: Anura).

作者信息

Portik Daniel M, Smith Lydia L, Bi Ke

机构信息

Museum of Vertebrate Zoology, University of California, Berkeley, CA, 94720, USA.

Department of Integrative Biology, University of California, Valley Life Sciences Building, Berkeley, CA, 94720, USA.

出版信息

Mol Ecol Resour. 2016 Sep;16(5):1069-83. doi: 10.1111/1755-0998.12541. Epub 2016 Jun 24.

Abstract

Custom sequence capture experiments are becoming an efficient approach for gathering large sets of orthologous markers in nonmodel organisms. Transcriptome-based exon capture utilizes transcript sequences to design capture probes, typically using a reference genome to identify intron-exon boundaries to exclude shorter exons (<200 bp). Here, we test directly using transcript sequences for probe design, which are often composed of multiple exons of varying lengths. Using 1260 orthologous transcripts, we conducted sequence captures across multiple phylogenetic scales for frogs, including outgroups 100 Myr divergent from the ingroup. We recovered a large phylogenomic data set consisting of sequence alignments for 1047 of the 1260 transcriptome-based loci (561 000 bp) and a large quantity of highly variable regions flanking the exons in transcripts (70 000 bp), the latter improving substantially by only including ingroup species (797 000 bp). We recovered both shorter (<100 bp) and longer exons (>200 bp), with no major reduction in coverage towards the ends of exons. We observed significant differences in the performance of blocking oligos for target enrichment and nontarget depletion during captures, and differences in PCR duplication rates resulting from the number of individuals pooled for capture reactions. We explicitly tested the effects of phylogenetic distance on capture sensitivity, specificity, and missing data, and provide a baseline estimate of expectations for these metrics based on a priori knowledge of nuclear pairwise differences among samples. We provide recommendations for transcriptome-based exon capture design based on our results, cost estimates and offer multiple pipelines for data assembly and analysis.

摘要

定制序列捕获实验正成为一种在非模式生物中收集大量直系同源标记的有效方法。基于转录组的外显子捕获利用转录本序列设计捕获探针,通常使用参考基因组来识别内含子 - 外显子边界以排除较短的外显子(<200 bp)。在这里,我们直接测试使用转录本序列进行探针设计,转录本通常由多个长度不同的外显子组成。我们使用1260个直系同源转录本,对青蛙进行了跨多个系统发育尺度的序列捕获,包括与内群分歧约1亿年的外类群。我们获得了一个大型系统发育基因组数据集,该数据集由1260个基于转录组的位点中的1047个的序列比对组成(约561000 bp),以及转录本中外显子侧翼的大量高变区(约70000 bp),仅纳入内群物种后,后者显著改善(约797000 bp)。我们回收了较短(<100 bp)和较长(>200 bp)的外显子,外显子末端的覆盖率没有大幅降低。我们观察到捕获过程中用于靶标富集和非靶标耗竭的封闭寡核苷酸性能存在显著差异,以及捕获反应中合并个体数量导致的PCR重复率差异。我们明确测试了系统发育距离对捕获敏感性、特异性和缺失数据的影响,并根据样本间核成对差异的先验知识为这些指标提供了预期的基线估计。基于我们的结果、成本估计,我们为基于转录组的外显子捕获设计提供了建议,并提供了多个数据组装和分析流程。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验