Hanna Zachary R, Henderson James B, Sellas Anna B, Fuchs Jérôme, Bowie Rauri C K, Dumbacher John P
Museum of Vertebrate Zoology, University of California, Berkeley, CA, United States of America.
Department of Integrative Biology, University of California, Berkeley, CA, United States of America.
PeerJ. 2017 Oct 10;5:e3901. doi: 10.7717/peerj.3901. eCollection 2017.
We report here the successful assembly of the complete mitochondrial genomes of the northern spotted owl () and the barred owl (). We utilized sequence data from two sequencing methodologies, Illumina paired-end sequence data with insert lengths ranging from approximately 250 nucleotides (nt) to 9,600 nt and read lengths from 100-375 nt and Sanger-derived sequences. We employed multiple assemblers and alignment methods to generate the final assemblies. The circular genomes of and are comprised of 19,948 nt and 18,975 nt, respectively. Both code for two rRNAs, twenty-two tRNAs, and thirteen polypeptides. They both have duplicated control region sequences with complex repeat structures. We were not able to assemble the control regions solely using Illumina paired-end sequence data. By fully spanning the control regions, Sanger-derived sequences enabled accurate and complete assembly of these mitochondrial genomes. These are the first complete mitochondrial genome sequences of owls (Aves: Strigiformes) possessing duplicated control regions. We searched the nuclear genome of for copies of mitochondrial genes and found at least nine separate stretches of nuclear copies of gene sequences originating in the mitochondrial genome (). The ranged from 226-19,522 nt in length and included copies of all mitochondrial genes except , , and . and exhibited an average of 10.74% (8.68% uncorrected -distance) divergence across the non-tRNA mitochondrial genes.
我们在此报告成功组装出北方斑点鸮()和横斑林鸮()的完整线粒体基因组。我们利用了来自两种测序方法的序列数据,即插入片段长度约为250个核苷酸(nt)至9600 nt、读长为100 - 375 nt的Illumina双端序列数据以及桑格测序法获得的序列。我们采用了多种组装程序和比对方法来生成最终组装结果。和的环状基因组分别由19948 nt和18975 nt组成。两者都编码两种rRNA、22种tRNA和13种多肽。它们都有重复的控制区序列,具有复杂的重复结构。我们仅使用Illumina双端序列数据无法组装出控制区。通过完全覆盖控制区,桑格测序法获得的序列实现了这些线粒体基因组的准确和完整组装。这些是拥有重复控制区的猫头鹰(鸟纲:鸮形目)的首批完整线粒体基因组序列。我们在的核基因组中搜索线粒体基因的拷贝,发现至少有九个源自线粒体基因组()的基因序列核拷贝的独立片段。长度在226 - 19522 nt之间,包括除、和之外的所有线粒体基因的拷贝。和在非tRNA线粒体基因上平均表现出10.74%(未校正距离为8.68%)的分歧。