Fuller Evolutionary Biology Program, Cornell Lab of Ornithology,
Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, New York 14853.
G3 (Bethesda). 2020 Feb 6;10(2):475-478. doi: 10.1534/g3.119.400846.
The Horned Lark () is a small songbird that exhibits remarkable geographic variation in appearance and habitat across an expansive distribution. While has been the focus of many ecological and evolutionary studies, we still lack a highly contiguous genome assembly for the Horned Lark and related taxa (Alaudidae). Here, we present CLO_EAlp_1.0, a highly contiguous assembly for generated from a blood sample of a wild, male bird captured in the Altiplano Cundiboyacense of Colombia. By combining short-insert and mate-pair libraries with the ALLPATHS-LG genome assembly pipeline, we generated a 1.04 Gb assembly comprised of 2713 scaffolds, with a largest scaffold size of 31.81 Mb, a scaffold N50 of 9.42 Mb, and a scaffold L50 of 30. These scaffolds were assembled from 23685 contigs, with a largest contig size of 1.69 Mb, a contig N50 of 193.81 kb, and a contig L50 of 1429. Our assembly pipeline also produced a single mitochondrial DNA contig of 14.00 kb. After polishing the genome, we identified 94.5% of single-copy gene orthologs from an Aves data set and 97.7% of single-copy gene orthologs from a vertebrata data set, which further demonstrates the high quality of our assembly. We anticipate that this genomic resource will be useful to the broader ornithological community and those interested in studying the evolutionary history and ecological interactions of larks, which comprise a widespread, yet understudied lineage of songbirds.
角百灵()是一种小型鸣禽,在其广泛的分布范围内,其外观和栖息地表现出显著的地理变异。虽然已经成为许多生态和进化研究的焦点,但我们仍然缺乏角百灵及其相关物种(百灵科)的高度连续基因组组装。在这里,我们展示了 CLO_EAlp_1.0,这是一种从哥伦比亚安第斯高原的一个野生雄性鸟类的血液样本中生成的高度连续的角百灵基因组组装。通过将短插入和 mate-pair 文库与 ALLPATHS-LG 基因组组装管道相结合,我们生成了一个 1.04 Gb 的组装,由 2713 个支架组成,最大支架大小为 31.81 Mb,支架 N50 为 9.42 Mb,支架 L50 为 30。这些支架是由 23685 个 contigs 组装而成的,最大 contig 大小为 1.69 Mb,contig N50 为 193.81 kb,contig L50 为 1429。我们的组装管道还生成了一个 14.00 kb 的单一线粒体 DNA 连续体。在对基因组进行抛光后,我们鉴定出了来自鸟类数据集的 94.5%的单拷贝基因直系同源物和来自脊椎动物数据集的 97.7%的单拷贝基因直系同源物,这进一步证明了我们组装的高质量。我们预计,这个基因组资源将对更广泛的鸟类学社区和那些对研究百灵科的进化历史和生态相互作用感兴趣的人有用,百灵科是一个广泛但研究不足的鸣禽谱系。