Morelli Magnolia W, Blackmon Heath, Hjelmen Carl E
Department of Biology, Utah Valley University, Orem, UT, United States.
Department of Biology, Texas A&M University, College Station, TX, United States.
Front Ecol Evol. 2022;10. doi: 10.3389/fevo.2022.832378. Epub 2022 Mar 17.
Karyotypes and chromosome data have been widely used in many subfields of biology over the last century. Unfortunately, this data is largely scattered among hundreds of articles, books, and theses, many of which are only available behind paywalls. This creates a barrier to new researchers wishing to use this data, especially those from smaller institutions or in countries lacking institutional access to much of the scientific literature. We solved this problem by building two datasets for true flies (Order: Diptera and one specific to ), These datasets are available via a public interactive database that allows users to explore, visualize and download all data. The Diptera karyotype databases currently contain a total of 3,474 karyotype records from 538 publications. Synthesizing this data, we show several groups are of particular interest for future investigations by whole genome sequencing.
在过去的一个世纪里,核型和染色体数据在生物学的许多子领域中得到了广泛应用。不幸的是,这些数据大多分散在数百篇文章、书籍和论文中,其中许多只能通过付费墙获取。这给希望使用这些数据的新研究人员造成了障碍,尤其是那些来自较小机构或所在国家无法通过机构途径获取大量科学文献的研究人员。我们通过为实蝇构建两个数据集(双翅目和一个特定于……的数据集)解决了这个问题。这些数据集可通过一个公共交互式数据库获取,该数据库允许用户探索、可视化和下载所有数据。双翅目核型数据库目前包含来自538篇出版物的总共3474条核型记录。综合这些数据,我们表明有几个群体对于未来通过全基因组测序进行的研究特别有意义。