Edward Via College of Osteopathic Medicine, University of Louisiana Monroe, Monroe, LA 71203, USA.
Center for One Health Research, VA-MD Regional College of Veterinary Medicine, Blacksburg, VA 24060, USA.
G3 (Bethesda). 2021 Dec 8;11(12). doi: 10.1093/g3journal/jkab315.
The eastern fox squirrel, Sciurus niger, exhibits marked geographic variation in size and coat color, is a model organism for studies of behavior and ecology, and a potential model for investigating physiological solutions to human porphyrias. We assembled a genome using Illumina HiSeq, PacBio SMRT, and Oxford Nanopore MinION sequencing platforms. Together, the sequencing data resulted in a draft genome of 2.99 Gb, containing 32,830 scaffolds with an average size of 90.9 Kb and N50 of 183.8 Kb. Genome completeness was estimated to be 93.78%. A total of 24,443 protein-encoding genes were predicted from the assembly and 23,079 (94.42%) were annotated. Repeat elements comprised an estimated 38.49% of the genome, with the majority being LINEs (13.92%), SINEs (6.04%), and LTR elements. The topology of the species tree reconstructed using maximum-likelihood phylogenetic analysis was congruent with those of previous studies. This genome assembly can prove useful for comparative studies of genome structure and function in this rapidly diversifying lineage of mammals, for studies of population genomics and adaptation, and for biomedical research. Predicted amino acid sequence alignments for genes affecting heme biosynthesis, color vision, and hibernation showed point mutations and indels that may affect protein function and ecological adaptation.
东部红松鼠(Sciurus niger)在体型和毛色上表现出明显的地理变异,是行为和生态学研究的模式生物,也是研究人类卟啉症生理解决方案的潜在模型。我们使用 Illumina HiSeq、PacBio SMRT 和 Oxford Nanopore MinION 测序平台组装了一个基因组。这些测序数据共同产生了一个 2.99Gb 的草图基因组,包含 32830 个支架,平均大小为 90.9 Kb,N50 为 183.8 Kb。基因组完整性估计为 93.78%。从组装中预测了 24443 个蛋白质编码基因,其中 23079 个(94.42%)被注释。重复元件估计占基因组的 38.49%,其中大部分是 LINEs(13.92%)、SINEs(6.04%)和 LTR 元件。使用最大似然系统发育分析重建的物种树拓扑结构与先前的研究一致。这个基因组组装对于研究这个快速多样化的哺乳动物谱系的基因组结构和功能、群体基因组学和适应性以及生物医学研究都将非常有用。预测影响血红素生物合成、颜色视觉和冬眠的基因的氨基酸序列比对显示,可能影响蛋白质功能和生态适应性的点突变和插入缺失。