Cossío-Bayúgar Raquel, Miranda-Miranda Estefan, Aguilar-Díaz Hugo, Narváez-Padilla Verónica, Reynaud Enrique
Centro Nacional de Investigaciones Disciplinarias en Salud Animal e Inocuidad, Instituto Nacional de Investigaciones Forestales Agrícolas y Pecuarias (INIFAP). Boulevard Cuauhnahuac No. 8534, Jiutepec, Morelos 62574, Mexico.
Centro de Investigación en Dinámica Celular, Universidad Autónoma del Estado de Morelos, Av. Universidad 1001, Chamilpa, Cuernavaca, 62209, Mexico.
Data Brief. 2024 Jun 24;55:110661. doi: 10.1016/j.dib.2024.110661. eCollection 2024 Aug.
To conduct differential gene expression analysis, ovaries from the cattle tick were dissected at three distinct developmental stages (preingurgitated, immature ingurgitated, and mature ingurgitated). Additionally, undissected intact mature males and complete ingurgitated female ticks without ovaries (carcasses) were also collected to serve as reference samples for analysis. To perform total RNA purification, tissue from ten individuals representing each of the five previously described conditions was pooled. mRNA was isolated from the purified total RNA using the oligo (dT) method. Following fragmentation, double stranded cDNA was synthesized and ligated to sequencing adapters. Suitable-sized fragments were subsequently used for PCR amplification. Libraries were analyzed and quantified using an Agilent 2100 Bioanalyzer and an ABI StepOnePlus Real-Time PCR System. A total of 45.64 Gb bases were sequenced using the Illumina HiSeq sequencing platform. After assembling the samples and correcting for abundance, we obtained 82,877 unigenes. The total length, average length, N50, and GC content of the unigenes were 89,754,828 bp,1,082 bp,2,068 bp and 49.04 % respectively. For functional annotation, the unigenes were aligned with 7 functional databases. The number of unigenes identified in the functional databases were as follows: 32,518 (NR:39.24 %), 10,259 (NT:12.38 %), 23,624 (Swissprot:28.50 %), 22,203 (KOG:26.79 %), 25,072 (KEGG:30.25 %), 17,435(GO:21.04 %), and 23,220 (InterPro:28.02 %). Unigene candidate coding regions (CDS) among the unigenes were predicted using TransDecoder software and 42,143 CDS were detected. We also detected 10,522 simple sequence repeats (SSRs) distributed on 8,126 unigenes, and predicted 4,672 transcription factors (TF) coding unigenes. Our data can be used to identify genes that are important for male and female tick and arachnid reproduction and tick general physiology.
为了进行差异基因表达分析,在三个不同发育阶段(吸血前、未成熟吸血和成熟吸血)解剖牛蜱的卵巢。此外,还收集了未解剖的完整成熟雄性蜱以及没有卵巢的完全吸血雌性蜱(虫体)作为分析的参考样本。为了进行总RNA纯化,将代表上述五种条件中每种条件的十个个体的组织混合。使用寡聚(dT)方法从纯化的总RNA中分离mRNA。片段化后,合成双链cDNA并连接到测序接头。随后使用合适大小的片段进行PCR扩增。使用安捷伦2100生物分析仪和ABI StepOnePlus实时PCR系统对文库进行分析和定量。使用Illumina HiSeq测序平台共测序45.64 Gb碱基。在组装样本并校正丰度后,我们获得了82,877个单基因。单基因的总长度、平均长度、N50和GC含量分别为89,754,828 bp、1,082 bp、2,068 bp和49.04%。为了进行功能注释,将单基因与7个功能数据库进行比对。在功能数据库中鉴定出的单基因数量如下:32,518个(NR:39.24%)、10,259个(NT:12.38%)、23,624个(Swissprot:28.50%)、22,203个(KOG:26.79%)、25,072个(KEGG:30.25%)、17,435个(GO:21.04%)和23,220个(InterPro:28.02%)。使用TransDecoder软件预测单基因中的单基因候选编码区(CDS),并检测到42,143个CDS。我们还检测到分布在8,126个单基因上的10,522个简单序列重复(SSR),并预测了4,672个编码转录因子(TF)的单基因。我们的数据可用于鉴定对蜱和蛛形纲动物的雄性和雌性繁殖以及蜱的一般生理学重要的基因。