Sato Masanao, Seki Masahide, Suzuki Yutaka, Ueki Shoko
Division of Applied Bioscience, Graduate School of Agriculture, Hokkaido University, Kita 9 Nishi 9, Kita-Ku, Sapporo, Hokkaido 060-8589, Japan.
Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8561, Japan.
Data Brief. 2023 Mar 22;48:109071. doi: 10.1016/j.dib.2023.109071. eCollection 2023 Jun.
The study organism is a eukaryotic, cosmopolitan, unicellular alga (class Raphidophyceae) that produces fish-killing blooms. There is substantial scientific and practical interest in the ecophysiological characteristics that determine its bloom dynamics and its adaptation to a broad range of climate zones. Well-annotated genomic and genetic sequence information enables researchers to characterize organisms using modern molecular technology. In the present study, we conducted RNA sequencing and assembled a transcriptome from 84,693,530 high-quality, deduplicated short reads. The reads were assembled with the Trinity assembler, yielding 144,777 contigs with an N50 value of 1085. A total of 60,877 open reading frames of 150 bp or longer were predicted. For further analyses, all predicted genes were annotated with their top Gene Ontology terms, Pfam hits, and BLAST hits. The raw data were deposited in the NCBI SRA database (BioProject PRJDB6241 and PRJDB15108), and the assemblies are available in the NCBI TSA database (ICRV01). The annotation information is available in Dryad via doi: 10.5061/dryad.m0cfxpp56.
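Contiguity statistics such as N50 are derived from contig lengths alone. As a minimal sketch (not the authors' pipeline; the function name `n50` and the toy lengths are illustrative assumptions), N50 can be computed as the length of the shortest contig in the smallest set of longest contigs that together cover at least half of the total assembly length:

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths (hypothetical helper)."""
    if not lengths:
        raise ValueError("no contig lengths given")
    # Walk contigs from longest to shortest, accumulating length.
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first contig that pushes the running sum to half the total is the N50.
        if running >= half_total:
            return length


# Toy lengths for illustration only; these are not values from the dataset.
toy_lengths = [100, 200, 300, 400, 500]
print(n50(toy_lengths))  # prints 400
```

In practice the lengths would be read from the assembled contig FASTA file; the toy list here only illustrates the arithmetic.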