Institute of Bast Fiber Crops, Chinese Academy of Agricultural Sciences, No. 348 Xianjia Lake West Road, Changsha, 410205, Hunan, China.
Mol Biol Rep. 2012 May;39(5):6289-96. doi: 10.1007/s11033-012-1450-1.
Bast fibre crops are the second most important natural fibre crops after cotton. Of these, flax (Linum usitatissimum L.) is the most widely planted in the world, and its fibre is used for high-quality linen textiles. A cDNA library of flax bark tissues was constructed to identify genes involved in bast fibre development. A total of 2,297 unigene sequences were obtained from 3,200 randomly selected clones of the cDNA library. These sequences were grouped into 155 clusters and 2,142 singletons, and have been submitted to the GenBank database. By putative functional annotation, 23.3% of these sequences were similar to known proteins in GenBank, 44.0% were similar to unknown proteins, and 32.7% showed no significant similarity to any protein sequence in existing databases. Classified by Gene Ontology, 24.8%, 23.1% and 14.3% were assigned to molecular function, biological process, and cellular component GO terms, respectively. Further bioinformatics analysis showed that about 110 ESTs matched cell wall related genes in the MAIZEWALL database, representing 16 of its 19 functional categories; the most abundant category was protein synthesis. Based on the PlantTFDB database, 39 of the 64 transcription factor families in the Arabidopsis thaliana genome were identified as being involved in flax cell wall formation. The sequences and bioinformatics analysis data generated in this paper will be useful for gene expression, cloning and genetic engineering studies to characterize bast fibre development and improve the properties of bast fibres.
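The figures quoted in the abstract can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes that the annotation percentages refer to the 2,297 unigenes, and the variable and function names are hypothetical rather than taken from the paper. The derived counts are rounded approximations, not values reported by the authors.

```python
# Figures quoted in the abstract.
clusters = 155
singletons = 2142
unigenes = clusters + singletons  # the abstract reports 2,297 unigenes

# Annotation classes, as percentages of the sequences.
# Assumption: the percentages are taken over the 2,297 unigenes.
annotation_pct = {
    "similar_to_known_protein": 23.3,
    "similar_to_unknown_protein": 44.0,
    "no_significant_similarity": 32.7,
}


def approx_counts(total, pct_by_class):
    """Convert percentages into approximate (rounded) sequence counts."""
    return {name: round(total * pct / 100) for name, pct in pct_by_class.items()}


annotation_counts = approx_counts(unigenes, annotation_pct)

# The three annotation classes are mutually exclusive, so they should cover 100%.
pct_total = sum(annotation_pct.values())

if __name__ == "__main__":
    print("unigenes:", unigenes)
    print("percent covered:", round(pct_total, 1))
    for name, count in annotation_counts.items():
        print(f"{name}: ~{count}")
```

The Gene Ontology percentages (24.8%, 23.1%, 14.3%) are deliberately left out of the sum check, since a sequence can carry terms from more than one GO namespace and the three shares need not add up to 100%.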