Generation and analysis of expressed sequence tags from six developing xylem libraries in Pinus radiata D. Don.

Authors

Li Xinguo, Wu Harry X, Dillon Shannon K, Southerton Simon G

Affiliation

CSIRO Plant Industry, GPO Box 1600, Canberra, ACT 2601, Australia.

Publication

BMC Genomics. 2009 Jan 21;10:41. doi: 10.1186/1471-2164-10-41.

Abstract

BACKGROUND

Wood is a major renewable natural resource for the timber, fibre and bioenergy industries. Pinus radiata D. Don is the most important commercial plantation tree species in Australia and several other countries; however, genomic resources for this species in public databases are very limited. Our primary objective was to sequence a large number of expressed sequence tags (ESTs) from genes involved in wood formation in radiata pine.

RESULTS

Six developing xylem cDNA libraries were constructed from earlywood and latewood tissues sampled at juvenile (7 yrs), transition (11 yrs) and mature (30 yrs) ages. These xylem tissues represent six typical developmental stages in a rotation period of radiata pine. A total of 6,389 high-quality ESTs were collected from 5,952 cDNA clones. Assembly of 5,952 ESTs from 5' end sequences generated 3,304 unigenes, comprising 952 contigs and 2,352 singletons. About 97.0% of the 5,952 ESTs and 96.1% of the unigenes have matches in the UniProt and TIGR databases. Of the 3,174 unigenes with matches, 42.9% were not assigned GO (Gene Ontology) terms, and their functions are unknown or unclassified. More than half (52.1%) of the 5,952 ESTs have matches in the Pfam database and represent 772 known protein families. About 18.0% of the 5,952 ESTs matched cell wall-related genes in the MAIZEWALL database, representing all 18 categories, 91 of all 174 families and possibly 557 genes. Fifteen cell wall-related genes rank among the 30 most abundant genes: CesA, tubulin, AGP, SAMS, actin, laccase, CCoAOMT, MetE, phytocyanin, pectate lyase, cellulase, SuSy, expansin, chitinase and UDP-glucose dehydrogenase. Based on the PlantTFDB database, 41 of the 64 transcription factor families in the poplar genome were identified as being involved in radiata pine wood formation. Comparative analysis of GO term abundance revealed a distinct transcriptome in juvenile earlywood formation compared to other stages of wood development.
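The counts and percentages reported above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original analysis: the variable names and the `pct` and `approx_count` helpers are illustrative assumptions, and the absolute counts derived from percentages are approximations, since the abstract reports only rounded percentages for those items.

```python
# Figures taken directly from the abstract.
total_ests = 5952             # 5' end ESTs used for assembly
contigs = 952
singletons = 2352
unigenes = 3304
unigenes_with_matches = 3174  # unigenes matching UniProt/TIGR

def pct(part, whole):
    """Percentage of `whole` represented by `part`, rounded to one decimal (hypothetical helper)."""
    return round(100.0 * part / whole, 1)

def approx_count(percent, whole):
    """Approximate count implied by a rounded percentage (hypothetical helper)."""
    return round(whole * percent / 100.0)

# Contigs plus singletons should equal the unigene total.
assert contigs + singletons == unigenes

# Reported: 96.1% of unigenes have database matches.
unigene_match_pct = pct(unigenes_with_matches, unigenes)

# Fractions stated only as ratios in the abstract.
maizewall_family_pct = pct(91, 174)  # MAIZEWALL families represented
tf_family_pct = pct(41, 64)          # poplar transcription factor families represented

# Approximate EST counts implied by the rounded percentages (estimates only).
ests_with_matches = approx_count(97.0, total_ests)
ests_in_pfam = approx_count(52.1, total_ests)
ests_cell_wall = approx_count(18.0, total_ests)
unigenes_without_go = approx_count(42.9, unigenes_with_matches)

print(unigene_match_pct, maizewall_family_pct, tf_family_pct)
print(ests_with_matches, ests_in_pfam, ests_cell_wall, unigenes_without_go)
```

Under these assumptions the reported 96.1% figure is reproduced from 3,174 of 3,304 unigenes, and the ratio-only statements correspond to roughly 52.3% of MAIZEWALL families and 64.1% of poplar transcription factor families.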

CONCLUSION

The first large-scale genomic resource for radiata pine was generated from six developing xylem cDNA libraries. Cell wall-related genes and transcription factors were identified. Juvenile earlywood has a distinct transcriptome, which is likely to contribute to the undesirable properties of juvenile wood in radiata pine. This publicly available radiata pine resource will also be valuable for gene function studies and comparative genomics in forest trees.
