Validation of an NSP-based (negative selection pattern) gene family identification strategy.

Authors

Frank Ronald L, Kandoth Cyriac, Ercal Fikret

Affiliation

Biological Sciences Department, Missouri S&T, Rolla, MO 65409, USA.

Publication information

BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S2. doi: 10.1186/1471-2105-9-S9-S2.

DOI:10.1186/1471-2105-9-S9-S2
PMID:18793465
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2537573/
Abstract

BACKGROUND

Gene family identification from expressed sequence tags (ESTs) can be a valuable resource for the analysis of genome evolution, but it presents unique challenges in organisms whose entire genome has not yet been sequenced. We have developed a novel gene family identification method that screens EST-generated contigs using negative selection patterns (NSP) between family members. This strategy was tested on five known gene families in Arabidopsis to determine whether individual paralogs could be identified accurately from EST data alone, using the actual gene sequences of this fully sequenced genome as the reference.

RESULTS

The NSP method uniquely identified family members in all the gene families tested. Two members of the FtsH gene family, three members each of the PAL, RF1, and ribosomal L6 gene families, and four members of the CAD gene family were correctly identified. Additionally, all ESTs from the representative contigs, when checked against MapViewer data, mapped to the predicted gene locus.

CONCLUSION

We demonstrate the effectiveness of the NSP strategy in identifying specific gene family members in Arabidopsis using only EST data and we describe how this strategy can be used to identify many gene families in agronomically important crop species where they are as yet undiscovered.
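
The abstract does not say how a negative selection pattern is scored, so the following is only a minimal sketch of the general idea, not the authors' pipeline. It assumes that two contigs have already been aligned in frame, and it counts how many differing codons are synonymous (same amino acid) versus nonsynonymous. A pair whose differences are mostly synonymous is consistent with negative (purifying) selection acting on distinct paralogs. The function name `count_codon_changes` and the example sequences are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def count_codon_changes(seq_a, seq_b):
    """Return (synonymous, nonsynonymous) counts of differing codons.

    Assumes both sequences are aligned, in frame, and of equal length.
    Codons containing gaps or ambiguous bases are skipped.
    """
    if len(seq_a) != len(seq_b) or len(seq_a) % 3 != 0:
        raise ValueError("sequences must be equal-length and a multiple of 3")
    syn = nonsyn = 0
    for i in range(0, len(seq_a), 3):
        codon_a = seq_a[i:i + 3].upper()
        codon_b = seq_b[i:i + 3].upper()
        if codon_a == codon_b:
            continue  # no difference at this codon
        if codon_a not in CODON_TABLE or codon_b not in CODON_TABLE:
            continue  # gap or ambiguous base, cannot classify
        if CODON_TABLE[codon_a] == CODON_TABLE[codon_b]:
            syn += 1
        else:
            nonsyn += 1
    return syn, nonsyn


# Hypothetical aligned fragments: Met-Ala-Leu-Lys vs. Met-Ala-Leu-Arg
print(count_codon_changes("ATGGCTCTTAAA", "ATGGCCCTGAGA"))  # (2, 1)
```

Counting whole codons is a deliberate simplification. A real analysis would normalize by the number of synonymous and nonsynonymous sites (for example, a Ka/Ks estimate) and would start from a codon-aware alignment.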

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8bb1/2537573/b91111bcc294/1471-2105-9-S9-S2-1.jpg
