• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

进化DNA基序以预测基因芯片探针性能。

Evolving DNA motifs to predict GeneChip probe performance.

作者信息

Langdon Wb, Harrison Ap

机构信息

Department of Computer Science, King's College London, Strand, London, WC2R 2LS, UK.

出版信息

Algorithms Mol Biol. 2009 Mar 19;4:6. doi: 10.1186/1748-7188-4-6.

DOI:10.1186/1748-7188-4-6
PMID:19298675
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2679018/
Abstract

BACKGROUND

Affymetrix High Density Oligonuclotide Arrays (HDONA) simultaneously measure expression of thousands of genes using millions of probes. We use correlations between measurements for the same gene across 6685 human tissue samples from NCBI's GEO database to indicated the quality of individual HG-U133A probes. Low correlation indicates a poor probe.

RESULTS

Regular expressions can be automatically created from a Backus-Naur form (BNF) context-free grammar using strongly typed genetic programming.

CONCLUSION

The automatically produced motif is better at predicting poor DNA sequences than an existing human generated RE, suggesting runs of Cytosine and Guanine and mixtures should all be avoided.

摘要

背景

Affymetrix高密度寡核苷酸阵列(HDONA)使用数百万个探针同时测量数千个基因的表达。我们利用来自NCBI的GEO数据库的6685个人类组织样本中同一基因测量值之间的相关性来表明单个HG-U133A探针的质量。低相关性表明探针质量差。

结果

正则表达式可以使用强类型遗传编程从巴科斯范式(BNF)上下文无关语法自动创建。

结论

自动生成的基序在预测不良DNA序列方面比现有的人工生成的RE更好,这表明应避免胞嘧啶和鸟嘌呤的连续排列以及混合物。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/a5a4f6e91a78/1748-7188-4-6-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/07007d32ed77/1748-7188-4-6-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/97dccb54164e/1748-7188-4-6-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/346294653507/1748-7188-4-6-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/1347af904ab7/1748-7188-4-6-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/26aea19e779d/1748-7188-4-6-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/d087081cd2f2/1748-7188-4-6-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/91fa9589c309/1748-7188-4-6-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/1b845709b9c2/1748-7188-4-6-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/a5a4f6e91a78/1748-7188-4-6-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/07007d32ed77/1748-7188-4-6-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/97dccb54164e/1748-7188-4-6-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/346294653507/1748-7188-4-6-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/1347af904ab7/1748-7188-4-6-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/26aea19e779d/1748-7188-4-6-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/d087081cd2f2/1748-7188-4-6-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/91fa9589c309/1748-7188-4-6-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/1b845709b9c2/1748-7188-4-6-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6270/2679018/a5a4f6e91a78/1748-7188-4-6-9.jpg

相似文献

1
Evolving DNA motifs to predict GeneChip probe performance.进化DNA基序以预测基因芯片探针性能。
Algorithms Mol Biol. 2009 Mar 19;4:6. doi: 10.1186/1748-7188-4-6.
2
G-spots cause incorrect expression measurement in Affymetrix microarrays.G点会导致在Affymetrix微阵列中出现不正确的表达测量结果。
BMC Genomics. 2008 Dec 18;9:613. doi: 10.1186/1471-2164-9-613.
3
Widespread existence of uncorrelated probe intensities from within the same probeset on Affymetrix GeneChips.在Affymetrix基因芯片上,同一探针组内存在广泛的不相关探针强度。
J Integr Bioinform. 2008 Aug 25;5(2):98. doi: 10.2390/biecoll-jib-2008-98.
4
A summarization approach for Affymetrix GeneChip data using a reference training set from a large, biologically diverse database.一种使用来自大型、生物多样性数据库的参考训练集对Affymetrix基因芯片数据进行汇总的方法。
BMC Bioinformatics. 2006 Oct 23;7:464. doi: 10.1186/1471-2105-7-464.
5
Probes containing runs of guanines provide insights into the biophysics and bioinformatics of Affymetrix GeneChips.含有鸟嘌呤连续序列的探针为Affymetrix基因芯片的生物物理学和生物信息学提供了见解。
Brief Bioinform. 2009 May;10(3):259-77. doi: 10.1093/bib/bbp018. Epub 2009 Apr 8.
6
Normalized Affymetrix expression data are biased by G-quadruplex formation.标准化的 Affymetrix 表达数据受到 G-四链体形成的影响。
Nucleic Acids Res. 2012 Apr;40(8):3307-15. doi: 10.1093/nar/gkr1230. Epub 2011 Dec 22.
7
In vitro identification and in silico utilization of interspecies sequence similarities using GeneChip technology.利用基因芯片技术进行种间序列相似性的体外鉴定及计算机模拟应用。
BMC Genomics. 2005 May 4;6:62. doi: 10.1186/1471-2164-6-62.
8
A formalized description of the standard human variant nomenclature in Extended Backus-Naur Form.用扩展巴科斯-诺尔范式(EBNF)形式化描述标准人类变异命名法。
BMC Bioinformatics. 2011;12 Suppl 4(Suppl 4):S5. doi: 10.1186/1471-2105-12-S4-S5. Epub 2011 Jul 5.
9
A statistical method for predicting splice variants between two groups of samples using GeneChip expression array data.一种使用基因芯片表达阵列数据预测两组样本之间剪接变体的统计方法。
Theor Biol Med Model. 2006 Apr 7;3:19. doi: 10.1186/1742-4682-3-19.
10
Framework for reanalysis of publicly available Affymetrix® GeneChip® data sets based on functional regions of interest.基于功能感兴趣区域的公开可用 Affymetrix® GeneChip® 数据集再分析框架。
BMC Genomics. 2017 Dec 6;18(Suppl 10):875. doi: 10.1186/s12864-017-4266-5.

本文引用的文献

1
A survey of spatial defects in Homo Sapiens Affymetrix GeneChips.人类 Affymetrix 基因芯片的空间缺陷调查。
IEEE/ACM Trans Comput Biol Bioinform. 2010 Oct-Dec;7(4):647-53. doi: 10.1109/TCBB.2008.108.
2
G-spots cause incorrect expression measurement in Affymetrix microarrays.G点会导致在Affymetrix微阵列中出现不正确的表达测量结果。
BMC Genomics. 2008 Dec 18;9:613. doi: 10.1186/1471-2164-9-613.
3
Ab initio identification of human microRNAs based on structure motifs.基于结构基序从头鉴定人类微小RNA
BMC Bioinformatics. 2007 Dec 18;8:478. doi: 10.1186/1471-2105-8-478.
4
NucPred--predicting nuclear localization of proteins.NucPred——预测蛋白质的核定位
Bioinformatics. 2007 May 1;23(9):1159-60. doi: 10.1093/bioinformatics/btm066. Epub 2007 Mar 1.
5
Motif kernel generated by genetic programming improves remote homology and fold detection.通过遗传编程生成的基序核可改善远程同源性和折叠检测。
BMC Bioinformatics. 2007 Jan 25;8:23. doi: 10.1186/1471-2105-8-23.
6
Interpretation of multiple probe sets mapping to the same gene in Affymetrix GeneChips.对Affymetrix基因芯片中映射到同一基因的多个探针集的解读。
BMC Bioinformatics. 2007 Jan 15;8:13. doi: 10.1186/1471-2105-8-13.
7
NCBI GEO: mining tens of millions of expression profiles--database and tools update.NCBI基因表达综合数据库:挖掘数千万个表达谱——数据库与工具更新
Nucleic Acids Res. 2007 Jan;35(Database issue):D760-5. doi: 10.1093/nar/gkl887. Epub 2006 Nov 11.
8
Solving the riddle of the bright mismatches: labeling and effective binding in oligonucleotide arrays.解开明亮错配之谜:寡核苷酸阵列中的标记与有效结合
Phys Rev E Stat Nonlin Soft Matter Phys. 2003 Jul;68(1 Pt 1):011906. doi: 10.1103/PhysRevE.68.011906. Epub 2003 Jul 16.