使用加权秩和策略选择用于基因表达微阵列的长寡核苷酸。

Selection of long oligonucleotides for gene expression microarrays using weighted rank-sum strategy.

作者信息

Hu Guangan, Llinás Manuel, Li Jingguang, Preiser Peter Rainer, Bozdech Zbynek

机构信息

School of Biological Sciences, Nanyang Technological University, No, 60 Nanyang Drive, 637551, Singapore.

出版信息

BMC Bioinformatics. 2007 Sep 19;8:350. doi: 10.1186/1471-2105-8-350.

DOI:10.1186/1471-2105-8-350

PMID:17880708

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2099447/

Abstract

BACKGROUND

The design of long oligonucleotides for spotted DNA microarrays requires detailed attention to ensure their optimal performance in the hybridization process. The main challenge is to select an optimal oligonucleotide element that represents each genetic locus/gene in the genome and is unique, devoid of internal structures and repetitive sequences and its Tm is uniform with all other elements on the microarray. Currently, all of the publicly available programs for DNA long oligonucleotide microarray selection utilize various combinations of cutoffs in which each parameter (uniqueness, Tm, and secondary structure) is evaluated and filtered individually. The use of the cutoffs can, however, lead to information loss and to selection of suboptimal oligonucleotides, especially for genomes with extreme distribution of the GC content, a large proportion of repetitive sequences or the presence of large gene families with highly homologous members.

RESULTS

Here we present the program OligoRankPick which is using a weighted rank-based strategy to select microarray oligonucleotide elements via an integer weighted linear function. This approach optimizes the selection criteria (weight score) for each gene individually, accommodating variable properties of the DNA sequence along the genome. The designed algorithm was tested using three microbial genomes Escherichia coli, Saccharomyces cerevisiae and the human malaria parasite species Plasmodium falciparum. In comparison to other published algorithms OligoRankPick provides significant improvements in oligonucleotide design for all three genomes with the most significant improvements observed in the microarray design for P. falciparum whose genome is characterized by large fluctuations of GC content, and abundant gene duplications.

CONCLUSION

OligoRankPick is an efficient tool for the design of long oligonucleotide DNA microarrays which does not rely on direct oligonucleotide exclusion by parameter cutoffs but instead optimizes all parameters in context of each other. The weighted rank-sum strategy utilized by this algorithm provides high flexibility of oligonucleotide selection which accommodates extreme variability of DNA sequence properties along genomes of many organisms.

摘要

背景

用于点阵式DNA微阵列的长寡核苷酸设计需要格外关注细节，以确保其在杂交过程中的最佳性能。主要挑战在于选择一个最佳的寡核苷酸元件，该元件代表基因组中的每个基因座/基因，并且是独特的，没有内部结构和重复序列，其解链温度（Tm）与微阵列上的所有其他元件一致。目前，所有公开可用的用于DNA长寡核苷酸微阵列选择的程序都使用各种截止值组合，其中每个参数（独特性、Tm和二级结构）都被单独评估和过滤。然而，使用截止值可能会导致信息丢失和选择次优的寡核苷酸，特别是对于GC含量分布极端、重复序列比例大或存在具有高度同源成员的大基因家族的基因组。

结果

在此，我们展示了程序OligoRankPick，它使用基于加权排名的策略，通过整数加权线性函数来选择微阵列寡核苷酸元件。这种方法针对每个基因单独优化选择标准（权重分数），适应基因组中DNA序列的可变特性。使用三种微生物基因组——大肠杆菌、酿酒酵母和人类疟原虫恶性疟原虫对设计的算法进行了测试。与其他已发表的算法相比，OligoRankPick在所有三个基因组的寡核苷酸设计方面都有显著改进，在恶性疟原虫的微阵列设计中观察到最显著的改进，其基因组的特点是GC含量波动大且基因重复丰富。

结论

OligoRankPick是一种用于设计长寡核苷酸DNA微阵列的有效工具，它不依赖于通过参数截止值直接排除寡核苷酸，而是在相互关联的背景下优化所有参数。该算法使用的加权排名总和策略提供了高度灵活的寡核苷酸选择，适应了许多生物体基因组中DNA序列特性的极端变异性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c600/2099447/5fabb2deb319/1471-2105-8-350-1.jpg

相似文献

Selection of long oligonucleotides for gene expression microarrays using weighted rank-sum strategy.使用加权秩和策略选择用于基因表达微阵列的长寡核苷酸。

BMC Bioinformatics. 2007 Sep 19;8:350. doi: 10.1186/1471-2105-8-350.

Optimal robust non-unique probe selection using Integer Linear Programming.使用整数线性规划的最优稳健非唯一探针选择

Bioinformatics. 2004 Aug 4;20 Suppl 1:i186-93. doi: 10.1093/bioinformatics/bth936.

Randomized probe selection algorithm for microarray design.用于微阵列设计的随机探针选择算法

J Theor Biol. 2007 Oct 7;248(3):512-21. doi: 10.1016/j.jtbi.2007.05.036. Epub 2007 Jun 11.

Transcript-level annotation of Affymetrix probesets improves the interpretation of gene expression data.Affymetrix探针集的转录本水平注释可改善基因表达数据的解读。

BMC Bioinformatics. 2007 Jun 11;8:194. doi: 10.1186/1471-2105-8-194.

Experimental analysis of oligonucleotide microarray design criteria to detect deletions by comparative genomic hybridization.用于通过比较基因组杂交检测缺失的寡核苷酸微阵列设计标准的实验分析。

BMC Genomics. 2008 Oct 21;9:497. doi: 10.1186/1471-2164-9-497.

Long versus short oligonucleotide microarrays for the study of gene expression in nonhuman primates.用于研究非人类灵长类动物基因表达的长寡核苷酸与短寡核苷酸微阵列

J Neurosci Methods. 2006 Apr 15;152(1-2):179-89. doi: 10.1016/j.jneumeth.2005.09.007. Epub 2005 Oct 25.

Transcript mapping with high-density oligonucleotide tiling arrays.使用高密度寡核苷酸平铺阵列进行转录本图谱分析。

Bioinformatics. 2006 Aug 15;22(16):1963-70. doi: 10.1093/bioinformatics/btl289. Epub 2006 Jun 20.

Design and fabrication of spotted long oligonucleotide microarrays for gene expression analysis.用于基因表达分析的点阵长寡核苷酸微阵列的设计与制作。

Methods Mol Biol. 2007;381:213-25. doi: 10.1007/978-1-59745-303-5_10.

Linking microarray reporters with protein functions.将微阵列报告基因与蛋白质功能相联系。

BMC Bioinformatics. 2007 Sep 26;8:360. doi: 10.1186/1471-2105-8-360.

In silico gene selection for custom oligonucleotide microarray design.用于定制寡核苷酸微阵列设计的计算机基因选择

Methods Mol Biol. 2007;382:417-28. doi: 10.1007/978-1-59745-304-2_26.

引用本文的文献

Enhancing Gene Co-Expression Network Inference for the Malaria Parasite .增强疟原虫基因共表达网络推断

Genes (Basel). 2024 May 25;15(6):685. doi: 10.3390/genes15060685.

Sterile protection against malaria by repeated blood stage infection in the monkey model.经重复血期感染对猴子模型中的疟疾进行无菌保护。

Life Sci Alliance. 2023 Dec 29;7(3). doi: 10.26508/lsa.202302524. Print 2024 Mar.

Cyclical regression covariates remove the major confounding effect of cyclical developmental gene expression with strain-specific drug response in the malaria parasite Plasmodium falciparum.周期性回归协变量消除了疟原虫恶性疟原虫中与菌株特异性药物反应相关的周期性发育基因表达的主要混杂影响。

BMC Genomics. 2022 Mar 5;23(1):180. doi: 10.1186/s12864-021-08281-y.

Simultaneous genome-wide gene expression and transcript isoform profiling in the human malaria parasite.对人类疟原虫进行全基因组范围的基因表达和转录本异构体同时分析。

PLoS One. 2017 Nov 7;12(11):e0187595. doi: 10.1371/journal.pone.0187595. eCollection 2017.

Histone 4 lysine 8 acetylation regulates proliferation and host-pathogen interaction in Plasmodium falciparum.组蛋白4赖氨酸8乙酰化调节恶性疟原虫的增殖及宿主-病原体相互作用。

Epigenetics Chromatin. 2017 Aug 22;10(1):40. doi: 10.1186/s13072-017-0147-z.

Integrated analysis of the Plasmodium species transcriptome.疟原虫物种转录组的综合分析。

EBioMedicine. 2016 May;7:255-66. doi: 10.1016/j.ebiom.2016.04.011. Epub 2016 Apr 22.

DNA damage regulation and its role in drug-related phenotypes in the malaria parasites.疟原虫中DNA损伤调控及其在药物相关表型中的作用。

Sci Rep. 2016 Apr 1;6:23603. doi: 10.1038/srep23603.

Genome-wide transcriptome profiling reveals functional networks involving the Plasmodium falciparum drug resistance transporters PfCRT and PfMDR1.全基因组转录组分析揭示了涉及恶性疟原虫耐药转运蛋白PfCRT和PfMDR1的功能网络。

BMC Genomics. 2015 Dec 21;16:1090. doi: 10.1186/s12864-015-2320-8.

Genome-wide analysis in Plasmodium falciparum reveals early and late phases of RNA polymerase II occupancy during the infectious cycle.恶性疟原虫的全基因组分析揭示了感染周期中RNA聚合酶II占据的早期和晚期阶段。

BMC Genomics. 2014 Nov 6;15(1):959. doi: 10.1186/1471-2164-15-959.

Role of calcium signaling in the transcriptional regulation of the apicoplast genome of Plasmodium falciparum.钙信号在恶性疟原虫顶质体基因组转录调控中的作用。

Biomed Res Int. 2014;2014:869401. doi: 10.1155/2014/869401. Epub 2014 Apr 27.

本文引用的文献

Selection of optimal oligonucleotide probes for microarrays using multiple criteria, global alignment and parameter estimation.使用多标准、全局比对和参数估计选择用于微阵列的最佳寡核苷酸探针。

Nucleic Acids Res. 2005 Oct 24;33(19):6114-23. doi: 10.1093/nar/gki914. Print 2005.

Core transcriptional regulatory circuitry in human embryonic stem cells.人类胚胎干细胞中的核心转录调控回路。

Cell. 2005 Sep 23;122(6):947-56. doi: 10.1016/j.cell.2005.08.020.

The genome of the kinetoplastid parasite, Leishmania major.动质体寄生虫硕大利什曼原虫的基因组。

Science. 2005 Jul 15;309(5733):436-42. doi: 10.1126/science.1112680.

Empirical establishment of oligonucleotide probe design criteria.寡核苷酸探针设计标准的经验性确立。

Appl Environ Microbiol. 2005 Jul;71(7):3753-60. doi: 10.1128/AEM.71.7.3753-3760.2005.

Transcript copy number estimation using a mouse whole-genome oligonucleotide microarray.使用小鼠全基因组寡核苷酸微阵列进行转录本拷贝数估计。

Genome Biol. 2005;6(7):R61. doi: 10.1186/gb-2005-6-7-r61. Epub 2005 Jun 30.

YODA: selecting signature oligonucleotides.尤达：选择标志性寡核苷酸。

Bioinformatics. 2005 Apr 15;21(8):1365-70. doi: 10.1093/bioinformatics/bti182. Epub 2004 Nov 30.

The role of Plasmodium falciparum var genes in malaria in pregnancy.恶性疟原虫变异基因在妊娠疟疾中的作用。

Mol Microbiol. 2004 Aug;53(4):1011-9. doi: 10.1111/j.1365-2958.2004.04256.x.

Optimization of probe length and the number of probes per gene for optimal microarray analysis of gene expression.优化探针长度和每个基因的探针数量以实现基因表达的最佳微阵列分析。

Nucleic Acids Res. 2004 Jul 8;32(12):e99. doi: 10.1093/nar/gnh099.

Designing better probes: effect of probe size, mismatch position and number on hybridization in DNA oligonucleotide microarrays.设计更好的探针：探针大小、错配位置及数量对DNA寡核苷酸微阵列杂交的影响

J Microbiol Methods. 2004 May;57(2):269-78. doi: 10.1016/j.mimet.2004.02.002.

The genome sequence of Mycoplasma mycoides subsp. mycoides SC type strain PG1T, the causative agent of contagious bovine pleuropneumonia (CBPP).丝状支原体山羊亚种丝状支原体SC型菌株PG1T的基因组序列，该菌株是牛传染性胸膜肺炎（CBPP）的病原体。

Genome Res. 2004 Feb;14(2):221-7. doi: 10.1101/gr.1673304.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用加权秩和策略选择用于基因表达微阵列的长寡核苷酸。

Selection of long oligonucleotides for gene expression microarrays using weighted rank-sum strategy.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献