• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种预测新转录靶标的负选择启发式方法。

A negative selection heuristic to predict new transcriptional targets.

机构信息

Department of Science, University of Sannio, Benevento, Italy.

出版信息

BMC Bioinformatics. 2013;14 Suppl 1(Suppl 1):S3. doi: 10.1186/1471-2105-14-S1-S3. Epub 2013 Jan 14.

DOI:10.1186/1471-2105-14-S1-S3
PMID:23368951
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3548675/
Abstract

BACKGROUND

Supervised machine learning approaches have been recently adopted in the inference of transcriptional targets from high throughput trascriptomic and proteomic data showing major improvements from with respect to the state of the art of reverse gene regulatory network methods. Beside traditional unsupervised techniques, a supervised classifier learns, from known examples, a function that is able to recognize new relationships for new data. In the context of gene regulatory inference a supervised classifier is coerced to learn from positive and unlabeled examples, as the counter negative examples are unavailable or hard to collect. Such a condition could limit the performance of the classifier especially when the amount of training examples is low.

RESULTS

In this paper we improve the supervised identification of transcriptional targets by selecting reliable counter negative examples from the unlabeled set. We introduce an heuristic based on the known topology of transcriptional networks that in fact restores the conventional positive/negative training condition and shows a significant improvement of the classification performance. We empirically evaluate the proposed heuristic with the experimental datasets of Escherichia coli and show an example of application in the prediction of BCL6 direct core targets in normal germinal center human B cells obtaining a precision of 60%.

CONCLUSIONS

The availability of only positive examples in learning transcriptional relationships negatively affects the performance of supervised classifiers. We show that the selection of reliable negative examples, a practice adopted in text mining approaches, improves the performance of such classifiers opening new perspectives in the identification of new transcriptional targets.

摘要

背景

最近,监督机器学习方法已被用于从高通量转录组学和蛋白质组学数据中推断转录靶标,与反向基因调控网络方法的最新技术相比,这些方法取得了重大改进。除了传统的无监督技术外,监督分类器还可以从已知示例中学习到一种能够识别新数据中新关系的功能。在基因调控推断的背景下,监督分类器被迫从阳性和未标记的示例中学习,因为无法获得或难以收集反例。这种情况可能会限制分类器的性能,尤其是在训练示例数量较少的情况下。

结果

在本文中,我们通过从未标记集中选择可靠的反例,改进了转录靶标监督识别。我们引入了一种基于转录网络已知拓扑结构的启发式方法,该方法实际上恢复了传统的正/负训练条件,并显著提高了分类性能。我们使用大肠杆菌的实验数据集对提出的启发式方法进行了实证评估,并展示了在预测正常生发中心人类 B 细胞中 BCL6 直接核心靶标的应用示例,获得了 60%的精度。

结论

在学习转录关系时只有阳性示例可用,这会对监督分类器的性能产生负面影响。我们表明,选择可靠的反例(文本挖掘方法中采用的实践)可以提高此类分类器的性能,为识别新的转录靶标开辟了新的前景。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e861/3548675/dfcc461185f1/1471-2105-14-S1-S3-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e861/3548675/58ce887900c7/1471-2105-14-S1-S3-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e861/3548675/5eff2b0f2c51/1471-2105-14-S1-S3-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e861/3548675/295486e3ce77/1471-2105-14-S1-S3-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e861/3548675/13360620f416/1471-2105-14-S1-S3-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e861/3548675/1837e25a7e40/1471-2105-14-S1-S3-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e861/3548675/2e97e511e39d/1471-2105-14-S1-S3-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e861/3548675/dfcc461185f1/1471-2105-14-S1-S3-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e861/3548675/58ce887900c7/1471-2105-14-S1-S3-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e861/3548675/5eff2b0f2c51/1471-2105-14-S1-S3-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e861/3548675/295486e3ce77/1471-2105-14-S1-S3-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e861/3548675/13360620f416/1471-2105-14-S1-S3-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e861/3548675/1837e25a7e40/1471-2105-14-S1-S3-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e861/3548675/2e97e511e39d/1471-2105-14-S1-S3-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e861/3548675/dfcc461185f1/1471-2105-14-S1-S3-7.jpg

相似文献

1
A negative selection heuristic to predict new transcriptional targets.一种预测新转录靶标的负选择启发式方法。
BMC Bioinformatics. 2013;14 Suppl 1(Suppl 1):S3. doi: 10.1186/1471-2105-14-S1-S3. Epub 2013 Jan 14.
2
Learning gene regulatory networks from only positive and unlabeled data.仅从正样本和无标签数据中学习基因调控网络。
BMC Bioinformatics. 2010 May 5;11:228. doi: 10.1186/1471-2105-11-228.
3
Supervised inference of gene regulatory networks from positive and unlabeled examples.基于正例和未标记示例的基因调控网络的监督推理。
Methods Mol Biol. 2013;939:47-58. doi: 10.1007/978-1-62703-107-3_5.
4
PreCisIon: PREdiction of CIS-regulatory elements improved by gene's positION.精确性:通过基因位置提高 CIS 调控元件的预测。
Nucleic Acids Res. 2013 Feb 1;41(3):1406-15. doi: 10.1093/nar/gks1286. Epub 2012 Dec 14.
5
Transcriptional network inference from functional similarity and expression data: a global supervised approach.基于功能相似性和表达数据的转录网络推断:一种全局监督方法。
Stat Appl Genet Mol Biol. 2012 Jan 6;11(1):Article 2. doi: 10.2202/1544-6115.1695.
6
Semi-supervised prediction of gene regulatory networks using machine learning algorithms.使用机器学习算法对基因调控网络进行半监督预测。
J Biosci. 2015 Oct;40(4):731-40. doi: 10.1007/s12038-015-9558-9.
7
SemiBoost: boosting for semi-supervised learning.半增强算法:用于半监督学习的增强算法
IEEE Trans Pattern Anal Mach Intell. 2009 Nov;31(11):2000-14. doi: 10.1109/TPAMI.2008.235.
8
SIRENE: supervised inference of regulatory networks.SIRENE:监管网络的监督推理
Bioinformatics. 2008 Aug 15;24(16):i76-82. doi: 10.1093/bioinformatics/btn273.
9
Positive-unlabeled learning for disease gene identification.基于正例无标记学习的疾病基因识别。
Bioinformatics. 2012 Oct 15;28(20):2640-7. doi: 10.1093/bioinformatics/bts504. Epub 2012 Aug 24.
10
Integrated biochemical and computational approach identifies BCL6 direct target genes controlling multiple pathways in normal germinal center B cells.综合生化和计算方法鉴定了 BCL6 的直接靶基因,这些基因控制正常生发中心 B 细胞中的多个途径。
Blood. 2010 Feb 4;115(5):975-84. doi: 10.1182/blood-2009-06-227017. Epub 2009 Dec 3.

引用本文的文献

1
Learning a Markov Logic network for supervised gene regulatory network inference.学习马尔可夫逻辑网络以进行监督基因调控网络推断。
BMC Bioinformatics. 2013 Sep 12;14:273. doi: 10.1186/1471-2105-14-273.

本文引用的文献

1
Genome-wide dissection of posttranscriptional and posttranslational interactions.转录后和翻译后相互作用的全基因组剖析
Methods Mol Biol. 2012;786:131-49. doi: 10.1007/978-1-61779-292-2_8.
2
Learning gene regulatory networks from only positive and unlabeled data.仅从正样本和无标签数据中学习基因调控网络。
BMC Bioinformatics. 2010 May 5;11:228. doi: 10.1186/1471-2105-11-228.
3
Reverse-engineering transcription control networks.转录调控网络的反向工程。
Phys Life Rev. 2005 Mar;2(1):65-88. doi: 10.1016/j.plrev.2005.01.001.
4
TimeDelay-ARACNE: Reverse engineering of gene networks from time-course data by an information theoretic approach.时滞 ARACNE:基于信息论方法从时间序列数据中反向工程基因网络。
BMC Bioinformatics. 2010 Mar 25;11:154. doi: 10.1186/1471-2105-11-154.
5
Integrated biochemical and computational approach identifies BCL6 direct target genes controlling multiple pathways in normal germinal center B cells.综合生化和计算方法鉴定了 BCL6 的直接靶基因,这些基因控制正常生发中心 B 细胞中的多个途径。
Blood. 2010 Feb 4;115(5):975-84. doi: 10.1182/blood-2009-06-227017. Epub 2009 Dec 3.
6
Comparing different ODE modelling approaches for gene regulatory networks.比较基因调控网络的不同 ODE 建模方法。
J Theor Biol. 2009 Dec 21;261(4):511-30. doi: 10.1016/j.jtbi.2009.07.040. Epub 2009 Aug 6.
7
Reverse engineering of gene regulatory networks: a comparative study.基因调控网络的逆向工程:一项比较研究。
EURASIP J Bioinform Syst Biol. 2009;2009(1):617281. doi: 10.1155/2009/617281. Epub 2009 Jun 11.
8
The BCL6 transcriptional program features repression of multiple oncogenes in primary B cells and is deregulated in DLBCL.BCL6转录程序的特点是抑制原发性B细胞中的多种癌基因,并且在弥漫性大B细胞淋巴瘤(DLBCL)中失调。
Blood. 2009 May 28;113(22):5536-48. doi: 10.1182/blood-2008-12-193037. Epub 2009 Mar 23.
9
SIRENE: supervised inference of regulatory networks.SIRENE:监管网络的监督推理
Bioinformatics. 2008 Aug 15;24(16):i76-82. doi: 10.1093/bioinformatics/btn273.
10
Reconstructing gene regulatory networks with bayesian networks by combining expression data with multiple sources of prior knowledge.通过将表达数据与多种先验知识来源相结合,利用贝叶斯网络重建基因调控网络。
Stat Appl Genet Mol Biol. 2007;6:Article15. doi: 10.2202/1544-6115.1282. Epub 2007 May 29.