• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用多基因表达谱选择用于判别分析的信息基因。

Selecting informative genes for discriminant analysis using multigene expression profiles.

作者信息

Yan Xin, Zheng Tian

机构信息

Russell Investments, Tacoma, WA, USA.

出版信息

BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S14. doi: 10.1186/1471-2164-9-S2-S14.

DOI:10.1186/1471-2164-9-S2-S14
PMID:18831779
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2559878/
Abstract

BACKGROUND

Gene expression data extracted from microarray experiments have been used to study the difference between mRNA abundance of genes under different conditions. In one of such experiments, thousands of genes are measured simultaneously, which provides a high-dimensional feature space for discriminating between different sample classes. However, most of these dimensions are not informative about the between-class difference, and add noises to the discriminant analysis.

RESULTS

In this paper we propose and study feature selection methods that evaluate the "informativeness" of a set of genes. Two measures of information based on multigene expression profiles are considered for a backward information-driven screening approach for selecting important gene features. By considering multigene expression profiles, we are able to utilize interaction information among these genes. Using a breast cancer data, we illustrate our methods and compare them to the performance of existing methods.

CONCLUSION

We illustrate in this paper that methods considering gene-gene interactions have better classification power in gene expression analysis. In our results, we identify important genes with relative large p-values from single gene tests. This indicates that these are genes with weak marginal information but strong interaction information, which will be overlooked by strategies that only examine individual genes.

摘要

背景

从微阵列实验中提取的基因表达数据已被用于研究不同条件下基因的mRNA丰度差异。在其中一项此类实验中,数千个基因被同时测量,这为区分不同样本类别提供了一个高维特征空间。然而,这些维度中的大多数对于类间差异并无信息价值,反而会给判别分析增加噪声。

结果

在本文中,我们提出并研究了评估一组基因“信息价值”的特征选择方法。基于多基因表达谱考虑了两种信息度量,用于一种反向信息驱动的筛选方法来选择重要的基因特征。通过考虑多基因表达谱,我们能够利用这些基因之间的相互作用信息。使用乳腺癌数据,我们阐述了我们的方法并将其与现有方法的性能进行比较。

结论

我们在本文中表明,考虑基因 - 基因相互作用的方法在基因表达分析中具有更好的分类能力。在我们的结果中,我们从单基因测试中识别出具有相对较大p值的重要基因。这表明这些是具有弱边际信息但强相互作用信息的基因,而仅检查单个基因的策略会忽略这些基因。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cef3/2559878/99ec9c895bef/1471-2164-9-S2-S14-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cef3/2559878/cd9ff17a7471/1471-2164-9-S2-S14-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cef3/2559878/91778d8b7d78/1471-2164-9-S2-S14-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cef3/2559878/7c13102ffbc6/1471-2164-9-S2-S14-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cef3/2559878/51d0c9a3e3e4/1471-2164-9-S2-S14-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cef3/2559878/99ec9c895bef/1471-2164-9-S2-S14-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cef3/2559878/cd9ff17a7471/1471-2164-9-S2-S14-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cef3/2559878/91778d8b7d78/1471-2164-9-S2-S14-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cef3/2559878/7c13102ffbc6/1471-2164-9-S2-S14-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cef3/2559878/51d0c9a3e3e4/1471-2164-9-S2-S14-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cef3/2559878/99ec9c895bef/1471-2164-9-S2-S14-5.jpg

相似文献

1
Selecting informative genes for discriminant analysis using multigene expression profiles.利用多基因表达谱选择用于判别分析的信息基因。
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S14. doi: 10.1186/1471-2164-9-S2-S14.
2
Eigengene-based linear discriminant model for tumor classification using gene expression microarray data.基于特征基因的线性判别模型用于利用基因表达微阵列数据进行肿瘤分类
Bioinformatics. 2006 Nov 1;22(21):2635-42. doi: 10.1093/bioinformatics/btl442. Epub 2006 Aug 22.
3
Simultaneous classification and feature clustering using discriminant vector quantization with applications to microarray data analysis.使用判别向量量化的同时分类与特征聚类及其在微阵列数据分析中的应用
Proc IEEE Comput Soc Bioinform Conf. 2002;1:246-55.
4
Incorporating prior knowledge of gene functional groups into regularized discriminant analysis of microarray data.将基因功能组的先验知识纳入微阵列数据的正则化判别分析。
Bioinformatics. 2007 Dec 1;23(23):3170-7. doi: 10.1093/bioinformatics/btm488. Epub 2007 Oct 12.
5
Tumor classification ranking from microarray data.基于微阵列数据的肿瘤分类排名
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S21. doi: 10.1186/1471-2164-9-S2-S21.
6
Feature construction from synergic pairs to improve microarray-based classification.基于协同对的特征构建以改进基于微阵列的分类
Bioinformatics. 2007 Nov 1;23(21):2866-72. doi: 10.1093/bioinformatics/btm429. Epub 2007 Oct 9.
7
What should be expected from feature selection in small-sample settings.在小样本情况下,特征选择应达到什么预期效果。
Bioinformatics. 2006 Oct 1;22(19):2430-6. doi: 10.1093/bioinformatics/btl407. Epub 2006 Jul 26.
8
Minimum redundancy feature selection from microarray gene expression data.从微阵列基因表达数据中进行最小冗余特征选择。
J Bioinform Comput Biol. 2005 Apr;3(2):185-205. doi: 10.1142/s0219720005001004.
9
Selecting subsets of newly extracted features from PCA and PLS in microarray data analysis.在微阵列数据分析中从主成分分析(PCA)和偏最小二乘法(PLS)中选择新提取特征的子集。
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S24. doi: 10.1186/1471-2164-9-S2-S24.
10
Attribute clustering for grouping, selection, and classification of gene expression data.用于基因表达数据分组、选择和分类的属性聚类
IEEE/ACM Trans Comput Biol Bioinform. 2005 Apr-Jun;2(2):83-101. doi: 10.1109/TCBB.2005.17.

引用本文的文献

1
Comparison of analytical mathematical approaches for identifying key nuclear magnetic resonance spectroscopy biomarkers in the diagnosis and assessment of clinical change of diseases.比较分析数学方法,以鉴定在疾病的诊断和临床变化评估中的关键磁共振波谱标志物。
J Comp Neurol. 2010 Oct 15;518(20):4091-112. doi: 10.1002/cne.22365.
2
Genomics, molecular imaging, bioinformatics, and bio-nano-info integration are synergistic components of translational medicine and personalized healthcare research.基因组学、分子成像、生物信息学以及生物纳米信息整合是转化医学和个性化医疗研究的协同组成部分。
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):I1. doi: 10.1186/1471-2164-9-S2-I1.

本文引用的文献

1
The statistical sign test.统计符号检验。
J Am Stat Assoc. 1946 Dec;41(236):557-66. doi: 10.1080/01621459.1946.10501898.
2
Backward genotype-trait association (BGTA)-based dissection of complex traits in case-control designs.基于病例对照设计中反向基因型-性状关联(BGTA)的复杂性状剖析。
Hum Hered. 2006;62(4):196-212. doi: 10.1159/000096995. Epub 2006 Nov 15.
3
Pre-validation and inference in microarrays.微阵列中的预验证和推断
Stat Appl Genet Mol Biol. 2002;1:Article1. doi: 10.2202/1544-6115.1000. Epub 2002 Aug 22.
4
Analysis of recursive gene selection approaches from microarray data.基于微阵列数据的递归基因选择方法分析。
Bioinformatics. 2005 Oct 1;21(19):3741-7. doi: 10.1093/bioinformatics/bti618. Epub 2005 Aug 23.
5
Systematic benchmarking of microarray data classification: assessing the role of non-linearity and dimensionality reduction.微阵列数据分类的系统基准测试:评估非线性和降维的作用。
Bioinformatics. 2004 Nov 22;20(17):3185-95. doi: 10.1093/bioinformatics/bth383. Epub 2004 Jul 1.
6
Statistical tests for differential expression in cDNA microarray experiments.cDNA微阵列实验中差异表达的统计检验。
Genome Biol. 2003;4(4):210. doi: 10.1186/gb-2003-4-4-210. Epub 2003 Mar 17.
7
Backward Haplotype Transmission Association (BHTA) algorithm - a fast multiple-marker screening method.反向单倍型传递关联(BHTA)算法——一种快速的多标记筛选方法。
Hum Hered. 2002;53(4):197-215. doi: 10.1159/000066194.
8
Empirical bayes methods and false discovery rates for microarrays.微阵列的经验贝叶斯方法与错误发现率
Genet Epidemiol. 2002 Jun;23(1):70-86. doi: 10.1002/gepi.1124.
9
Feature extraction and normalization algorithms for high-density oligonucleotide gene expression array data.用于高密度寡核苷酸基因表达阵列数据的特征提取与归一化算法。
J Cell Biochem Suppl. 2001;Suppl 37:120-5. doi: 10.1002/jcb.10073.
10
Gene expression profiling predicts clinical outcome of breast cancer.基因表达谱分析可预测乳腺癌的临床预后。
Nature. 2002 Jan 31;415(6871):530-6. doi: 10.1038/415530a.