• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

欧几里得距离测量和主成分分析在基因识别中的应用。

Application of Euclidean distance measurement and principal component analysis for gene identification.

作者信息

Ghosh Antara, Barman Soma

机构信息

Institute of Radio Physics and Electronics, University of Calcutta, 92, APC Road, Kolkata 700009, India.

Institute of Radio Physics and Electronics, University of Calcutta, 92, APC Road, Kolkata 700009, India.

出版信息

Gene. 2016 Jun 1;583(2):112-120. doi: 10.1016/j.gene.2016.02.015. Epub 2016 Feb 11.

DOI:10.1016/j.gene.2016.02.015
PMID:26877227
Abstract

Gene systems are extremely complex, heterogeneous, and noisy in nature. Many statistical tools which are used to extract relevant feature from genes provide fuzzy and ambiguous information. High-dimensional gene expression database available in public domain usually contains thousands of genes. Efficient prediction method is demanding nowadays for accurate identification of such database. Euclidean distance measurement and principal component analysis methods are applied on such databases to identify the genes. In both methods, prediction algorithm is based on homology search approach. Digital Signal Processing technique along with statistical method is used for analysis of genes in both cases. A two-level decision logic is used for gene classification as healthy or cancerous. This binary logic minimizes the prediction error and improves prediction accuracy. Superiority of the method is judged by receiver operating characteristic curve.

摘要

基因系统本质上极其复杂、异质且具有噪声。许多用于从基因中提取相关特征的统计工具提供的信息模糊且不明确。公共领域中可用的高维基因表达数据库通常包含数千个基因。如今,对于准确识别此类数据库,高效的预测方法很有必要。欧几里得距离测量和主成分分析方法被应用于此类数据库以识别基因。在这两种方法中,预测算法都基于同源性搜索方法。在这两种情况下,数字信号处理技术与统计方法一起用于基因分析。使用两级决策逻辑将基因分类为健康或癌变。这种二元逻辑可将预测误差降至最低并提高预测准确性。该方法的优越性通过接收者操作特征曲线来判断。

相似文献

1
Application of Euclidean distance measurement and principal component analysis for gene identification.欧几里得距离测量和主成分分析在基因识别中的应用。
Gene. 2016 Jun 1;583(2):112-120. doi: 10.1016/j.gene.2016.02.015. Epub 2016 Feb 11.
2
Hierarchical gene selection and genetic fuzzy system for cancer microarray data classification.用于癌症微阵列数据分类的分层基因选择与遗传模糊系统
PLoS One. 2015 Mar 30;10(3):e0120364. doi: 10.1371/journal.pone.0120364. eCollection 2015.
3
Automated Detection of Cancer Associated Genes Using a Combined Fuzzy-Rough-Set-Based F-Information and Water Swirl Algorithm of Human Gene Expression Data.基于模糊粗糙集的F信息与人类基因表达数据的水漩涡算法相结合自动检测癌症相关基因
PLoS One. 2016 Dec 9;11(12):e0167504. doi: 10.1371/journal.pone.0167504. eCollection 2016.
4
Improved gene prediction by principal component analysis based autoregressive Yule-Walker method.
Gene. 2016 Jan 10;575(2 Pt 2):488-497. doi: 10.1016/j.gene.2015.09.023. Epub 2015 Sep 16.
5
Improving gene expression cancer molecular pattern discovery using nonnegative principal component analysis.使用非负主成分分析改进基因表达癌症分子模式发现
Genome Inform. 2008;21:200-11.
6
Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection.监督式、无监督式和半监督式特征选择:基因选择综述
IEEE/ACM Trans Comput Biol Bioinform. 2016 Sep-Oct;13(5):971-989. doi: 10.1109/TCBB.2015.2478454. Epub 2015 Sep 14.
7
New feature selection for gene expression classification based on degree of class overlap in principal dimensions.基于主成分中类重叠程度的基因表达分类的新特征选择。
Comput Biol Med. 2015 Sep;64:292-8. doi: 10.1016/j.compbiomed.2015.01.022. Epub 2015 Feb 7.
8
Nonlinear dimensionality reduction of gene expression data for visualization and clustering analysis of cancer tissue samples.基因表达数据的非线性维数降低,用于癌症组织样本的可视化和聚类分析。
Comput Biol Med. 2010 Aug;40(8):723-32. doi: 10.1016/j.compbiomed.2010.06.007. Epub 2010 Jul 16.
9
Robust Principal Component Analysis Regularized by Truncated Nuclear Norm for Identifying Differentially Expressed Genes.通过截断核范数正则化的稳健主成分分析用于识别差异表达基因
IEEE Trans Nanobioscience. 2017 Sep;16(6):447-454. doi: 10.1109/TNB.2017.2723439. Epub 2017 Jul 4.
10
A case-based reasoning system based on weighted heterogeneous value distance metric for breast cancer diagnosis.一种基于加权异构值距离度量的乳腺癌诊断案例推理系统。
Artif Intell Med. 2017 Mar;77:31-47. doi: 10.1016/j.artmed.2017.02.003. Epub 2017 Feb 11.

引用本文的文献

1
Cancer Cell's Achilles Heels: Considerations for Design of Anti-Cancer Drug Combinations.癌细胞的致命弱点:抗癌药物联合设计的考量
Int J Mol Sci. 2024 Dec 17;25(24):13495. doi: 10.3390/ijms252413495.
2
Transcriptional profiles reveal histologic origin and prognosis across 33 The Cancer Genome Atlas tumor types.转录谱揭示了33种癌症基因组图谱肿瘤类型的组织学起源和预后。
Transl Cancer Res. 2023 Oct 31;12(10):2764-2780. doi: 10.21037/tcr-23-234. Epub 2023 Sep 20.
3
An Intelligent Sorting Method of Film in Cotton Combining Hyperspectral Imaging and the AlexNet-PCA Algorithm.
一种结合高光谱成像与AlexNet-PCA算法的棉花中薄膜智能分拣方法。
Sensors (Basel). 2023 Aug 9;23(16):7041. doi: 10.3390/s23167041.
4
Integrative Analysis of Inflammatory Response-Related Gene for Predicting Prognosis and Immunotherapy in Glioma.基于炎症反应相关基因的综合分析预测胶质瘤的预后和免疫治疗。
J Mol Neurosci. 2023 Aug;73(7-8):608-627. doi: 10.1007/s12031-023-02142-x. Epub 2023 Jul 25.
5
Unique Metabolic Contexts Sensitize Cancer Cells and Discriminate between Glycolytic Tumor Types.独特的代谢环境使癌细胞敏感,并区分糖酵解肿瘤类型。
Cancers (Basel). 2023 Feb 11;15(4):1158. doi: 10.3390/cancers15041158.
6
Using deep learning to detect digitally encoded DNA trigger for Trojan malware in Bio-Cyber attacks.利用深度学习检测生物网络攻击中的木马恶意软件的数字编码 DNA 触发器。
Sci Rep. 2022 Jun 10;12(1):9631. doi: 10.1038/s41598-022-13700-5.
7
DNA methylation-based profiling reveals distinct clusters with survival heterogeneity in high-grade serous ovarian cancer.基于 DNA 甲基化的分析揭示了高级别浆液性卵巢癌中具有生存异质性的不同聚类。
Clin Epigenetics. 2021 Oct 13;13(1):190. doi: 10.1186/s13148-021-01178-3.
8
Prediction of gene expression under drought stress in spring wheat using codon usage pattern.利用密码子使用模式预测春小麦干旱胁迫下的基因表达
Saudi J Biol Sci. 2021 Jul;28(7):4000-4004. doi: 10.1016/j.sjbs.2021.04.015. Epub 2021 Apr 20.
9
Classification of Homo sapiens gene behavior using linear discriminant analysis fused with minimum entropy mapping.利用线性判别分析与最小熵映射融合的方法对人类基因行为进行分类。
Med Biol Eng Comput. 2021 Mar;59(3):673-691. doi: 10.1007/s11517-021-02324-y. Epub 2021 Feb 17.
10
RWRNET: A Gene Regulatory Network Inference Algorithm Using Random Walk With Restart.RWRNET:一种使用带重启的随机游走的基因调控网络推理算法
Front Genet. 2020 Sep 25;11:591461. doi: 10.3389/fgene.2020.591461. eCollection 2020.