• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Disease gene prioritization using network and feature.利用网络和特征对疾病基因进行优先级排序。
J Comput Biol. 2015 Apr;22(4):313-23. doi: 10.1089/cmb.2015.0001.
2
Disease gene prioritization by integrating tissue-specific molecular networks using a robust multi-network model.使用稳健的多网络模型整合组织特异性分子网络进行疾病基因优先级排序。
BMC Bioinformatics. 2016 Nov 10;17(1):453. doi: 10.1186/s12859-016-1317-x.
3
Candidate gene prioritization with Endeavour.使用Endeavour进行候选基因优先级排序。
Nucleic Acids Res. 2016 Jul 8;44(W1):W117-21. doi: 10.1093/nar/gkw365. Epub 2016 Apr 30.
4
A Bayesian framework to integrate multi-level genome-scale data for Autism risk gene prioritization.一种贝叶斯框架,用于整合多层次全基因组数据以进行自闭症风险基因优先级排序。
BMC Bioinformatics. 2022 Apr 22;23(1):146. doi: 10.1186/s12859-022-04616-y.
5
"Guilt by association" is not competitive with genetic association for identifying autism risk genes.“关联有罪”在鉴定自闭症风险基因方面并不能与遗传关联相竞争。
Sci Rep. 2021 Aug 5;11(1):15950. doi: 10.1038/s41598-021-95321-y.
6
PANDA: Prioritization of autism-genes using network-based deep-learning approach.基于网络的深度学习方法对自闭症基因进行优先级排序。
Genet Epidemiol. 2020 Jun;44(4):382-394. doi: 10.1002/gepi.22282. Epub 2020 Feb 10.
7
An ensemble rank learning approach for gene prioritization.一种用于基因优先级排序的集成排序学习方法。
Annu Int Conf IEEE Eng Med Biol Soc. 2013;2013:3507-10. doi: 10.1109/EMBC.2013.6610298.
8
A random set scoring model for prioritization of disease candidate genes using protein complexes and data-mining of GeneRIF, OMIM and PubMed records.一种用于利用蛋白质复合物以及对基因功能信息(GeneRIF)、在线人类孟德尔遗传(OMIM)和医学期刊数据库(PubMed)记录进行数据挖掘来对疾病候选基因进行优先级排序的随机集评分模型。
BMC Bioinformatics. 2014 Sep 24;15(1):315. doi: 10.1186/1471-2105-15-315.
9
Prioritization of candidate disease genes by enlarging the seed set and fusing information of the network topology and gene expression.通过扩大种子集并融合网络拓扑结构和基因表达信息来对候选疾病基因进行优先级排序。
Mol Biosyst. 2014 Jun;10(6):1400-8. doi: 10.1039/c3mb70588a. Epub 2014 Apr 3.
10
Candidate gene prioritization.候选基因优先级排序。
Mol Genet Genomics. 2012 Sep;287(9):679-98. doi: 10.1007/s00438-012-0710-z. Epub 2012 Aug 15.

引用本文的文献

1
Ranking Plant Network Nodes Based on Their Centrality Measures.基于中心性度量对植物网络节点进行排名。
Entropy (Basel). 2023 Apr 18;25(4):676. doi: 10.3390/e25040676.
2
WINNER: A network biology tool for biomolecular characterization and prioritization.WINNER:一种用于生物分子表征和优先级排序的网络生物学工具。
Front Big Data. 2022 Nov 4;5:1016606. doi: 10.3389/fdata.2022.1016606. eCollection 2022.
3
Universal concept signature analysis: genome-wide quantification of new biological and pathological functions of genes and pathways.通用概念特征分析:基因和途径新的生物学和病理学功能的全基因组定量分析。
Brief Bioinform. 2020 Sep 25;21(5):1717-1732. doi: 10.1093/bib/bbz093.
4
MGOGP: a gene module-based heuristic algorithm for cancer-related gene prioritization.MGOGP:基于基因模块的癌症相关基因优先级启发式算法。
BMC Bioinformatics. 2018 Jun 5;19(1):215. doi: 10.1186/s12859-018-2216-0.
5
Chemokine expression in the early response to injury in human airway epithelial cells.趋化因子在人呼吸道上皮细胞损伤早期反应中的表达。
PLoS One. 2018 Mar 13;13(3):e0193334. doi: 10.1371/journal.pone.0193334. eCollection 2018.
6
A large-scale benchmark of gene prioritization methods.大规模基因优先级方法基准测试。
Sci Rep. 2017 Apr 21;7:46598. doi: 10.1038/srep46598.
7
Lynx: a knowledge base and an analytical workbench for integrative medicine.Lynx:一个用于整合医学的知识库和分析工作台。
Nucleic Acids Res. 2016 Jan 4;44(D1):D882-7. doi: 10.1093/nar/gkv1257. Epub 2015 Nov 20.

本文引用的文献

1
Lynx: a database and knowledge extraction engine for integrative medicine.灵奇:一个用于整合医学的数据库和知识提取引擎。
Nucleic Acids Res. 2014 Jan;42(Database issue):D1007-12. doi: 10.1093/nar/gkt1166. Epub 2013 Nov 21.
2
The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data.人类表型本体论项目:通过表型数据将分子生物学和疾病联系起来。
Nucleic Acids Res. 2014 Jan;42(Database issue):D966-74. doi: 10.1093/nar/gkt1026. Epub 2013 Nov 11.
3
An unbiased evaluation of gene prioritization tools.基因优先级工具的无偏评估。
Bioinformatics. 2012 Dec 1;28(23):3081-8. doi: 10.1093/bioinformatics/bts581. Epub 2012 Oct 9.
4
Fragile X and X-linked intellectual disability: four decades of discovery.脆性 X 综合征与 X 连锁智力障碍:四十年的探索历程。
Am J Hum Genet. 2012 Apr 6;90(4):579-90. doi: 10.1016/j.ajhg.2012.02.018.
5
Genetic and epigenetic networks in intellectual disabilities.智力障碍的遗传和表观遗传网络。
Annu Rev Genet. 2011;45:81-104. doi: 10.1146/annurev-genet-110410-132512. Epub 2011 Sep 9.
6
PINTA: a web server for network-based gene prioritization from expression data.PINTA:一个基于网络的从表达数据进行基因优先级排序的网络服务器。
Nucleic Acids Res. 2011 Jul;39(Web Server issue):W334-8. doi: 10.1093/nar/gkr289. Epub 2011 May 20.
7
A guide to web tools to prioritize candidate genes.候选基因优先级排序的网络工具指南
Brief Bioinform. 2011 Jan;12(1):22-32. doi: 10.1093/bib/bbq007. Epub 2010 Mar 21.
8
The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored.2011年的STRING数据库:蛋白质的功能相互作用网络,全球整合并评分。
Nucleic Acids Res. 2011 Jan;39(Database issue):D561-8. doi: 10.1093/nar/gkq973. Epub 2010 Nov 2.
9
Computational solutions to large-scale data management and analysis.大规模数据管理和分析的计算解决方案。
Nat Rev Genet. 2010 Sep;11(9):647-57. doi: 10.1038/nrg2857.
10
Disease candidate gene identification and prioritization using protein interaction networks.利用蛋白质相互作用网络进行疾病候选基因的识别与优先级排序。
BMC Bioinformatics. 2009 Feb 27;10:73. doi: 10.1186/1471-2105-10-73.

利用网络和特征对疾病基因进行优先级排序。

Disease gene prioritization using network and feature.

作者信息

Xie Bingqing, Agam Gady, Balasubramanian Sandhya, Xu Jinbo, Gilliam T Conrad, Maltsev Natalia, Börnigen Daniela

机构信息

1 Department of Computer Science, Illinois Institute of Technology , Chicago, Illinois.

出版信息

J Comput Biol. 2015 Apr;22(4):313-23. doi: 10.1089/cmb.2015.0001.

DOI:10.1089/cmb.2015.0001
PMID:25844670
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4808289/
Abstract

Identifying high-confidence candidate genes that are causative for disease phenotypes, from the large lists of variations produced by high-throughput genomics, can be both time-consuming and costly. The development of novel computational approaches, utilizing existing biological knowledge for the prioritization of such candidate genes, can improve the efficiency and accuracy of the biomedical data analysis. It can also reduce the cost of such studies by avoiding experimental validations of irrelevant candidates. In this study, we address this challenge by proposing a novel gene prioritization approach that ranks promising candidate genes that are likely to be involved in a disease or phenotype under study. This algorithm is based on the modified conditional random field (CRF) model that simultaneously makes use of both gene annotations and gene interactions, while preserving their original representation. We validated our approach on two independent disease benchmark studies by ranking candidate genes using network and feature information. Our results showed both high area under the curve (AUC) value (0.86), and more importantly high partial AUC (pAUC) value (0.1296), and revealed higher accuracy and precision at the top predictions as compared with other well-performed gene prioritization tools, such as Endeavour (AUC-0.82, pAUC-0.083) and PINTA (AUC-0.76, pAUC-0.066). We were able to detect more target genes (9/18/19/27) on top positions (1/5/10/20) compared to Endeavour (3/11/14/23) and PINTA (6/10/13/18). To demonstrate its usability, we applied our method to a case study for the prediction of molecular mechanisms contributing to intellectual disability and autism. Our approach was able to correctly recover genes related to both disorders and provide suggestions for possible additional candidates based on their rankings and functional annotations.

摘要

从高通量基因组学产生的大量变异列表中识别导致疾病表型的高可信度候选基因既耗时又昂贵。利用现有生物学知识对这类候选基因进行优先级排序的新型计算方法的开发,可以提高生物医学数据分析的效率和准确性。它还可以通过避免对无关候选基因进行实验验证来降低此类研究的成本。在本研究中,我们通过提出一种新型基因优先级排序方法来应对这一挑战,该方法对可能参与所研究疾病或表型的有前景的候选基因进行排名。该算法基于改进的条件随机场(CRF)模型,该模型同时利用基因注释和基因相互作用,同时保留它们的原始表示。我们通过使用网络和特征信息对候选基因进行排名,在两项独立的疾病基准研究中验证了我们的方法。我们的结果显示曲线下面积(AUC)值较高(0.86),更重要的是部分AUC(pAUC)值较高(0.1296),并且与其他表现良好的基因优先级排序工具(如Endeavour(AUC - 0.82,pAUC - 0.083)和PINTA(AUC - 0.76,pAUC - 0.066))相比,在顶部预测中显示出更高的准确性和精确性。与Endeavour(3/11/14/23)和PINTA(6/10/13/18)相比,我们能够在顶部位置(1/5/10/20)检测到更多的靶基因(9/18/19/27)。为了证明其可用性,我们将我们的方法应用于一个案例研究,以预测导致智力残疾和自闭症的分子机制。我们的方法能够正确地找回与这两种疾病相关联的基因,并根据它们的排名和功能注释为可能的其他候选基因提供建议。