• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TopKLists:一个用于多组学排名列表的统计推断、随机聚合和可视化的综合R包。

TopKLists: a comprehensive R package for statistical inference, stochastic aggregation, and visualization of multiple omics ranked lists.

作者信息

Schimek Michael G, Budinská Eva, Kugler Karl G, Švendová Vendula, Ding Jie, Lin Shili

出版信息

Stat Appl Genet Mol Biol. 2015 Jun;14(3):311-6. doi: 10.1515/sagmb-2014-0093.

DOI:10.1515/sagmb-2014-0093
PMID:25968440
Abstract

High-throughput sequencing techniques are increasingly affordable and produce massive amounts of data. Together with other high-throughput technologies, such as microarrays, there are an enormous amount of resources in databases. The collection of these valuable data has been routine for more than a decade. Despite different technologies, many experiments share the same goal. For instance, the aims of RNA-seq studies often coincide with those of differential gene expression experiments based on microarrays. As such, it would be logical to utilize all available data. However, there is a lack of biostatistical tools for the integration of results obtained from different technologies. Although diverse technological platforms produce different raw data, one commonality for experiments with the same goal is that all the outcomes can be transformed into a platform-independent data format - rankings - for the same set of items. Here we present the R package TopKLists, which allows for statistical inference on the lengths of informative (top-k) partial lists, for stochastic aggregation of full or partial lists, and for graphical exploration of the input and consolidated output. A graphical user interface has also been implemented for providing access to the underlying algorithms. To illustrate the applicability and usefulness of the package, we integrated microRNA data of non-small cell lung cancer across different measurement techniques and draw conclusions. The package can be obtained from CRAN under a LGPL-3 license.

摘要

高通量测序技术越来越经济实惠,且能产生大量数据。与其他高通量技术(如微阵列)一起,数据库中有大量资源。收集这些有价值的数据已经有十多年的常规操作了。尽管技术不同,但许多实验有着相同的目标。例如,RNA测序研究的目的常常与基于微阵列的差异基因表达实验的目的一致。因此,利用所有可用数据是合乎逻辑的。然而,缺乏用于整合从不同技术获得的结果的生物统计学工具。尽管不同的技术平台产生不同的原始数据,但对于具有相同目标的实验来说,一个共同点是所有结果都可以转换为一种与平台无关的数据格式——排名——针对同一组项目。在这里,我们展示了R包TopKLists,它允许对信息丰富的(前k个)部分列表的长度进行统计推断,对完整或部分列表进行随机汇总,并对输入和合并输出进行图形化探索。还实现了一个图形用户界面以提供对底层算法的访问。为了说明该包的适用性和实用性,我们整合了不同测量技术下非小细胞肺癌的 microRNA 数据并得出结论。该包可在LGPL - 3许可下从CRAN获取。

相似文献

1
TopKLists: a comprehensive R package for statistical inference, stochastic aggregation, and visualization of multiple omics ranked lists.TopKLists:一个用于多组学排名列表的统计推断、随机聚合和可视化的综合R包。
Stat Appl Genet Mol Biol. 2015 Jun;14(3):311-6. doi: 10.1515/sagmb-2014-0093.
2
Integration of ranked lists via cross entropy Monte Carlo with applications to mRNA and microRNA Studies.通过交叉熵蒙特卡洛方法整合排名列表及其在mRNA和微小RNA研究中的应用
Biometrics. 2009 Mar;65(1):9-18. doi: 10.1111/j.1541-0420.2008.01044.x. Epub 2008 May 13.
3
Differential correlation for sequencing data.测序数据的差异相关性
BMC Res Notes. 2017 Jan 19;10(1):54. doi: 10.1186/s13104-016-2331-9.
4
SILGGM: An extensive R package for efficient statistical inference in large-scale gene networks.SILGGM:一个用于大规模基因网络中高效统计推断的扩展 R 包。
PLoS Comput Biol. 2018 Aug 13;14(8):e1006369. doi: 10.1371/journal.pcbi.1006369. eCollection 2018 Aug.
5
Stability and aggregation of ranked gene lists.排名基因列表的稳定性和聚集性。
Brief Bioinform. 2009 Sep;10(5):556-68. doi: 10.1093/bib/bbp034.
6
Pathway-based visualization of cross-platform microarray datasets.基于通路的跨平台微阵列数据集可视化。
Bioinformatics. 2012 Dec 1;28(23):3021-6. doi: 10.1093/bioinformatics/bts583. Epub 2012 Oct 9.
7
InterferenceAnalyzer: tools for the analysis and simulation of multi-locus genetic data.干扰分析器:用于多位点遗传数据的分析与模拟的工具。
BMC Bioinformatics. 2005 Dec 12;6:297. doi: 10.1186/1471-2105-6-297.
8
OmicsARules: a R package for integration of multi-omics datasets via association rules mining.OmicsARules:一个通过关联规则挖掘整合多组学数据集的 R 包。
BMC Bioinformatics. 2019 Nov 8;20(1):554. doi: 10.1186/s12859-019-3171-0.
9
MILANO--custom annotation of microarray results using automatic literature searches.米兰——使用自动文献检索对微阵列结果进行定制注释。
BMC Bioinformatics. 2005 Jan 20;6:12. doi: 10.1186/1471-2105-6-12.
10
RCPA: An Open-Source R Package for Data Processing, Differential Analysis, Consensus Pathway Analysis, and Visualization.RCPA:一个用于数据处理、差异分析、共识通路分析和可视化的开源 R 包。
Curr Protoc. 2024 May;4(5):e1036. doi: 10.1002/cpz1.1036.

引用本文的文献

1
Pharmacological inhibition of TBK1/IKKε blunts immunopathology in a murine model of SARS-CoV-2 infection.药物抑制 TBK1/IKKε 可减轻 SARS-CoV-2 感染小鼠模型中的免疫病理。
Nat Commun. 2023 Sep 18;14(1):5666. doi: 10.1038/s41467-023-41381-9.
2
Splicing signature database development to delineate cancer pathways using literature mining and transcriptome machine learning.利用文献挖掘和转录组机器学习开发剪接特征数据库以描绘癌症通路。
Comput Struct Biotechnol J. 2023 Mar 2;21:1978-1988. doi: 10.1016/j.csbj.2023.02.052. eCollection 2023.
3
Transcriptomic pan-cancer analysis using rank-based Bayesian inference.
基于排名的贝叶斯推断的泛癌转录组分析。
Mol Oncol. 2023 Apr;17(4):548-563. doi: 10.1002/1878-0261.13354. Epub 2023 Jan 23.
4
Toxicogenomics Data for Chemical Safety Assessment and Development of New Approach Methodologies: An Adverse Outcome Pathway-Based Approach.毒理基因组学数据在化学品安全评估和新方法开发中的应用:基于不良结局路径的方法。
Adv Sci (Weinh). 2023 Jan;10(2):e2203984. doi: 10.1002/advs.202203984. Epub 2022 Dec 7.
5
Integrated Network Pharmacology Approach for Drug Combination Discovery: A Multi-Cancer Case Study.基于整合网络药理学方法的药物组合发现:多癌种案例研究
Cancers (Basel). 2022 Apr 18;14(8):2043. doi: 10.3390/cancers14082043.
6
Nextcast: A software suite to analyse and model toxicogenomics data.Nextcast:一个用于分析和建模毒理基因组学数据的软件套件。
Comput Struct Biotechnol J. 2022 Mar 18;20:1413-1426. doi: 10.1016/j.csbj.2022.03.014. eCollection 2022.
7
Ontology-based identification and prioritization of candidate drugs for epilepsy from literature.基于本体的文献中癫痫候选药物的识别和优先级排序。
J Biomed Semantics. 2022 Jan 24;13(1):3. doi: 10.1186/s13326-021-00258-w.
8
Computationally prioritized drugs inhibit SARS-CoV-2 infection and syncytia formation.计算优先药物抑制 SARS-CoV-2 感染和合胞体形成。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab507.
9
Inferring a consensus problem list using penalized multistage models for ordered data.使用有序数据的惩罚多阶段模型推断共识问题列表。
Ann Appl Stat. 2020 Sep;14(3):1557-1580. doi: 10.1214/20-aoas1361. Epub 2020 Sep 18.
10
The brain on cocaine at single-cell resolution.单细胞分辨率下可卡因作用的大脑。
Genome Res. 2021 Oct;31(10):1927-1937. doi: 10.1101/gr.268037.120. Epub 2021 May 25.