• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种线性时间算法,该算法避免求逆,并能像在可交换半群中进行卷积或其他算子运算一样计算留一法(Jackknife)乘积。

A linear-time algorithm that avoids inverses and computes Jackknife (leave-one-out) products like convolutions or other operators in commutative semigroups.

作者信息

Spouge John L, Ziegelbauer Joseph M, Gonzalez Mileidy

机构信息

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Room 6N603, Building 38A, Bethesda, MD 20894 USA.

HIV and AIDS Malignancy Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892 USA.

出版信息

Algorithms Mol Biol. 2020 Sep 19;15:17. doi: 10.1186/s13015-020-00178-x. eCollection 2020.

DOI:10.1186/s13015-020-00178-x
PMID:32968428
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7502207/
Abstract

BACKGROUND

Data about herpesvirus microRNA motifs on human circular RNAs suggested the following statistical question. Consider independent random counts, not necessarily identically distributed. Conditioned on the sum, decide whether one of the counts is unusually large. Exact computation of the pvalue leads to a specific algorithmic problem. Given elements in a set with the closure and associative properties and a commutative product without inverses, compute the jackknife (leave-one-out) products ( ).

RESULTS

This article gives a linear-time Jackknife Product algorithm. Its upward phase constructs a standard segment tree for computing segment products like ; its novel downward phase mirrors the upward phase while exploiting the symmetry of and its complement . The algorithm requires storage for elements of and only about products. In contrast, the standard segment tree algorithms require about products for construction and products for calculating each , i.e., about products in total; and a naïve quadratic algorithm using element-by-element products to compute each requires products.

CONCLUSIONS

In the herpesvirus application, the Jackknife Product algorithm required 15 min; standard segment tree algorithms would have taken an estimated 3 h; and the quadratic algorithm, an estimated 1 month. The Jackknife Product algorithm has many possible uses in bioinformatics and statistics.

摘要

背景

关于人类环状RNA上疱疹病毒微小RNA基序的数据提出了以下统计学问题。考虑独立的随机计数,不一定是同分布的。在总和的条件下,判断其中一个计数是否异常大。p值的精确计算导致了一个特定的算法问题。给定一个具有封闭性和结合性且乘法可交换但无逆元的集合中的元素,计算留一法(Jackknife)乘积( )。

结果

本文给出了一种线性时间的留一法乘积算法。其向上阶段构建一个标准段树来计算诸如 的段乘积;其新颖的向下阶段在利用 及其补集 的对称性的同时镜像向上阶段。该算法需要存储 中的 个元素和仅约 个乘积。相比之下,标准段树算法构建时需要约 个乘积,计算每个 时需要 个乘积,即总共约 个乘积;而使用逐个元素乘积来计算每个 的朴素二次算法需要 个乘积。

结论

在疱疹病毒应用中,留一法乘积算法耗时15分钟;标准段树算法估计需要3小时;二次算法估计需要1个月。留一法乘积算法在生物信息学和统计学中有许多可能的用途。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/544a/7502207/1ce5b0e2830b/13015_2020_178_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/544a/7502207/e95c84cf97f9/13015_2020_178_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/544a/7502207/e0e7d0fae5ca/13015_2020_178_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/544a/7502207/1ce5b0e2830b/13015_2020_178_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/544a/7502207/e95c84cf97f9/13015_2020_178_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/544a/7502207/e0e7d0fae5ca/13015_2020_178_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/544a/7502207/1ce5b0e2830b/13015_2020_178_Fig3_HTML.jpg

相似文献

1
A linear-time algorithm that avoids inverses and computes Jackknife (leave-one-out) products like convolutions or other operators in commutative semigroups.一种线性时间算法,该算法避免求逆,并能像在可交换半群中进行卷积或其他算子运算一样计算留一法(Jackknife)乘积。
Algorithms Mol Biol. 2020 Sep 19;15:17. doi: 10.1186/s13015-020-00178-x. eCollection 2020.
2
Selection on  +   + ⋯ +  via Cartesian product trees.通过笛卡尔积树对 + + ⋯ + 进行选择。
PeerJ Comput Sci. 2021 Apr 28;7:e483. doi: 10.7717/peerj-cs.483. eCollection 2021.
3
A two-stage pruning algorithm for likelihood computation for a population tree.一种用于群体树似然计算的两阶段剪枝算法。
Genetics. 2008 Oct;180(2):1095-105. doi: 10.1534/genetics.107.085753. Epub 2008 Sep 9.
4
Distortion of ethyne on formation of a π complex with silver chloride: C2H2⋯Ag-Cl characterised by rotational spectroscopy and ab initio calculations.乙炔在与氯化银形成π络合物时的扭曲:通过旋转光谱和从头算计算表征的 C2H2⋯Ag-Cl。
J Chem Phys. 2012 Nov 7;137(17):174302. doi: 10.1063/1.4761895.
5
Approximating large convolutions in digital images.在数字图像中近似大卷积
IEEE Trans Image Process. 2001;10(12):1826-35. doi: 10.1109/83.974567.
6
Absolutely closed semigroups.绝对封闭半群
Rev R Acad Cienc Exactas Fis Nat A Mat. 2024;118(1):23. doi: 10.1007/s13398-023-01519-2. Epub 2023 Nov 9.
7
Evaluation of C-H cdots, three dots, centered O hydrogen bonds in native and misfolded proteins.天然和错误折叠蛋白质中C-H…O氢键的评估。
J Mol Biol. 2002 Sep 20;322(3):497-503. doi: 10.1016/s0022-2836(02)00785-4.
8
A family of 512 reverse order laws for generalized inverses of a matrix product: A review.矩阵乘积广义逆的512个反序律族综述。
Heliyon. 2020 Sep 28;6(9):e04924. doi: 10.1016/j.heliyon.2020.e04924. eCollection 2020 Sep.
9
Exact p-value calculation for heterotypic clusters of regulatory motifs and its application in computational annotation of cis-regulatory modules.调控基序异型簇的确切p值计算及其在顺式调控模块计算注释中的应用。
Algorithms Mol Biol. 2007 Oct 10;2:13. doi: 10.1186/1748-7188-2-13.
10
A simple and space efficient segment tree implementation.一种简单且节省空间的线段树实现。
MethodsX. 2019 Mar 2;6:500-512. doi: 10.1016/j.mex.2019.02.028. eCollection 2019.

本文引用的文献

1
Discovery of Kaposi's sarcoma herpesvirus-encoded circular RNAs and a human antiviral circular RNA.卡波氏肉瘤相关疱疹病毒编码的环状 RNA 的发现和一种人类抗病毒环状 RNA。
Proc Natl Acad Sci U S A. 2018 Dec 11;115(50):12805-12810. doi: 10.1073/pnas.1816183115. Epub 2018 Nov 19.
2
Oncodomains: A protein domain-centric framework for analyzing rare variants in tumor samples.癌基因结构域:一种以蛋白质结构域为中心的框架,用于分析肿瘤样本中的罕见变异。
PLoS Comput Biol. 2017 Apr 20;13(4):e1005428. doi: 10.1371/journal.pcbi.1005428. eCollection 2017 Apr.
3
FAST: Fourier transform based algorithms for significance testing of ungapped multiple alignments.
FAST:基于傅里叶变换的无间隙多序列比对显著性检验算法
Bioinformatics. 2008 Feb 15;24(4):577-8. doi: 10.1093/bioinformatics/btm594. Epub 2008 Jan 6.
4
Note on an exact treatment of contingency, goodness of fit and other problems of significance.关于列联表、拟合优度及其他显著性问题的精确处理的注释
Biometrika. 1951 Jun;38(1-2):141-9.