• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用网络滤波器对大规模生物数据进行去噪

Denoising large-scale biological data using network filters.

作者信息

Kavran Andrew J, Clauset Aaron

机构信息

Department of Biochemistry, University of Colorado, Boulder, CO, USA.

BioFrontiers Institute, University of Colorado, Boulder, CO, USA.

出版信息

BMC Bioinformatics. 2021 Mar 25;22(1):157. doi: 10.1186/s12859-021-04075-x.

DOI:10.1186/s12859-021-04075-x
PMID:33765911
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7992843/
Abstract

BACKGROUND

Large-scale biological data sets are often contaminated by noise, which can impede accurate inferences about underlying processes. Such measurement noise can arise from endogenous biological factors like cell cycle and life history variation, and from exogenous technical factors like sample preparation and instrument variation.

RESULTS

We describe a general method for automatically reducing noise in large-scale biological data sets. This method uses an interaction network to identify groups of correlated or anti-correlated measurements that can be combined or "filtered" to better recover an underlying biological signal. Similar to the process of denoising an image, a single network filter may be applied to an entire system, or the system may be first decomposed into distinct modules and a different filter applied to each. Applied to synthetic data with known network structure and signal, network filters accurately reduce noise across a wide range of noise levels and structures. Applied to a machine learning task of predicting changes in human protein expression in healthy and cancerous tissues, network filtering prior to training increases accuracy up to 43% compared to using unfiltered data.

CONCLUSIONS

Network filters are a general way to denoise biological data and can account for both correlation and anti-correlation between different measurements. Furthermore, we find that partitioning a network prior to filtering can significantly reduce errors in networks with heterogenous data and correlation patterns, and this approach outperforms existing diffusion based methods. Our results on proteomics data indicate the broad potential utility of network filters to applications in systems biology.

摘要

背景

大规模生物数据集常常受到噪声污染,这可能会妨碍对潜在过程的准确推断。这种测量噪声可能源于内源性生物因素,如细胞周期和生活史变异,也可能源于外源性技术因素,如样品制备和仪器变异。

结果

我们描述了一种自动降低大规模生物数据集噪声的通用方法。该方法使用相互作用网络来识别相关或反相关测量的组,这些组可以组合或“过滤”以更好地恢复潜在的生物信号。类似于对图像进行去噪的过程,可以将单个网络滤波器应用于整个系统,或者可以首先将系统分解为不同的模块,并对每个模块应用不同的滤波器。应用于具有已知网络结构和信号的合成数据时,网络滤波器能够在广泛的噪声水平和结构范围内准确地降低噪声。应用于预测健康组织和癌组织中人类蛋白质表达变化的机器学习任务时,与使用未过滤数据相比,训练前进行网络滤波可将准确率提高高达43%。

结论

网络滤波器是一种对生物数据进行去噪的通用方法,能够考虑不同测量之间的相关性和反相关性。此外,我们发现滤波前对网络进行划分可以显著减少具有异质数据和相关模式的网络中的误差,并且这种方法优于现有的基于扩散的方法。我们在蛋白质组学数据上的结果表明网络滤波器在系统生物学应用中具有广泛的潜在用途。

相似文献

1
Denoising large-scale biological data using network filters.使用网络滤波器对大规模生物数据进行去噪
BMC Bioinformatics. 2021 Mar 25;22(1):157. doi: 10.1186/s12859-021-04075-x.
2
Wavelet-domain TI Wiener-like filtering for complex MR data denoising.用于复杂磁共振数据去噪的小波域TI类维纳滤波
Magn Reson Imaging. 2016 Oct;34(8):1128-40. doi: 10.1016/j.mri.2016.05.011. Epub 2016 May 26.
3
High-field mr diffusion-weighted image denoising using a joint denoising convolutional neural network.基于联合去噪卷积神经网络的高场磁共振弥散加权图像去噪。
J Magn Reson Imaging. 2019 Dec;50(6):1937-1947. doi: 10.1002/jmri.26761. Epub 2019 Apr 22.
4
Improvement of image quality in PET using post-reconstruction hybrid spatial-frequency domain filtering.使用后重建混合空间频率域滤波提高 PET 图像质量。
Phys Med Biol. 2018 Oct 24;63(21):215010. doi: 10.1088/1361-6560/aae573.
5
Feature-Preserving Noise Removal.特征保持去噪。
IEEE Trans Med Imaging. 2015 Sep;34(9):1822-9. doi: 10.1109/TMI.2015.2409265. Epub 2015 Mar 6.
6
Iterative image-domain decomposition for dual-energy CT.双能CT的迭代图像域分解
Med Phys. 2014 Apr;41(4):041901. doi: 10.1118/1.4866386.
7
Ultra-low-dose CT image denoising using modified BM3D scheme tailored to data statistics.基于数据统计量身定制的改进 BM3D 方案在超低剂量 CT 图像去噪中的应用。
Med Phys. 2019 Jan;46(1):190-198. doi: 10.1002/mp.13252. Epub 2018 Nov 19.
8
Denoising of MR images with Rician noise using a wider neural network and noise range division.基于宽神经网络和噪声范围划分的瑞利噪声磁共振图像去噪。
Magn Reson Imaging. 2019 Dec;64:154-159. doi: 10.1016/j.mri.2019.05.042. Epub 2019 Jun 17.
9
Combined iterative reconstruction and image-domain decomposition for dual energy CT using total-variation regularization.使用全变差正则化的双能CT的联合迭代重建与图像域分解
Med Phys. 2014 May;41(5):051909. doi: 10.1118/1.4870375.
10
A Comprehensive Review on Nature Inspired Neural Network based Adaptive Filter for Eliminating Noise in Medical Images.基于自然启发神经网络的自适应滤波器在医学图像噪声消除中的综合评价
Curr Med Imaging. 2020;16(4):278-287. doi: 10.2174/1573405614666180801113345.

引用本文的文献

1
Medium-sized protein language models perform well at transfer learning on realistic datasets.中等规模的蛋白质语言模型在真实数据集上的迁移学习中表现良好。
Sci Rep. 2025 Jul 1;15(1):21400. doi: 10.1038/s41598-025-05674-x.
2
Scaling down for efficiency: Medium-sized protein language models perform well at transfer learning on realistic datasets.为提高效率而缩小规模:中型蛋白质语言模型在真实数据集的迁移学习中表现良好。
bioRxiv. 2025 Jan 28:2024.11.22.624936. doi: 10.1101/2024.11.22.624936.
3
Detecting and dissecting signaling crosstalk via the multilayer network integration of signaling and regulatory interactions.

本文引用的文献

1
Recording development with single cell dynamic lineage tracing.单细胞动态谱系追踪记录发育过程。
Development. 2019 Jun 27;146(12):dev169730. doi: 10.1242/dev.169730.
2
EMT Transition States during Tumor Progression and Metastasis.肿瘤进展和转移过程中的 EMT 过渡态。
Trends Cell Biol. 2019 Mar;29(3):212-226. doi: 10.1016/j.tcb.2018.12.001. Epub 2018 Dec 26.
3
Multiscale mixing patterns in networks.网络中的多尺度混合模式。
通过信号转导和调控相互作用的多层网络整合来检测和剖析信号串扰。
Nucleic Acids Res. 2024 Jan 11;52(1):e5. doi: 10.1093/nar/gkad1035.
4
Multiscale topology characterizes dynamic tumor vascular networks.多尺度拓扑结构表征动态肿瘤血管网络。
Sci Adv. 2022 Jun 10;8(23):eabm2456. doi: 10.1126/sciadv.abm2456.
Proc Natl Acad Sci U S A. 2018 Apr 17;115(16):4057-4062. doi: 10.1073/pnas.1713019115. Epub 2018 Apr 2.
4
netSmooth: Network-smoothing based imputation for single cell RNA-seq.netSmooth:基于网络平滑的单细胞RNA测序插补法
F1000Res. 2018 Jan 3;7:8. doi: 10.12688/f1000research.13511.3. eCollection 2018.
5
The ground truth about metadata and community detection in networks.网络中关于元数据和社区检测的真相。
Sci Adv. 2017 May 3;3(5):e1602548. doi: 10.1126/sciadv.1602548. eCollection 2017 May.
6
Precancer Atlas to Drive Precision Prevention Trials.癌前图谱推动精准预防试验。
Cancer Res. 2017 Apr 1;77(7):1510-1541. doi: 10.1158/0008-5472.CAN-16-2346.
7
Building a lineage from single cells: genetic techniques for cell lineage tracking.从单细胞构建细胞谱系:细胞谱系追踪的遗传技术
Nat Rev Genet. 2017 Apr;18(4):230-244. doi: 10.1038/nrg.2016.159. Epub 2017 Jan 23.
8
Elucidation of Signaling Pathways from Large-Scale Phosphoproteomic Data Using Protein Interaction Networks.利用蛋白质相互作用网络从大规模磷酸化蛋白质组学数据中阐明信号通路
Cell Syst. 2016 Dec 21;3(6):585-593.e3. doi: 10.1016/j.cels.2016.11.005.
9
Structure and inference in annotated networks.带注释网络中的结构和推理。
Nat Commun. 2016 Jun 16;7:11863. doi: 10.1038/ncomms11863.
10
Avoiding common pitfalls when clustering biological data.聚类生物数据时避免常见陷阱。
Sci Signal. 2016 Jun 14;9(432):re6. doi: 10.1126/scisignal.aad1932.