通过分析 DNA 编码库筛选中的重复样本来理解数据噪声和不确定性。

Understanding Data Noise and Uncertainty through Analysis of Replicate Samples in DNA-Encoded Library Selection.

机构信息

Simulation and Modelling Sciences, Pfizer Inc., Groton, Connecticut 06340, United States.

Discovery Sciences, Pfizer Inc., Groton, Connecticut 06340, United States.

出版信息

J Chem Inf Model. 2022 May 9;62(9):2239-2247. doi: 10.1021/acs.jcim.1c00986. Epub 2021 Dec 4.

DOI:10.1021/acs.jcim.1c00986

PMID:34865473

Abstract

By analyzing data sets of replicate DNA-Encoded Library (DEL) selections, an approach for estimating the noise level of the experiment has been developed. Using a logarithm transformation of the number of counts associated with each compound and a subset of compounds with the highest number of counts, it is possible to assess the quality of the data through normalizing the replicates and use this same data to estimate the noise in the experiment. The noise level is seen to be dependent on sequencing depth as well as specific selection conditions. The noise estimation is independent of any cutoff used to remove low frequency compounds from the data analysis. The removal of compounds with only 1-5 read counts greatly reduces some of the challenges encountered in DEL data analysis as it can reduce the data set by greater than 100-fold without impacting the interpretation of the results.

摘要

通过分析重复 DNA 编码文库 (DEL) 选择的数据组，开发了一种估计实验噪声水平的方法。通过对与每个化合物相关的计数数量进行对数转换，并使用具有最高计数数量的化合物子集，可以通过对重复数据进行归一化来评估数据的质量，并使用相同的数据来估计实验中的噪声。噪声水平取决于测序深度以及特定的选择条件。噪声估计与用于从数据分析中去除低频化合物的任何截止值无关。从数据分析中去除仅具有 1-5 个读取计数的化合物可以极大地减少在 DEL 数据分析中遇到的一些挑战，因为它可以将数据集减少 100 倍以上，而不会影响结果的解释。

相似文献

Understanding Data Noise and Uncertainty through Analysis of Replicate Samples in DNA-Encoded Library Selection.通过分析 DNA 编码库筛选中的重复样本来理解数据噪声和不确定性。

J Chem Inf Model. 2022 May 9;62(9):2239-2247. doi: 10.1021/acs.jcim.1c00986. Epub 2021 Dec 4.

Randomness in DNA Encoded Library Selection Data Can Be Modeled for More Reliable Enrichment Calculation.DNA 编码文库选择数据中的随机性可以建模，以更可靠地计算富集。

SLAS Discov. 2018 Jun;23(5):405-416. doi: 10.1177/2472555218757718. Epub 2018 Feb 13.

Comparative evaluation of DNA-encoded chemical selections performed using DNA in single-stranded or double-stranded format.比较使用单链或双链 DNA 进行 DNA 编码化学选择的效果。

Biochem Biophys Res Commun. 2020 Dec 3;533(2):223-229. doi: 10.1016/j.bbrc.2020.04.035. Epub 2020 May 5.

Machine Learning on DNA-Encoded Library Count Data Using an Uncertainty-Aware Probabilistic Loss Function.基于不确定性感知概率损失函数的 DNA 编码库计数数据的机器学习。

J Chem Inf Model. 2022 May 23;62(10):2316-2331. doi: 10.1021/acs.jcim.2c00041. Epub 2022 May 10.

A method for estimating binding affinity from primary DEL selection data.从原始 DEL 选择数据估算结合亲和力的方法。

Biochem Biophys Res Commun. 2020 Dec 3;533(2):249-255. doi: 10.1016/j.bbrc.2020.04.029. Epub 2020 May 19.

Denoising DNA Encoded Library Screens with Sparse Learning.基于稀疏学习的 DNA 编码文库筛选降噪。

ACS Comb Sci. 2020 Aug 10;22(8):410-421. doi: 10.1021/acscombsci.0c00007. Epub 2020 Jun 26.

Evolution of the Selection Methods of DNA-Encoded Chemical Libraries.DNA 编码化学库的选择方法的演变。

Acc Chem Res. 2021 Sep 7;54(17):3491-3503. doi: 10.1021/acs.accounts.1c00375. Epub 2021 Aug 24.

Quantitative Comparison of Enrichment from DNA-Encoded Chemical Library Selections.DNA 编码化学库筛选的富集度定量比较。

ACS Comb Sci. 2019 Feb 11;21(2):75-82. doi: 10.1021/acscombsci.8b00116. Epub 2019 Jan 23.

Building Block-Based Binding Predictions for DNA-Encoded Libraries.基于积木的 DNA 编码文库结合预测。

J Chem Inf Model. 2023 Aug 28;63(16):5120-5132. doi: 10.1021/acs.jcim.3c00588. Epub 2023 Aug 14.

DEL Selections Against a Soluble Protein Target.DEL 对可溶性蛋白靶标的选择。

Methods Mol Biol. 2022;2541:155-164. doi: 10.1007/978-1-0716-2545-3_19.

引用本文的文献

Machine-Learning-Based Data Analysis Method for Cell-Based Selection of DNA-Encoded Libraries.基于机器学习的用于基于细胞筛选DNA编码文库的数据分析方法

ACS Omega. 2023 May 15;8(21):19057-19071. doi: 10.1021/acsomega.3c02152. eCollection 2023 May 30.

Multitask Deep Ensemble Prediction of Molecular Energetics in Solution: From Quantum Mechanics to Experimental Properties.溶液中分子能量学的多任务深度集成预测：从量子力学到实验性质

J Chem Theory Comput. 2023 Jan 6. doi: 10.1021/acs.jctc.2c01024.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通过分析 DNA 编码库筛选中的重复样本来理解数据噪声和不确定性。

Understanding Data Noise and Uncertainty through Analysis of Replicate Samples in DNA-Encoded Library Selection.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献