MacConnell Andrew B, Paegel Brian M
Department of Chemistry and †Doctoral Program in Chemical and Biological Sciences, The Scripps Research Institute , 130 Scripps Way, Jupiter, Florida 33458, United States.
ACS Comb Sci. 2017 Aug 14;19(8):524-532. doi: 10.1021/acscombsci.7b00061. Epub 2017 Jul 26.
Microfluidic droplet-based screening of DNA-encoded one-bead-one-compound combinatorial libraries is a miniaturized, potentially widely distributable approach to small molecule discovery. In these screens, a microfluidic circuit distributes library beads into droplets of activity assay reagent, photochemically cleaves the compound from the bead, then incubates and sorts the droplets based on assay result for subsequent DNA sequencing-based hit compound structure elucidation. Pilot experimental studies revealed that Poisson statistics describe nearly all aspects of such screens, prompting the development of simulations to understand system behavior. Monte Carlo screening simulation data showed that increasing mean library sampling (ε), mean droplet occupancy, or library hit rate all increase the false discovery rate (FDR). Compounds identified as hits on k > 1 beads (the replicate k class) were much more likely to be authentic hits than singletons (k = 1), in agreement with previous findings. Here, we explain this observation by deriving an equation for authenticity, which reduces to the product of a library sampling bias term (exponential in k) and a sampling saturation term (exponential in ε) setting a threshold that the k-dependent bias must overcome. The equation thus quantitatively describes why each hit structure's FDR is based on its k class, and further predicts the feasibility of intentionally populating droplets with multiple library beads, assaying the micromixtures for function, and identifying the active members by statistical deconvolution.
基于微流控液滴的DNA编码单珠单化合物组合文库筛选是一种小型化、可能广泛分布的小分子发现方法。在这些筛选中,微流控电路将文库珠子分配到活性测定试剂的液滴中,光化学地从珠子上裂解化合物,然后根据测定结果孵育并对液滴进行分类,以便随后基于DNA测序阐明命中化合物的结构。初步实验研究表明,泊松统计描述了此类筛选的几乎所有方面,促使人们开发模拟来理解系统行为。蒙特卡罗筛选模拟数据表明,增加平均文库采样率(ε)、平均液滴占有率或文库命中率都会增加错误发现率(FDR)。与之前的发现一致,在k>1个珠子上被鉴定为命中的化合物(重复k类)比单珠命中(k = 1)更有可能是真实命中。在这里,我们通过推导一个真实性方程来解释这一观察结果,该方程简化为一个文库采样偏差项(k的指数函数)和一个采样饱和项(ε的指数函数)的乘积,该采样饱和项设定了k依赖偏差必须克服的阈值。因此,该方程定量地描述了为什么每个命中结构的FDR基于其k类,并进一步预测了有意用多个文库珠子填充液滴、测定微混合物的功能以及通过统计反卷积识别活性成员的可行性。