Ocari Tommaso, Zin Emilia A, Tekinsoy Muge, Van Meter Timothé, Desrosiers Mélissa, Cammarota Chiara, Dalkara Deniz, Nemoto Takahiro, Ferrari Ulisse
Institut de la Vision, Sorbonne Université, INSERM, CNRS, 17 rue Moreau, 75012 Paris, France.
Physics department, University of Rome 'La Sapienza', Piazzale Aldo Moro 5, 00185 Rome, Italy.
Nucleic Acids Res. 2025 Aug 27;53(16). doi: 10.1093/nar/gkaf793.
In combinatorial genetic engineering experiments, next-generation sequencing (NGS) allows for measuring the concentrations of barcoded or mutated genes within highly diverse libraries. When designing and interpreting these experiments, sequencing depths are thus important parameters to take into account. Service providers follow established guidelines to determine NGS depth depending on the type of experiment, such as RNA sequencing or whole genome sequencing. However, guidelines specifically tailored for measuring barcode concentrations have not yet reached an accepted consensus. To address this issue, we combine the analysis of NGS datasets from barcoded libraries with a mathematical model taking into account the polymerase chain reaction amplification in library preparation. We demonstrate on several datasets that noise in the NGS counts increases with the sequencing depth; consequently, beyond certain limits, deeper sequencing does not improve the precision of measuring barcode concentrations. We propose, as rule of thumb, that the optimal sequencing depth should be about ten times the initial amount of barcoded DNA molecules before any amplification step.
在组合基因工程实验中,新一代测序(NGS)能够测量高度多样化文库中条形码标记或突变基因的浓度。因此,在设计和解读这些实验时,测序深度是需要考虑的重要参数。服务提供商遵循既定指南,根据实验类型(如RNA测序或全基因组测序)来确定NGS深度。然而,专门针对测量条形码浓度的指南尚未达成公认的共识。为解决这一问题,我们将条形码文库的NGS数据集分析与一个考虑文库制备中聚合酶链反应扩增的数学模型相结合。我们在几个数据集上证明,NGS计数中的噪声随测序深度增加;因此,超过一定限度后,更深的测序并不能提高测量条形码浓度的精度。我们建议,根据经验,最佳测序深度应约为任何扩增步骤之前条形码DNA分子初始量的十倍。