Centre de Génétique Moléculaire - CNRS, Avenue de la Terrasse, 91198 Gif sur Yvette, France.
Plateforme Intégrée IMAGIF - CNRS, Avenue de la Terrasse, 91198 Gif sur Yvette, France.
Exp Cell Res. 2014 Mar 10;322(1):12-20. doi: 10.1016/j.yexcr.2014.01.008. Epub 2014 Jan 15.
Next-generation sequencing (NGS) has caused a revolution in biology. NGS requires the preparation of libraries in which (fragments of) DNA or RNA molecules are fused with adapters followed by PCR amplification and sequencing. It is evident that robust library preparation methods that produce a representative, non-biased source of nucleic acid material from the genome under investigation are of crucial importance. Nevertheless, it has become clear that NGS libraries for all types of applications contain biases that compromise the quality of NGS datasets and can lead to their erroneous interpretation. A detailed knowledge of the nature of these biases will be essential for a careful interpretation of NGS data on the one hand and will help to find ways to improve library quality or to develop bioinformatics tools to compensate for the bias on the other hand. In this review we discuss the literature on bias in the most common NGS library preparation protocols, both for DNA sequencing (DNA-seq) as well as for RNA sequencing (RNA-seq). Strikingly, almost all steps of the various protocols have been reported to introduce bias, especially in the case of RNA-seq, which is technically more challenging than DNA-seq. For each type of bias we discuss methods for improvement with a view to providing some useful advice to the researcher who wishes to convert any kind of raw nucleic acid into an NGS library.
下一代测序(NGS)在生物学领域引发了一场革命。NGS 需要进行文库制备,在此过程中,(DNA 或 RNA 分子的)片段与接头融合,然后进行 PCR 扩增和测序。显然,从研究中的基因组中制备出具有代表性、非偏向性的核酸材料的稳健文库方法至关重要。然而,已经清楚的是,所有类型应用的 NGS 文库都存在偏差,这些偏差会影响 NGS 数据集的质量,并可能导致对其的错误解释。详细了解这些偏差的性质对于仔细解释 NGS 数据至关重要,一方面可以帮助找到改善文库质量的方法,或者开发生物信息学工具来弥补另一方面的偏差。在这篇综述中,我们讨论了最常见的 NGS 文库制备方案中偏差的文献,包括 DNA 测序(DNA-seq)和 RNA 测序(RNA-seq)。令人惊讶的是,几乎所有的协议步骤都被报道会引入偏差,尤其是在技术上比 DNA-seq 更具挑战性的 RNA-seq 中。对于每种类型的偏差,我们都讨论了改进的方法,以期为希望将任何类型的原始核酸转化为 NGS 文库的研究人员提供一些有用的建议。