Maxeiner Stephan, Krasteva-Christ Gabriela, Althaus Mike
Institute for Anatomy and Cell Biology, Saarland University, Homburg, Germany.
Department of Natural Sciences, Institute for Functional Gene Analytics, Bonn-Rhein-Sieg University of Applied Sciences, Rheinbach, Germany.
J Physiol. 2023 May;601(9):1611-1623. doi: 10.1113/JP284066. Epub 2023 Feb 21.
Synthesis of DNA fragments based on gene sequences that are available in public resources has become an efficient and affordable method that has gradually replaced traditional cloning efforts such as PCR cloning from cDNA. However, database entries based on genome sequencing results are prone to errors which can lead to false sequence information and, ultimately, errors in functional characterisation of proteins such as ion channels and transporters in heterologous expression systems. We have identified five common problems that repeatedly appear in public resources: (1) Not every gene has yet been annotated; (2) not all gene annotations are necessarily correct; (3) transcripts may contain automated corrections; (4) there are mismatches between gene, mRNA and protein sequences; and (5) splicing patterns often lack experimental validation. This technical review highlights and provides a strategy to bypass these issues in order to avoid critical mistakes that could impact future studies of any gene/protein of interest in heterologous expression systems.
基于公共资源中可用基因序列合成DNA片段已成为一种高效且经济实惠的方法,该方法已逐渐取代了传统的克隆方法,如从cDNA进行PCR克隆。然而,基于基因组测序结果的数据库条目容易出现错误,这可能导致错误的序列信息,并最终导致在异源表达系统中对离子通道和转运蛋白等蛋白质进行功能表征时出现错误。我们已经确定了公共资源中反复出现的五个常见问题:(1)并非每个基因都已注释;(2)并非所有基因注释都一定正确;(3)转录本可能包含自动校正;(4)基因、mRNA和蛋白质序列之间存在错配;(5)剪接模式通常缺乏实验验证。本技术综述强调并提供了一种绕过这些问题的策略,以避免可能影响未来对异源表达系统中任何感兴趣的基因/蛋白质进行研究的关键错误。