Suppr超能文献

实验信息(元数据)对于存档序列数据的重要性:RNA测序中样本采集与RNA保护之间的时间间隔导致特定基因偏差的情况。

Importance of experimental information (metadata) for archived sequence data: case of specific gene bias due to lag time between sample harvest and RNA protection in RNA sequencing.

作者信息

Matsuda Tomoko

机构信息

Nihon BioData Corporation, Kawasaki, Kanagawa, Japan.

出版信息

PeerJ. 2021 Aug 25;9:e11875. doi: 10.7717/peerj.11875. eCollection 2021.

Abstract

Large volumes of high-throughput sequencing data have been submitted to the Sequencing Read Archive (SRA). The lack of experimental metadata associated with the data makes reuse and understanding data quality very difficult. In the case of RNA sequencing (RNA-Seq), which reveals the presence and quantity of RNA in a biological sample at any moment, it is necessary to consider that gene expression responds over a short time interval (several seconds to a few minutes) in many organisms. Therefore, to isolate RNA that accurately reflects the transcriptome at the point of harvest, raw biological samples should be processed by freezing in liquid nitrogen, immersing in RNA stabilization reagent or lysing and homogenizing in RNA lysis buffer containing guanidine thiocyanate as soon as possible. As the number of samples handled simultaneously increases, the time until the RNA is protected can increase. Here, to evaluate the effect of different lag times in RNA protection on RNA-Seq data, we harvested CHO-S cells after 3, 5, 6, and 7 days of cultivation, added RNA lysis buffer in a time course of 15, 30, 45, and 60 min after harvest, and conducted RNA-Seq. These RNA samples showed high RNA integrity number (RIN) values indicating non-degraded RNA, and sequence data from libraries prepared with these RNA samples was of high quality according to FastQC. We observed that, at the same cultivation day, global trends of gene expression were similar across the time course of addition of RNA lysis buffer; however, the expression of some genes was significantly different between the time-course samples of the same cultivation day; most of these differentially expressed genes were related to apoptosis. We conclude that the time lag between sample harvest and RNA protection influences gene expression of specific genes. It is, therefore, necessary to know not only RIN values of RNA and the quality of the sequence data but also how the experiment was performed when acquiring RNA-Seq data from the database.

摘要

大量高通量测序数据已提交至序列读取存档库(SRA)。缺乏与这些数据相关的实验元数据使得数据的再利用和对数据质量的理解变得非常困难。就RNA测序(RNA-Seq)而言,它能揭示生物样品中RNA在任何时刻的存在情况和数量,有必要考虑到在许多生物体中基因表达在短时间间隔(几秒到几分钟)内会发生变化。因此,为了分离出能准确反映收获时转录组的RNA,原始生物样品应通过在液氮中冷冻、浸入RNA稳定试剂或尽快在含有异硫氰酸胍的RNA裂解缓冲液中裂解并匀浆来进行处理。随着同时处理的样品数量增加,RNA得到保护之前的时间可能会延长。在此,为了评估RNA保护中不同延迟时间对RNA-Seq数据的影响,我们在培养3、5、6和7天后收获中国仓鼠卵巢细胞(CHO-S),在收获后15、30、45和60分钟的时间进程中添加RNA裂解缓冲液,并进行RNA-Seq。这些RNA样品显示出较高的RNA完整性数值(RIN),表明RNA未降解,并且根据FastQC,用这些RNA样品制备的文库的序列数据质量很高。我们观察到,在相同的培养天数,添加RNA裂解缓冲液的时间进程中基因表达的整体趋势相似;然而,在相同培养天数的时间进程样品之间,一些基因的表达存在显著差异;这些差异表达的基因大多与细胞凋亡有关。我们得出结论,样品收获和RNA保护之间的时间延迟会影响特定基因的表达。因此,在从数据库获取RNA-Seq数据时,不仅有必要了解RNA的RIN值和序列数据的质量,还需知道实验是如何进行的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/421f/8401820/8a1b7d2e5824/peerj-09-11875-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验