• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MPRA解码器:处理具有感兴趣区域未知序列和相关条形码的原始MPRA数据。

MPRAdecoder: Processing of the Raw MPRA Data With Unknown Sequences of the Region of Interest and Associated Barcodes.

作者信息

Letiagina Anna E, Omelina Evgeniya S, Ivankin Anton V, Pindyurin Alexey V

机构信息

Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia.

Faculty of Natural Sciences, Novosibirsk State University, Novosibirsk, Russia.

出版信息

Front Genet. 2021 May 11;12:618189. doi: 10.3389/fgene.2021.618189. eCollection 2021.

DOI:10.3389/fgene.2021.618189
PMID:34046055
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8148044/
Abstract

Massively parallel reporter assays (MPRAs) enable high-throughput functional evaluation of numerous DNA regulatory elements and/or their mutant variants. The assays are based on the construction of reporter plasmid libraries containing two variable parts, a region of interest (ROI) and a barcode (BC), located outside and within the transcription unit, respectively. Importantly, each plasmid molecule in a such a highly diverse library is characterized by a unique BC-ROI association. The reporter constructs are delivered to target cells and expression of BCs at the transcript level is assayed by RT-PCR followed by next-generation sequencing (NGS). The obtained values are normalized to the abundance of BCs in the plasmid DNA sample. Altogether, this allows evaluating the regulatory potential of the associated ROI sequences. However, depending on the MPRA library construction design, the BC and ROI sequences as well as their associations can be unknown. In such a case, the BC and ROI sequences, their possible mutant variants, and unambiguous BC-ROI associations have to be identified, whereas all uncertain cases have to be excluded from the analysis. Besides the preparation of additional "mapping" samples for NGS, this also requires specific bioinformatics tools. Here, we present a pipeline for processing raw MPRA data obtained by NGS for reporter construct libraries with unknown sequences of BCs and ROIs. The pipeline robustly identifies unambiguous (so-called genuine) BCs and ROIs associated with them, calculates the normalized expression level for each BC and the averaged values for each ROI, and provides a graphical visualization of the processed data.

摘要

大规模平行报告基因检测(MPRAs)能够对众多DNA调控元件及其突变变体进行高通量功能评估。这些检测基于构建报告质粒文库,该文库包含两个可变部分,分别是位于转录单元外部的感兴趣区域(ROI)和位于转录单元内部的条形码(BC)。重要的是,如此高度多样化的文库中的每个质粒分子都具有独特的BC-ROI关联特征。将报告构建体导入靶细胞,通过逆转录聚合酶链反应(RT-PCR)随后进行下一代测序(NGS)来检测转录水平上BC的表达。将获得的值归一化为质粒DNA样品中BC的丰度。总之,这使得能够评估相关ROI序列调控潜力。然而,根据MPRA文库构建设计,BC和ROI序列及其关联可能是未知的。在这种情况下,必须识别BC和ROI序列、它们可能的突变变体以及明确的BC-ROI关联,而所有不确定的情况都必须从分析中排除。除了为NGS制备额外的“定位”样品外,这还需要特定的生物信息学工具。在这里,我们提出了一个流程,用于处理通过NGS获得的、针对BC和ROI序列未知的报告构建体文库的原始MPRA数据。该流程能够可靠地识别与它们相关的明确(即所谓的真实)BC和ROI,计算每个BC的归一化表达水平以及每个ROI的平均值,并提供处理后数据的图形可视化。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ea/8148044/e42182ec2346/fgene-12-618189-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ea/8148044/eda026025b02/fgene-12-618189-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ea/8148044/4cf6ea0afa8f/fgene-12-618189-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ea/8148044/a775ebab9ab5/fgene-12-618189-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ea/8148044/e42182ec2346/fgene-12-618189-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ea/8148044/eda026025b02/fgene-12-618189-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ea/8148044/4cf6ea0afa8f/fgene-12-618189-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ea/8148044/a775ebab9ab5/fgene-12-618189-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ea/8148044/e42182ec2346/fgene-12-618189-g004.jpg

相似文献

1
MPRAdecoder: Processing of the Raw MPRA Data With Unknown Sequences of the Region of Interest and Associated Barcodes.MPRA解码器:处理具有感兴趣区域未知序列和相关条形码的原始MPRA数据。
Front Genet. 2021 May 11;12:618189. doi: 10.3389/fgene.2021.618189. eCollection 2021.
2
Optimized PCR conditions minimizing the formation of chimeric DNA molecules from MPRA plasmid libraries.优化 PCR 条件以最大限度减少来自 MPRA 质粒文库的嵌合 DNA 分子的形成。
BMC Genomics. 2019 Jul 11;20(Suppl 7):536. doi: 10.1186/s12864-019-5847-2.
3
Plasmid and Sequencing Library Preparation for CRISPRi Barcoded Expression Reporter Sequencing (CiBER-seq) in .用于CRISPRi条形码表达报告基因测序(CiBER-seq)的质粒和测序文库制备 于……
Bio Protoc. 2022 Apr 5;12(7):e4376. doi: 10.21769/BioProtoc.4376.
4
Deciphering regulatory DNA sequences and noncoding genetic variants using neural network models of massively parallel reporter assays.利用大规模平行报告基因实验的神经网络模型来破译调控 DNA 序列和非编码遗传变异。
PLoS One. 2019 Jun 17;14(6):e0218073. doi: 10.1371/journal.pone.0218073. eCollection 2019.
5
Massively parallel reporter assay: a novel technique for analyzing the regulation of gene expression.大规模平行报告基因检测:一种分析基因表达调控的新方法。
Yi Chuan. 2023 Oct 20;45(10):859-873. doi: 10.16288/j.yczz.23-180.
6
Sequence-based correction of barcode bias in massively parallel reporter assays.基于序列的大规模平行报告分析中条形码偏倚的校正。
Genome Res. 2021 Sep;31(9):1638-1645. doi: 10.1101/gr.268599.120. Epub 2021 Jul 20.
7
QuASAR-MPRA: accurate allele-specific analysis for massively parallel reporter assays.QuASAR-MPRA:用于大规模平行报告分析的精确等位基因特异性分析。
Bioinformatics. 2018 Mar 1;34(5):787-794. doi: 10.1093/bioinformatics/btx598.
8
lentiMPRA and MPRAflow for high-throughput functional characterization of gene regulatory elements.用于基因调控元件高通量功能表征的慢病毒MPRA和MPRAflow技术
Nat Protoc. 2020 Aug;15(8):2387-2412. doi: 10.1038/s41596-020-0333-5. Epub 2020 Jul 8.
9
Design tools for MPRA experiments.MPRA 实验的设计工具。
Bioinformatics. 2018 Aug 1;34(15):2682-2683. doi: 10.1093/bioinformatics/bty150.
10
Statistical considerations for the analysis of massively parallel reporter assays data.大规模平行报告基因检测数据分析的统计学考虑。
Genet Epidemiol. 2020 Oct;44(7):785-794. doi: 10.1002/gepi.22337. Epub 2020 Jul 18.

引用本文的文献

1
Decoding biology with massively parallel reporter assays and machine learning.利用大规模平行报告基因检测和机器学习解码生物学。
Genes Dev. 2024 Oct 16;38(17-20):843-865. doi: 10.1101/gad.351800.124.
2
Slight Variations in the Sequence Downstream of the Polyadenylation Signal Significantly Increase Transgene Expression in HEK293T and CHO Cells.多聚腺苷酸化信号下游序列的微小变化显著增加 HEK293T 和 CHO 细胞中转基因的表达。
Int J Mol Sci. 2022 Dec 7;23(24):15485. doi: 10.3390/ijms232415485.

本文引用的文献

1
Fine gene expression regulation by minor sequence variations downstream of the polyadenylation signal.通过多聚腺苷酸化信号下游的微小序列变异实现精细的基因表达调控。
Mol Biol Rep. 2021 Feb;48(2):1539-1547. doi: 10.1007/s11033-021-06160-z. Epub 2021 Jan 31.
2
A semi-supervised model to predict regulatory effects of genetic variants at single nucleotide resolution using massively parallel reporter assays.一种使用大规模平行报告基因实验,在单核苷酸分辨率下预测遗传变异调控效应的半监督模型。
Bioinformatics. 2021 Aug 4;37(14):1953–1962. doi: 10.1093/bioinformatics/btab040. Epub 2021 Jan 30.
3
Systematic identification of -regulatory variants that cause gene expression differences in a yeast cross.
在酵母杂交中系统性识别引起基因表达差异的 -regulatory 变异。
Elife. 2020 Nov 12;9:e62669. doi: 10.7554/eLife.62669.
4
A systematic evaluation of the design and context dependencies of massively parallel reporter assays.大规模平行报告基因检测设计与背景依赖性的系统评价。
Nat Methods. 2020 Nov;17(11):1083-1091. doi: 10.1038/s41592-020-0965-y. Epub 2020 Oct 12.
5
Deciphering the regulatory genome of , one hundred promoters at a time.一次破译 100 个启动子的调控基因组。
Elife. 2020 Sep 21;9:e55308. doi: 10.7554/eLife.55308.
6
Massively Parallel Reporter Assays: Defining Functional Psychiatric Genetic Variants Across Biological Contexts.大规模平行报告分析:在不同生物学背景下定义功能性精神疾病遗传变异。
Biol Psychiatry. 2021 Jan 1;89(1):76-89. doi: 10.1016/j.biopsych.2020.06.011. Epub 2020 Jun 18.
7
Statistical considerations for the analysis of massively parallel reporter assays data.大规模平行报告基因检测数据分析的统计学考虑。
Genet Epidemiol. 2020 Oct;44(7):785-794. doi: 10.1002/gepi.22337. Epub 2020 Jul 18.
8
lentiMPRA and MPRAflow for high-throughput functional characterization of gene regulatory elements.用于基因调控元件高通量功能表征的慢病毒MPRA和MPRAflow技术
Nat Protoc. 2020 Aug;15(8):2387-2412. doi: 10.1038/s41596-020-0333-5. Epub 2020 Jul 8.
9
Dissection of c-AMP Response Element Architecture by Using Genomic and Episomal Massively Parallel Reporter Assays.使用基因组和附加型大规模平行报告基因检测分析 c-AMP 反应元件结构。
Cell Syst. 2020 Jul 22;11(1):75-85.e7. doi: 10.1016/j.cels.2020.05.011. Epub 2020 Jun 29.
10
Massively parallel reporter assays of melanoma risk variants identify MX2 as a gene promoting melanoma.大规模平行报告基因检测黑色素瘤风险变异,鉴定 MX2 为促进黑色素瘤发生的基因。
Nat Commun. 2020 Jun 1;11(1):2718. doi: 10.1038/s41467-020-16590-1.