Department of Pediatrics, Baylor College of Medicine, Houston, TX, United States; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX, United States; USDA/ARS Children's Nutrition Research Center, Department of Pediatrics, Baylor College of Medicine, Houston, TX, United States.
Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch at Galveston, Galveston, TX, United States.
Methods Enzymol. 2021;655:185-204. doi: 10.1016/bs.mie.2021.04.001. Epub 2021 Jun 5.
An increasing number of investigations have established alternative polyadenylation (APA) as a key mechanism of gene regulation through altering the length of 3' untranslated region (UTR) and generating distinct mRNA termini. Further, appreciation for the significance of APA in disease contexts propelled the development of several 3' sequencing techniques. While these RNA sequencing technologies have advanced APA analysis, the intrinsic limitation of 3' read coverage and lack of appropriate computational tools constrain precise mapping and quantification of polyadenylation sites. Notably, Poly(A)-ClickSeq (PAC-seq) overcomes limiting factors such as poly(A) enrichment and 3' linker ligation steps using click-chemistry. Here we provide an updated PolyA-miner protocol, a computational approach to analyze PAC-seq or other 3'-Seq datasets. As a key practical constraint, we also provide a detailed account on the impact of sequencing depth on the number of detected polyadenylation sites and APA changes. This protocol is also updated to handle unique molecular identifiers used to address PCR duplication potentially observed in PAC-seq.
越来越多的研究已经确定了可变多聚腺苷酸化 (APA) 通过改变 3' 非翻译区 (UTR) 的长度和产生不同的 mRNA 末端,作为一种关键的基因调控机制。此外,对 APA 在疾病背景下的重要性的认识推动了几种 3' 测序技术的发展。虽然这些 RNA 测序技术已经推进了 APA 分析,但 3' 读段覆盖的固有限制和缺乏适当的计算工具限制了多聚腺苷酸化位点的精确映射和定量。值得注意的是,Poly(A)-ClickSeq (PAC-seq) 使用点击化学克服了多聚腺苷酸化富集和 3' 连接子连接步骤等限制因素。在这里,我们提供了一个经过更新的 PolyA-miner 方案,这是一种用于分析 PAC-seq 或其他 3'-Seq 数据集的计算方法。作为一个关键的实际限制因素,我们还详细说明了测序深度对检测到的多聚腺苷酸化位点数量和 APA 变化的影响。该方案也已更新,以处理在 PAC-seq 中可能观察到的 PCR 重复问题中使用的独特分子标识符。