• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从 Iso-Seq 数据中破译高度相似的多基因家族转录本,使用 IsoCon。

Deciphering highly similar multigene family transcripts from Iso-Seq data with IsoCon.

机构信息

Department of Computer Science and Engineering, Pennsylvania State University, University Park, PA, 16802, USA.

Department of Biology, Pennsylvania State University, University Park, PA, 16802, USA.

出版信息

Nat Commun. 2018 Nov 2;9(1):4601. doi: 10.1038/s41467-018-06910-x.

DOI:10.1038/s41467-018-06910-x
PMID:30389934
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6214943/
Abstract

A significant portion of genes in vertebrate genomes belongs to multigene families, with each family containing several gene copies whose presence/absence, as well as isoform structure, can be highly variable across individuals. Existing de novo techniques for assaying the sequences of such highly-similar gene families fall short of reconstructing end-to-end transcripts with nucleotide-level precision or assigning alternatively spliced transcripts to their respective gene copies. We present IsoCon, a high-precision method using long PacBio Iso-Seq reads to tackle this challenge. We apply IsoCon to nine Y chromosome ampliconic gene families and show that it outperforms existing methods on both experimental and simulated data. IsoCon has allowed us to detect an unprecedented number of novel isoforms and has opened the door for unraveling the structure of many multigene families and gaining a deeper understanding of genome evolution and human diseases.

摘要

脊椎动物基因组中的很大一部分基因属于多基因家族,每个家族包含几个基因副本,其存在/缺失以及同工型结构在个体之间可能有很大差异。现有的从头分析这些高度相似的基因家族序列的技术无法精确重建核苷酸水平的全长转录本,也无法将选择性剪接的转录本分配给各自的基因副本。我们提出了 IsoCon,这是一种使用长 PacBio Iso-Seq reads 来解决这一挑战的高精度方法。我们将 IsoCon 应用于九个 Y 染色体扩增基因家族,并表明它在实验和模拟数据上都优于现有方法。IsoCon 使我们能够检测到前所未有的大量新同工型,并为揭示许多多基因家族的结构以及更深入地了解基因组进化和人类疾病打开了大门。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/785a/6214943/babb0aaac91c/41467_2018_6910_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/785a/6214943/8080f05f1c36/41467_2018_6910_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/785a/6214943/2233cb01e3ed/41467_2018_6910_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/785a/6214943/122a44de3447/41467_2018_6910_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/785a/6214943/c9b81e1f4a32/41467_2018_6910_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/785a/6214943/babb0aaac91c/41467_2018_6910_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/785a/6214943/8080f05f1c36/41467_2018_6910_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/785a/6214943/2233cb01e3ed/41467_2018_6910_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/785a/6214943/122a44de3447/41467_2018_6910_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/785a/6214943/c9b81e1f4a32/41467_2018_6910_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/785a/6214943/babb0aaac91c/41467_2018_6910_Fig5_HTML.jpg

相似文献

1
Deciphering highly similar multigene family transcripts from Iso-Seq data with IsoCon.从 Iso-Seq 数据中破译高度相似的多基因家族转录本,使用 IsoCon。
Nat Commun. 2018 Nov 2;9(1):4601. doi: 10.1038/s41467-018-06910-x.
2
Sparse linear modeling of next-generation mRNA sequencing (RNA-Seq) data for isoform discovery and abundance estimation.基于下一代 mRNA 测序(RNA-Seq)数据的稀疏线性建模用于发现异构体和丰度估计。
Proc Natl Acad Sci U S A. 2011 Dec 13;108(50):19867-72. doi: 10.1073/pnas.1113972108. Epub 2011 Dec 1.
3
Dosage regulation, and variation in gene expression and copy number of human Y chromosome ampliconic genes.剂量调节,以及人类 Y 染色体扩增基因的表达和拷贝数的变化。
PLoS Genet. 2019 Sep 16;15(9):e1008369. doi: 10.1371/journal.pgen.1008369. eCollection 2019 Sep.
4
RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.RSEM:有或无参考基因组的 RNA-Seq 数据的准确转录本定量。
BMC Bioinformatics. 2011 Aug 4;12:323. doi: 10.1186/1471-2105-12-323.
5
A survey of the complex transcriptome from the highly polyploid sugarcane genome using full-length isoform sequencing and de novo assembly from short read sequencing.利用全长异构体测序和短读长测序的从头组装对高度多倍体甘蔗基因组的复杂转录组进行的一项调查。
BMC Genomics. 2017 May 22;18(1):395. doi: 10.1186/s12864-017-3757-8.
6
CIDANE: comprehensive isoform discovery and abundance estimation.CIDANE:全面的异构体发现与丰度估计
Genome Biol. 2016 Jan 30;17:16. doi: 10.1186/s13059-015-0865-0.
7
PacBio full-length cDNA sequencing integrated with RNA-seq reads drastically improves the discovery of splicing transcripts in rice.PacBio 全长 cDNA 测序与 RNA-seq reads 相结合极大地提高了水稻剪接转录本的发现。
Plant J. 2019 Jan;97(2):296-305. doi: 10.1111/tpj.14120. Epub 2018 Dec 3.
8
Splicing of exon 9a in FMR1 transcripts results in a truncated FMRP with altered subcellular distribution.外显子 9a 在 FMR1 转录本中的剪接导致截断的 FMRP,其亚细胞分布发生改变。
Gene. 2020 Mar 20;731:144359. doi: 10.1016/j.gene.2020.144359. Epub 2020 Jan 11.
9
Full-length isoform concatenation sequencing to resolve cancer transcriptome complexity.全长异构体拼接测序解析癌症转录组复杂性。
BMC Genomics. 2024 Jan 29;25(1):122. doi: 10.1186/s12864-024-10021-x.
10
CircAST: Full-length Assembly and Quantification of Alternatively Spliced Isoforms in Circular RNAs.CircAST:环状 RNA 中可变剪接异构体的全长组装和定量。
Genomics Proteomics Bioinformatics. 2019 Oct;17(5):522-534. doi: 10.1016/j.gpb.2019.03.004. Epub 2020 Jan 31.

引用本文的文献

1
clustering of long-read amplicons improves phylogenetic insight into microbiome data.长读长扩增子的聚类提高了对微生物组数据的系统发育洞察力。
Gut Microbes. 2025 Dec;17(1):2516703. doi: 10.1080/19490976.2025.2516703. Epub 2025 Jun 11.
2
De novo clustering of large long-read transcriptome datasets with isONclust3.使用isONclust3对大型长读长转录组数据集进行从头聚类。
Bioinformatics. 2025 May 6;41(5). doi: 10.1093/bioinformatics/btaf207.
3
Polyploidization of Indotyphlops braminus: evidence from isoform-sequencing.横纹钝头蛇的多倍体化:同工酶测序的证据。

本文引用的文献

1
Single-molecule, full-length transcript sequencing provides insight into the extreme metabolism of the ruby-throated hummingbird Archilochus colubris.单分子全长转录本测序揭示了猩红喉蜂鸟(Archilochus colubris)极端代谢的奥秘。
Gigascience. 2018 Mar 1;7(3):1-12. doi: 10.1093/gigascience/giy009.
2
Allelic decomposition and exact genotyping of highly polymorphic and structurally variant genes.高度多态性和结构变异基因的等位基因分解和精确基因分型。
Nat Commun. 2018 Feb 26;9(1):828. doi: 10.1038/s41467-018-03273-1.
3
SQANTI: extensive characterization of long-read transcript sequences for quality control in full-length transcriptome identification and quantification.
BMC Genom Data. 2024 Feb 26;25(1):23. doi: 10.1186/s12863-024-01208-y.
4
Highly efficient clustering of long-read transcriptomic data with GeLuster.使用 GeLuster 对长读转录组数据进行高效聚类。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae059.
5
Transcript Isoform Diversity of Ampliconic Genes on the Y Chromosome of Great Apes.Y 染色体上扩增基因的转录本多样性在大型猿类中。
Genome Biol Evol. 2023 Nov 1;15(11). doi: 10.1093/gbe/evad205.
6
Application of third-generation sequencing in cancer research.第三代测序技术在癌症研究中的应用。
Med Rev (2021). 2021 Oct 21;1(2):150-171. doi: 10.1515/mr-2021-0013. eCollection 2021 Dec.
7
Nanopore sequencing of PCR products enables multicopy gene family reconstruction.聚合酶链式反应(PCR)产物的纳米孔测序可实现多拷贝基因家族重建。
Comput Struct Biotechnol J. 2023 Jul 16;21:3656-3664. doi: 10.1016/j.csbj.2023.07.012. eCollection 2023.
8
Locality-sensitive bucketing functions for the edit distance.用于编辑距离的局部敏感桶函数。
Algorithms Mol Biol. 2023 Jul 24;18(1):7. doi: 10.1186/s13015-023-00234-2.
9
isONform: reference-free transcriptome reconstruction from Oxford Nanopore data.isONform:一种无需参考基因组的转录组重构方法,来自牛津纳米孔技术。
Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i222-i231. doi: 10.1093/bioinformatics/btad264.
10
IsoTools: a flexible workflow for long-read transcriptome sequencing analysis.IsoTools:一种用于长读转录组测序分析的灵活工作流程。
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad364.
SQANTI:用于全长转录组鉴定和定量的长读转录序列的广泛特征化,以进行质量控制。
Genome Res. 2018 Mar 1;28(3):396-411. doi: 10.1101/gr.222976.117.
4
Long-read sequencing of the coffee bean transcriptome reveals the diversity of full-length transcripts.长读测序技术揭示了咖啡豆转录组全长转录本的多样性。
Gigascience. 2017 Nov 1;6(11):1-13. doi: 10.1093/gigascience/gix086.
5
Isoform Evolution in Primates through Independent Combination of Alternative RNA Processing Events.通过可变RNA加工事件的独立组合实现灵长类动物中的异构体进化。
Mol Biol Evol. 2017 Oct 1;34(10):2453-2468. doi: 10.1093/molbev/msx212.
6
Altered expression of the FMR1 splicing variants landscape in premutation carriers.脆性 X 智力低下 1 号基因剪接变异体景观在前突变携带者中的改变表达。
Biochim Biophys Acta Gene Regul Mech. 2017 Nov;1860(11):1117-1126. doi: 10.1016/j.bbagrm.2017.08.007. Epub 2017 Sep 7.
7
Analysis of 62 hybrid assembled human Y chromosomes exposes rapid structural changes and high rates of gene conversion.对62条杂交组装的人类Y染色体的分析揭示了快速的结构变化和高频率的基因转换。
PLoS Genet. 2017 Aug 28;13(8):e1006834. doi: 10.1371/journal.pgen.1006834. eCollection 2017 Aug.
8
A survey of the complex transcriptome from the highly polyploid sugarcane genome using full-length isoform sequencing and de novo assembly from short read sequencing.利用全长异构体测序和短读长测序的从头组装对高度多倍体甘蔗基因组的复杂转录组进行的一项调查。
BMC Genomics. 2017 May 22;18(1):395. doi: 10.1186/s12864-017-3757-8.
9
Normalized long read RNA sequencing in chicken reveals transcriptome complexity similar to human.鸡的标准化长读长RNA测序揭示了与人类相似的转录组复杂性。
BMC Genomics. 2017 Apr 24;18(1):323. doi: 10.1186/s12864-017-3691-9.
10
HapIso: An Accurate Method for the Haplotype- Specific Isoforms Reconstruction From Long Single-Molecule Reads.HapIso:一种从长单分子读数中准确重建单倍型特异性异构体的方法。
IEEE Trans Nanobioscience. 2017 Mar;16(2):108-115. doi: 10.1109/TNB.2017.2675981. Epub 2017 Mar 17.