Long-read, whole-genome shotgun sequence data for five model organisms.

Affiliations

Pacific Biosciences of California Inc., 1380 Willow Road, Menlo Park, California 94025, USA.

Flinders University, School of Biological Sciences, PO Box 2100, Adelaide, South Australia 5001, Australia.

Publication information

Sci Data. 2014 Nov 25;1:140045. doi: 10.1038/sdata.2014.45. eCollection 2014.

Abstract

Single molecule, real-time (SMRT) sequencing from Pacific Biosciences is increasingly used in many areas of biological research including de novo genome assembly, structural-variant identification, haplotype phasing, mRNA isoform discovery, and base-modification analyses. High-quality, public datasets of SMRT sequences can spur development of analytic tools that can accommodate unique characteristics of SMRT data (long read lengths, lack of GC or amplification bias, and a random error profile leading to high consensus accuracy). In this paper, we describe eight high-coverage SMRT sequence datasets from five organisms (Escherichia coli, Saccharomyces cerevisiae, Neurospora crassa, Arabidopsis thaliana, and Drosophila melanogaster) that have been publicly released to the general scientific community (NCBI Sequence Read Archive ID SRP040522). Data were generated using two sequencing chemistries (P4C2 and P5C3) on the PacBio RS II instrument. The datasets reported here can be used without restriction by the research community to generate whole-genome assemblies, test new algorithms, investigate genome structure and evolution, and identify base modifications in some of the most widely-studied model systems in biological research.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e89/4365909/59c28f3e4e03/sdata201445-f1.jpg
