• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CoV-Seq,一种用于SARS-CoV-2基因组分析和可视化的新工具:开发与可用性研究

CoV-Seq, a New Tool for SARS-CoV-2 Genome Analysis and Visualization: Development and Usability Study.

作者信息

Liu Boxiang, Liu Kaibo, Zhang He, Zhang Liang, Bian Yuchen, Huang Liang

机构信息

Baidu Research, Sunnyvale, CA, United States.

School of Electrical Engineering & Computer Science, Oregon State University, Corvallis, OR, United States.

出版信息

J Med Internet Res. 2020 Oct 2;22(10):e22299. doi: 10.2196/22299.

DOI:10.2196/22299
PMID:32931441
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7537720/
Abstract

BACKGROUND

COVID-19 became a global pandemic not long after its identification in late 2019. The genomes of SARS-CoV-2 are being rapidly sequenced and shared on public repositories. To keep up with these updates, scientists need to frequently refresh and reclean data sets, which is an ad hoc and labor-intensive process. Further, scientists with limited bioinformatics or programming knowledge may find it difficult to analyze SARS-CoV-2 genomes.

OBJECTIVE

To address these challenges, we developed CoV-Seq, an integrated web server that enables simple and rapid analysis of SARS-CoV-2 genomes.

METHODS

CoV-Seq is implemented in Python and JavaScript. The web server and source code URLs are provided in this article.

RESULTS

Given a new sequence, CoV-Seq automatically predicts gene boundaries and identifies genetic variants, which are displayed in an interactive genome visualizer and are downloadable for further analysis. A command-line interface is available for high-throughput processing. In addition, we aggregated all publicly available SARS-CoV-2 sequences from the Global Initiative on Sharing Avian Influenza Data (GISAID), National Center for Biotechnology Information (NCBI), European Nucleotide Archive (ENA), and China National GeneBank (CNGB), and extracted genetic variants from these sequences for download and downstream analysis. The CoV-Seq database is updated weekly.

CONCLUSIONS

We have developed CoV-Seq, an integrated web service for fast and easy analysis of custom SARS-CoV-2 sequences. The web server provides an interactive module for the analysis of custom sequences and a weekly updated database of genetic variants of all publicly accessible SARS-CoV-2 sequences. We believe CoV-Seq will help improve our understanding of the genetic underpinnings of COVID-19.

摘要

背景

2019年末新冠病毒被发现后不久,COVID-19就成为了全球大流行病。严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的基因组正在被快速测序并在公共数据库中共享。为了跟上这些更新,科学家们需要频繁刷新和重新清理数据集,这是一个临时且耗费人力的过程。此外,生物信息学或编程知识有限的科学家可能会发现分析SARS-CoV-2基因组很困难。

目的

为应对这些挑战,我们开发了CoV-Seq,这是一个集成的网络服务器,可实现对SARS-CoV-2基因组的简单快速分析。

方法

CoV-Seq用Python和JavaScript实现。本文提供了网络服务器和源代码的网址。

结果

给定一个新序列,CoV-Seq会自动预测基因边界并识别基因变异,这些变异会显示在交互式基因组可视化工具中,并且可供下载以进行进一步分析。还有一个命令行界面可用于高通量处理。此外,我们汇总了来自全球共享禽流感数据倡议组织(GISAID)、美国国立生物技术信息中心(NCBI)、欧洲核苷酸档案馆(ENA)和中国国家基因库(CNGB)的所有公开可用的SARS-CoV-2序列,并从这些序列中提取基因变异以供下载和下游分析。CoV-Seq数据库每周更新一次。

结论

我们开发了CoV-Seq,这是一个用于快速轻松分析定制SARS-CoV-2序列的集成网络服务。该网络服务器提供了一个用于分析定制序列的交互式模块以及一个每周更新的所有可公开获取的SARS-CoV-2序列的基因变异数据库。我们相信CoV-Seq将有助于增进我们对COVID-19遗传基础的理解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c724/7537720/c1a26bcf316d/jmir_v22i10e22299_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c724/7537720/713c157cff73/jmir_v22i10e22299_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c724/7537720/d8ff6427b2fb/jmir_v22i10e22299_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c724/7537720/c1a26bcf316d/jmir_v22i10e22299_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c724/7537720/713c157cff73/jmir_v22i10e22299_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c724/7537720/d8ff6427b2fb/jmir_v22i10e22299_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c724/7537720/c1a26bcf316d/jmir_v22i10e22299_fig3.jpg

相似文献

1
CoV-Seq, a New Tool for SARS-CoV-2 Genome Analysis and Visualization: Development and Usability Study.CoV-Seq,一种用于SARS-CoV-2基因组分析和可视化的新工具:开发与可用性研究
J Med Internet Res. 2020 Oct 2;22(10):e22299. doi: 10.2196/22299.
2
Genetic grouping of SARS-CoV-2 coronavirus sequences using informative subtype markers for pandemic spread visualization.利用有信息意义的亚型标记对 SARS-CoV-2 冠状病毒序列进行遗传分组,以可视化大流行传播。
PLoS Comput Biol. 2020 Sep 17;16(9):e1008269. doi: 10.1371/journal.pcbi.1008269. eCollection 2020 Sep.
3
Evolving geographic diversity in SARS-CoV2 and in silico analysis of replicating enzyme 3CL targeting repurposed drug candidates.SARS-CoV2 的地理多样性演变和针对复制酶 3CL 的计算机分析,以寻找可再利用药物的候选物。
J Transl Med. 2020 Jul 9;18(1):278. doi: 10.1186/s12967-020-02448-z.
4
Assessing oligonucleotide designs from early lab developed PCR diagnostic tests for SARS-CoV-2 using the PCR_strainer pipeline.使用 PCR_strainer 管道评估早期实验室开发的用于 SARS-CoV-2 的 PCR 诊断检测的寡核苷酸设计。
J Clin Virol. 2020 Oct;131:104581. doi: 10.1016/j.jcv.2020.104581. Epub 2020 Aug 21.
5
COVID19-world: a shiny application to perform comprehensive country-specific data visualization for SARS-CoV-2 epidemic.COVID19 世界:一个闪亮的应用程序,用于对 SARS-CoV-2 疫情进行全面的特定国家数据可视化。
BMC Med Res Methodol. 2020 Sep 21;20(1):235. doi: 10.1186/s12874-020-01121-9.
6
Emerging of a SARS-CoV-2 viral strain with a deletion in nsp1.出现一种nsp1基因有缺失的新型严重急性呼吸综合征冠状病毒2(SARS-CoV-2)毒株。
J Transl Med. 2020 Aug 31;18(1):329. doi: 10.1186/s12967-020-02507-5.
7
International Expansion of a Novel SARS-CoV-2 Mutant.一种新型严重急性呼吸综合征冠状病毒2(SARS-CoV-2)突变株的国际传播
J Virol. 2020 Jun 1;94(12). doi: 10.1128/JVI.00567-20.
8
Unfolding SARS-CoV-2 viral genome to understand its gene expression regulation.解析 SARS-CoV-2 病毒基因组以了解其基因表达调控。
Infect Genet Evol. 2020 Oct;84:104386. doi: 10.1016/j.meegid.2020.104386. Epub 2020 May 29.
9
The 2019 novel coronavirus resource.2019新型冠状病毒资源
Yi Chuan. 2020 Feb 20;42(2):212-221. doi: 10.16288/j.yczz.20-030.
10
Rescue of SARS-CoV-2 from a Single Bacterial Artificial Chromosome.从单个细菌人工染色体中拯救 SARS-CoV-2。
mBio. 2020 Sep 25;11(5):e02168-20. doi: 10.1128/mBio.02168-20.

引用本文的文献

1
On the collection and integration of SARS-CoV-2 genome data.关于严重急性呼吸综合征冠状病毒2(SARS-CoV-2)基因组数据的收集与整合
Biosaf Health. 2023 Jul 4;5(4):204-210. doi: 10.1016/j.bsheal.2023.07.004. eCollection 2023 Aug.
2
VITALdb: to select the best viroinformatics tools for a desired virus or application.VITALdb:为所需病毒或应用选择最佳的病毒信息学工具。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf084.
3
Trends and impacts of SARS-CoV-2 genome sharing: a comparative analysis of China and the global community, 2020-2023.

本文引用的文献

1
An interactive web-based dashboard to track COVID-19 in real time.一个基于网络的交互式仪表盘,用于实时追踪新冠病毒。
Lancet Infect Dis. 2020 May;20(5):533-534. doi: 10.1016/S1473-3099(20)30120-1. Epub 2020 Feb 19.
2
Data, disease and diplomacy: GISAID's innovative contribution to global health.数据、疾病与外交:全球共享流感数据倡议组织对全球健康的创新贡献。
Glob Chall. 2017 Jan 10;1(1):33-46. doi: 10.1002/gch2.1018. eCollection 2017 Jan.
3
VAPiD: a lightweight cross-platform viral annotation pipeline and identification tool to facilitate virus genome submissions to NCBI GenBank.
2020 - 2023年新冠病毒基因组共享的趋势与影响:中国与全球的比较分析
Front Public Health. 2024 Nov 20;12:1491623. doi: 10.3389/fpubh.2024.1491623. eCollection 2024.
4
Some common deleterious mutations are shared in SARS-CoV-2 genomes from deceased COVID-19 patients across continents.一些常见的有害突变存在于来自不同大洲的 COVID-19 死亡患者的 SARS-CoV-2 基因组中。
Sci Rep. 2023 Oct 30;13(1):18644. doi: 10.1038/s41598-023-45517-1.
5
Applying the digital data and the bioinformatics tools in SARS-CoV-2 research.将数字数据和生物信息学工具应用于2019冠状病毒病研究。
Comput Struct Biotechnol J. 2023 Oct 1;21:4697-4705. doi: 10.1016/j.csbj.2023.09.044. eCollection 2023.
6
Temporal dynamics and fatality of SARS-CoV-2 variants in Bangladesh.孟加拉国新冠病毒变异株的时间动态和致死率
Health Sci Rep. 2023 Apr 17;6(4):e1209. doi: 10.1002/hsr2.1209. eCollection 2023 Apr.
7
Mutations in SARS-CoV-2: Insights on structure, variants, vaccines, and biomedical interventions.SARS-CoV-2 突变:结构、变体、疫苗和生物医学干预的洞察。
Biomed Pharmacother. 2023 Jan;157:113977. doi: 10.1016/j.biopha.2022.113977. Epub 2022 Nov 7.
8
Transcriptomics and RNA-Based Therapeutics as Potential Approaches to Manage SARS-CoV-2 Infection.转录组学和基于 RNA 的治疗方法作为管理 SARS-CoV-2 感染的潜在方法。
Int J Mol Sci. 2022 Sep 21;23(19):11058. doi: 10.3390/ijms231911058.
9
High Performance Integration Pipeline for Viral and Epitope Sequences.用于病毒和表位序列的高性能整合管道
BioTech (Basel). 2022 Mar 21;11(1):7. doi: 10.3390/biotech11010007.
10
NGS data vectorization, clustering, and finding key codons in SARS-CoV-2 variations.对 SARS-CoV-2 变异进行 NGS 数据矢量化、聚类和寻找关键密码子。
BMC Bioinformatics. 2022 May 17;23(1):187. doi: 10.1186/s12859-022-04718-7.
VAPiD:一个轻量级跨平台病毒注释管道和鉴定工具,可方便病毒基因组提交到 NCBI GenBank。
BMC Bioinformatics. 2019 Jan 23;20(1):48. doi: 10.1186/s12859-019-2606-y.
4
Nextstrain: real-time tracking of pathogen evolution.Nextstrain:实时追踪病原体进化。
Bioinformatics. 2018 Dec 1;34(23):4121-4123. doi: 10.1093/bioinformatics/bty407.
5
NCBI viral genomes resource.美国国立生物技术信息中心病毒基因组资源
Nucleic Acids Res. 2015 Jan;43(Database issue):D571-7. doi: 10.1093/nar/gku1207. Epub 2014 Nov 26.
6
MAFFT multiple sequence alignment software version 7: improvements in performance and usability.MAFFT 多序列比对软件版本 7:性能和易用性的改进。
Mol Biol Evol. 2013 Apr;30(4):772-80. doi: 10.1093/molbev/mst010. Epub 2013 Jan 16.
7
A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3.一个用于注释和预测单核苷酸多态性影响的程序,即SnpEff:黑腹果蝇品系w1118、iso-2、iso-3基因组中的单核苷酸多态性。
Fly (Austin). 2012 Apr-Jun;6(2):80-92. doi: 10.4161/fly.19695.
8
A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data.一种用于从测序数据中进行 SNP 调用、突变发现、关联映射和群体遗传参数估计的统计框架。
Bioinformatics. 2011 Nov 1;27(21):2987-93. doi: 10.1093/bioinformatics/btr509. Epub 2011 Sep 8.
9
VIGOR, an annotation program for small viral genomes.VIGOR,一个小型病毒基因组注释程序。
BMC Bioinformatics. 2010 Sep 7;11:451. doi: 10.1186/1471-2105-11-451.
10
The EMBL Nucleotide Sequence Database.欧洲分子生物学实验室核苷酸序列数据库。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D29-33. doi: 10.1093/nar/gki098.