
Similar articles

1
ReadItAndKeep: rapid decontamination of SARS-CoV-2 sequencing reads.
Bioinformatics. 2022 Jun 13;38(12):3291-3293. doi: 10.1093/bioinformatics/btac311.
2
Hostile: accurate decontamination of microbial host sequences.
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad728.
3
Nubeam-dedup: a fast and RAM-efficient tool to de-duplicate sequencing reads without mapping.
Bioinformatics. 2020 May 1;36(10):3254-3256. doi: 10.1093/bioinformatics/btaa112.
4
BleTIES: annotation of natural genome editing in ciliates using long read sequencing.
Bioinformatics. 2021 Nov 5;37(21):3929-3931. doi: 10.1093/bioinformatics/btab613.
5
lordFAST: sensitive and Fast Alignment Search Tool for LOng noisy Read sequencing Data.
Bioinformatics. 2019 Jan 1;35(1):20-27. doi: 10.1093/bioinformatics/bty544.
6
WALT: fast and accurate read mapping for bisulfite sequencing.
Bioinformatics. 2016 Nov 15;32(22):3507-3509. doi: 10.1093/bioinformatics/btw490. Epub 2016 Jul 27.
7
PredictION: a predictive model to establish the performance of Oxford sequencing reads of SARS-CoV-2.
PeerJ. 2022 Nov 30;10:e14425. doi: 10.7717/peerj.14425. eCollection 2022.
8
A spectral algorithm for fast de novo layout of uncorrected long nanopore reads.
Bioinformatics. 2017 Oct 15;33(20):3188-3194. doi: 10.1093/bioinformatics/btx370.
9
Toward perfect reads: self-correction of short reads via mapping on de Bruijn graphs.
Bioinformatics. 2020 Mar 1;36(5):1374-1381. doi: 10.1093/bioinformatics/btz102.
10
NextPolish: a fast and efficient genome polishing tool for long-read assembly.
Bioinformatics. 2020 Apr 1;36(7):2253-2255. doi: 10.1093/bioinformatics/btz891.

Cited by

1
Bioinformatic approaches to blood and tissue microbiome analyses: challenges and perspectives.
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf176.
2
SWGTS-a platform for stream-based host DNA depletion.
Bioinformatics. 2024 Jun 3;40(6). doi: 10.1093/bioinformatics/btae332.
3
Nanopore sequencing technology and its applications.
MedComm (2020). 2023 Jul 10;4(4):e316. doi: 10.1002/mco2.316. eCollection 2023 Aug.
4
Low expression of EXOSC2 protects against clinical COVID-19 and impedes SARS-CoV-2 replication.
Life Sci Alliance. 2022 Oct 14;6(1). doi: 10.26508/lsa.202201449. Print 2023 Jan.
5
Low expression of EXOSC2 protects against clinical COVID-19 and impedes SARS-CoV-2 replication.
bioRxiv. 2022 Mar 7:2022.03.06.483172. doi: 10.1101/2022.03.06.483172.

ReadItAndKeep: rapid decontamination of SARS-CoV-2 sequencing reads.

Affiliations

EMBL-EBI, Cambridge CB10 1SD, UK.

Nuffield Department of Medicine, University of Oxford, Oxford OX3 9DU, UK.

Publication information

Bioinformatics. 2022 Jun 13;38(12):3291-3293. doi: 10.1093/bioinformatics/btac311.

DOI: 10.1093/bioinformatics/btac311
PMID: 35551365
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9191204/
Abstract

SUMMARY

Viral sequence data from clinical samples frequently contain contaminating human reads, which must be removed prior to sharing for legal and ethical reasons. To enable host read removal for SARS-CoV-2 sequencing data on low-specification laptops, we developed ReadItAndKeep, a fast lightweight tool for Illumina and nanopore data that only keeps reads matching the SARS-CoV-2 genome. Peak RAM usage is typically below 10 MB, and runtime less than 1 min. We show that by excluding the polyA tail from the viral reference, ReadItAndKeep prevents bleed-through of human reads, whereas mapping to the human genome lets some reads escape. We believe our test approach (including all possible reads from the human genome, human samples from each of the 26 populations in the 1000 genomes data and a diverse set of SARS-CoV-2 genomes) will also be useful for others.

AVAILABILITY AND IMPLEMENTATION

ReadItAndKeep is implemented in C++, released under the MIT license, and available from https://github.com/GenomePathogenAnalysisService/read-it-and-keep.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.
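
The keep-only-matching-reads idea in the summary, and the reason for excluding the polyA tail from the viral reference, can be illustrated with a minimal Python sketch. This is not ReadItAndKeep's implementation: it swaps in a simple exact k-mer lookup as a stand-in for read mapping, and every function name, the k-mer size, and the thresholds (`min_run`, `min_fraction`) are assumptions chosen for illustration only.

```python
from typing import Iterable, Iterator, List, Set, Tuple

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def strip_poly_a(ref: str, min_run: int = 10) -> str:
    """Drop a trailing run of 'A' from the reference if it is at least min_run long."""
    stripped = ref.rstrip("A")
    if len(ref) - len(stripped) >= min_run:
        return stripped
    return ref


def revcomp(seq: str) -> str:
    """Reverse complement of a DNA sequence (non-ACGT characters are left as-is)."""
    return seq.translate(_COMPLEMENT)[::-1]


def kmers(seq: str, k: int) -> Iterator[str]:
    """Yield every overlapping k-mer of seq."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def build_index(ref: str, k: int) -> Set[str]:
    """Collect the k-mers of the reference on both strands."""
    index = set(kmers(ref, k))
    index.update(kmers(revcomp(ref), k))
    return index


def keep_read(read: str, index: Set[str], k: int, min_fraction: float = 0.5) -> bool:
    """Keep a read only if enough of its k-mers occur in the reference index."""
    total = len(read) - k + 1
    if total <= 0:
        return False  # read shorter than k: nothing to match, so discard it
    hits = sum(1 for kmer in kmers(read, k) if kmer in index)
    return hits / total >= min_fraction


def filter_reads(
    reads: Iterable[Tuple[str, str]],
    ref: str,
    k: int = 15,
    min_fraction: float = 0.5,
) -> List[Tuple[str, str]]:
    """Return the (name, sequence) pairs that match the polyA-trimmed reference."""
    index = build_index(strip_poly_a(ref), k)
    return [
        (name, seq)
        for name, seq in reads
        if keep_read(seq.upper(), index, k, min_fraction)
    ]
```

In this toy setting, a read made up only of `A` bases would match a reference that still carries its polyA tail, so it would be kept; once the tail is trimmed from the reference, the same read no longer matches and is discarded, while reads drawn from the rest of the reference on either strand are retained.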
