• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MarFERReT,一个海洋微生物真核生物功能基因的开源、版本受控参考文库。

MarFERReT, an open-source, version-controlled reference library of marine microbial eukaryote functional genes.

作者信息

Groussman R D, Blaskowski S, Coesel S N, Armbrust E V

机构信息

School of Oceanography, University of Washington, Benjamin Hall IRB, Room 306 616 NE Northlake Place, Seattle, WA, 98105, USA.

Molecular Engineering and Sciences Institute, University of Washington, Molecular Engineering & Sciences Building 3946 W Stevens Way NE, Seattle, WA, 98195, USA.

出版信息

Sci Data. 2023 Dec 21;10(1):926. doi: 10.1038/s41597-023-02842-4.

DOI:10.1038/s41597-023-02842-4
PMID:38129449
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10739892/
Abstract

Metatranscriptomics generates large volumes of sequence data about transcribed genes in natural environments. Taxonomic annotation of these datasets depends on availability of curated reference sequences. For marine microbial eukaryotes, current reference libraries are limited by gaps in sequenced organism diversity and barriers to updating libraries with new sequence data, resulting in taxonomic annotation of about half of eukaryotic environmental transcripts. Here, we introduce Marine Functional EukaRyotic Reference Taxa (MarFERReT), a marine microbial eukaryotic sequence library designed for use with taxonomic annotation of eukaryotic metatranscriptomes. We gathered 902 publicly accessible marine eukaryote genomes and transcriptomes and assessed their sequence quality and cross-contamination issues, selecting 800 validated entries for inclusion in MarFERReT. Version 1.1 of MarFERReT contains reference sequences from 800 marine eukaryotic genomes and transcriptomes, covering 453 species- and strain-level taxa, totaling nearly 28 million protein sequences with associated NCBI and PR Taxonomy identifiers and Pfam functional annotations. The MarFERReT project repository hosts containerized build scripts, documentation on installation and use case examples, and information on new versions of MarFERReT.

摘要

宏转录组学可生成有关自然环境中转录基因的大量序列数据。这些数据集的分类注释取决于经过整理的参考序列的可用性。对于海洋微生物真核生物而言,当前的参考文库受到测序生物多样性缺口以及用新序列数据更新文库的障碍的限制,导致约一半的真核生物环境转录本的分类注释受到影响。在此,我们引入了海洋功能性真核生物参考分类群(MarFERReT),这是一个设计用于真核生物宏转录组分类注释的海洋微生物真核生物序列文库。我们收集了902个可公开获取的海洋真核生物基因组和转录组,并评估了它们的序列质量和交叉污染问题,选择了800个经过验证的条目纳入MarFERReT。MarFERReT 1.1版本包含来自800个海洋真核生物基因组和转录组的参考序列,涵盖453个物种和菌株水平的分类群,共有近2800万个蛋白质序列,并带有相关的NCBI和PR分类标识符以及Pfam功能注释。MarFERReT项目存储库托管容器化的构建脚本、关于安装和用例示例的文档以及有关MarFERReT新版本的信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6222/10739892/a34c026d1886/41597_2023_2842_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6222/10739892/42cefaa9b260/41597_2023_2842_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6222/10739892/1fc952488644/41597_2023_2842_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6222/10739892/fca90641d95c/41597_2023_2842_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6222/10739892/a34c026d1886/41597_2023_2842_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6222/10739892/42cefaa9b260/41597_2023_2842_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6222/10739892/1fc952488644/41597_2023_2842_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6222/10739892/fca90641d95c/41597_2023_2842_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6222/10739892/a34c026d1886/41597_2023_2842_Fig4_HTML.jpg

相似文献

1
MarFERReT, an open-source, version-controlled reference library of marine microbial eukaryote functional genes.MarFERReT,一个海洋微生物真核生物功能基因的开源、版本受控参考文库。
Sci Data. 2023 Dec 21;10(1):926. doi: 10.1038/s41597-023-02842-4.
2
Aspects of Genetic Diversity, Host Specificity and Public Health Significance of Single-Celled Intestinal Parasites Commonly Observed in Humans and Mostly Referred to as 'Non-Pathogenic'.人类常见且大多被称为“非致病性”的单细胞肠道寄生虫的遗传多样性、宿主特异性及公共卫生意义
APMIS. 2025 Sep;133(9):e70036. doi: 10.1111/apm.70036.
3
Use to identify commercially available American Type Culture Collection strains based on sequence queries.用于根据序列查询识别市售的美国典型培养物保藏中心菌株。
PeerJ. 2025 Aug 13;13:e19832. doi: 10.7717/peerj.19832. eCollection 2025.
4
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
5
Evaluation of DNA barcoding reference databases for marine species in the western and central Pacific Ocean.西太平洋和中太平洋海洋物种DNA条形码参考数据库评估
PeerJ. 2025 Jul 14;13:e19674. doi: 10.7717/peerj.19674. eCollection 2025.
6
Rethinking large-scale phylogenomics with EukPhylo v.1.0, a flexible toolkit to enable phylogeny-informed data curation and analyses of diverse eukaryotic lineages.使用EukPhylo v.1.0重新思考大规模系统发育基因组学,这是一个灵活的工具包,可实现系统发育信息指导的数据管理以及对多种真核生物谱系的分析。
mBio. 2025 Aug 27:e0177025. doi: 10.1128/mbio.01770-25.
7
KSGP 3.1: improved taxonomic annotation of Archaea communities using LotuS2, the genome taxonomy database and RNAseq data.KSGP 3.1:使用LotuS2、基因组分类数据库和RNAseq数据改进古菌群落的分类注释。
ISME Commun. 2025 Jun 3;5(1):ycaf094. doi: 10.1093/ismeco/ycaf094. eCollection 2025 Jan.
8
High-throughput library transgenesis in via Transgenic Arrays Resulting in Diversity of Integrated Sequences (TARDIS).利用 Transgenic Arrays Resulting in Diversity of Integrated Sequences (TARDIS) 进行 中的高通量文库转基因
Elife. 2023 Jul 4;12:RP84831. doi: 10.7554/eLife.84831.
9
Characterization of microbial dark matter at scale with MetaSBT and taxonomy-aware Sequence Bloom Trees.使用MetaSBT和分类学感知序列布隆树对大规模微生物暗物质进行表征。
bioRxiv. 2025 Aug 30:2025.08.25.672238. doi: 10.1101/2025.08.25.672238.
10
ParAquaSeq, a Database of Ecologically Annotated rRNA Sequences Covering Zoosporic Parasites Infecting Aquatic Primary Producers in Natural and Industrial Systems.ParAquaSeq,一个涵盖在自然和工业系统中感染水生初级生产者的游动孢子寄生虫的生态注释rRNA序列数据库。
Mol Ecol Resour. 2025 Aug;25(6):e14099. doi: 10.1111/1755-0998.14099. Epub 2025 Mar 15.

引用本文的文献

1
Environmental adaptations in metagenomes revealed by deep learning.深度学习揭示的宏基因组中的环境适应性
BMC Biol. 2025 Aug 11;23(1):252. doi: 10.1186/s12915-025-02361-1.
2
Proportional relationship between transcript concentrations and carbon biomass for open ocean plankton groups.大洋浮游生物类群中转录本浓度与碳生物量之间的比例关系。
ISME J. 2025 Jan 2;19(1). doi: 10.1093/ismejo/wraf079.
3
Microbial functional diversity and redundancy: moving forward.微生物功能多样性与冗余性:前行之路

本文引用的文献

1
Convergent evolution and horizontal gene transfer in Arctic Ocean microalgae.北极海洋微藻的趋同进化和水平基因转移。
Life Sci Alliance. 2022 Dec 15;6(3). doi: 10.26508/lsa.202201833. Print 2023 Mar.
2
InterPro in 2022.InterPro 在 2022 年。
Nucleic Acids Res. 2023 Jan 6;51(D1):D418-D427. doi: 10.1093/nar/gkac993.
3
The dynamic trophic architecture of open-ocean protist communities revealed through machine-guided metatranscriptomics.通过机器引导的宏转录组学揭示开阔海域原生生物群落的动态营养结构。
FEMS Microbiol Rev. 2025 Jan 14;49. doi: 10.1093/femsre/fuae031.
4
The North Pacific Eukaryotic Gene Catalog of metatranscriptome assemblies and annotations.北太平洋真核生物宏转录组组装和注释基因目录。
Sci Data. 2024 Oct 22;11(1):1161. doi: 10.1038/s41597-024-04005-5.
5
Digital Microbe: a genome-informed data integration framework for team science on emerging model organisms.数字微生物:用于新兴模式生物的团队科学的基于基因组的综合数据框架。
Sci Data. 2024 Sep 4;11(1):967. doi: 10.1038/s41597-024-03778-z.
6
First regional reference database of northern Adriatic diatom transcriptomes.首个亚得里亚海北部硅藻转录组区域参考数据库。
Sci Rep. 2024 Jul 13;14(1):16209. doi: 10.1038/s41598-024-67043-4.
Proc Natl Acad Sci U S A. 2022 Feb 15;119(7). doi: 10.1073/pnas.2100916119.
4
Deeply conserved synteny and the evolution of metazoan chromosomes.深度保守的染色体同线性与后生动物染色体的进化
Sci Adv. 2022 Feb 4;8(5):eabi5884. doi: 10.1126/sciadv.abi5884. Epub 2022 Feb 2.
5
Diel-Regulated Transcriptional Cascades of Microbial Eukaryotes in the North Pacific Subtropical Gyre.北太平洋亚热带环流中微生物真核生物的昼夜节律调控转录级联反应。
Front Microbiol. 2021 Sep 29;12:682651. doi: 10.3389/fmicb.2021.682651. eCollection 2021.
6
The genome of a nonphotosynthetic diatom provides insights into the metabolic shift to heterotrophy and constraints on the loss of photosynthesis.非光合硅藻的基因组为研究代谢向异养的转变以及光合作用丧失的限制因素提供了线索。
New Phytol. 2021 Nov;232(4):1750-1764. doi: 10.1111/nph.17673. Epub 2021 Sep 3.
7
Decontamination, pooling and dereplication of the 678 samples of the Marine Microbial Eukaryote Transcriptome Sequencing Project.海洋微生物真核生物转录组测序项目的 678 个样本的净化、汇集和去重。
BMC Res Notes. 2021 Aug 9;14(1):306. doi: 10.1186/s13104-021-05717-2.
8
The Genome of the Haptophyte Diacronema lutheri (Pavlova lutheri, Pavlovales): A Model for Lipid Biosynthesis in Eukaryotic Algae.《叶滴虫藻(Pavlova lutheri,Pavlovales)基因组:真核藻类中脂质生物合成的模式》。
Genome Biol Evol. 2021 Aug 3;13(8). doi: 10.1093/gbe/evab178.
9
Diploid genomic architecture of Nitzschia inconspicua, an elite biomass production diatom.具精英生物质生产能力的硅藻小环藻的二倍体基因组结构。
Sci Rep. 2021 Aug 2;11(1):15592. doi: 10.1038/s41598-021-95106-3.
10
The American lobster genome reveals insights on longevity, neural, and immune adaptations.美国龙虾基因组揭示了长寿、神经和免疫适应的见解。
Sci Adv. 2021 Jun 23;7(26). doi: 10.1126/sciadv.abe8290. Print 2021 Jun.