• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

最短物种特异性寡核苷酸序列的鉴定。

Identification of the shortest species-specific oligonucleotide sequences.

作者信息

Mouratidis Ioannis, Konnaris Maxwell A, Chantzi Nikol, Chan Candace S Y, Patsakis Michail, Provatas Kimonas, Montgomery Austin, Baltoumas Fotis A, Sha Congzhou M, Mareboina Manvita, Pavlopoulos Georgios A, Chartoumpekis Dionysios V, Georgakopoulos-Soares Ilias

机构信息

Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, Pennsylvania 17033, USA.

Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA.

出版信息

Genome Res. 2025 Feb 14;35(2):279-295. doi: 10.1101/gr.280070.124.

DOI:10.1101/gr.280070.124
PMID:39746719
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11874967/
Abstract

Despite the exponential increase in sequencing information driven by massively parallel DNA sequencing technologies, universal and succinct genomic fingerprints for each organism are still missing. Identifying the shortest species-specific nucleotide sequences offers insights into species evolution and holds potential practical applications in agriculture, wildlife conservation, and healthcare. We propose a new method for sequence analysis termed nucleic "quasi-primes," the shortest occurring sequences in each of 45,076 organismal reference genomes, present in one genome and absent from every other examined genome. In the human genome, we find that the genomic loci of nucleic quasi-primes are most enriched for genes associated with brain development and cognitive function. In a single-cell case study focusing on the human primary motor cortex, nucleic quasi-prime genes account for a significantly larger proportion of the variation based on average gene expression. Nonneuronal cell types, including astrocytes, endothelial cells, microglia perivascular-macrophages, oligodendrocytes, and vascular and leptomeningeal cells, exhibit significant activation of quasi-prime-containing gene associations related to cancer, whereas simultaneously suppressing quasi-prime-containing genes are associated with cognitive, mental, and developmental disorders. We also show that human disease-causing variants, eQTLs, mQTLs, and sQTLs are 4.43-fold, 4.34-fold, 4.29-fold, and 4.21-fold enriched at human quasi-prime loci, respectively. These findings indicate that nucleic quasi-primes are genomic loci linked to the evolution of species-specific traits, and in humans, they provide insights in the development of cognitive traits and human diseases, including neurodevelopmental disorders.

摘要

尽管大规模平行DNA测序技术推动测序信息呈指数级增长,但仍缺乏针对每个生物体的通用且简洁的基因组指纹。识别最短的物种特异性核苷酸序列有助于深入了解物种进化,并在农业、野生动物保护和医疗保健领域具有潜在的实际应用价值。我们提出了一种新的序列分析方法,称为核酸“准质数”,它是45076个生物体参考基因组中每个基因组中出现的最短序列,存在于一个基因组中,而在其他所有检测的基因组中均不存在。在人类基因组中,我们发现核酸准质数的基因组位点在与大脑发育和认知功能相关的基因中最为富集。在一项针对人类初级运动皮层的单细胞案例研究中,基于平均基因表达,核酸准质数基因在变异中所占比例显著更大。非神经元细胞类型,包括星形胶质细胞、内皮细胞、小胶质细胞、血管周围巨噬细胞、少突胶质细胞以及血管和软脑膜细胞,表现出与癌症相关的含准质数基因关联的显著激活,而同时抑制含准质数的基因则与认知、精神和发育障碍相关。我们还表明,人类致病变异、eQTL、mQTL和sQTL在人类准质数位点的富集倍数分别为4.43倍、4.34倍、4.29倍和4.21倍。这些发现表明,核酸准质数是与物种特异性性状进化相关的基因组位点,在人类中,它们为认知性状和人类疾病(包括神经发育障碍)的发展提供了见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fe2/11874967/3f1af408d93d/279f07.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fe2/11874967/74e74c317340/279f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fe2/11874967/7f4a7f42640e/279f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fe2/11874967/4c729f2f4ab4/279f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fe2/11874967/4cfb3240fdbf/279f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fe2/11874967/a942a7c00a81/279f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fe2/11874967/1851c1471945/279f06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fe2/11874967/3f1af408d93d/279f07.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fe2/11874967/74e74c317340/279f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fe2/11874967/7f4a7f42640e/279f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fe2/11874967/4c729f2f4ab4/279f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fe2/11874967/4cfb3240fdbf/279f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fe2/11874967/a942a7c00a81/279f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fe2/11874967/1851c1471945/279f06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fe2/11874967/3f1af408d93d/279f07.jpg

相似文献

1
Identification of the shortest species-specific oligonucleotide sequences.最短物种特异性寡核苷酸序列的鉴定。
Genome Res. 2025 Feb 14;35(2):279-295. doi: 10.1101/gr.280070.124.
2
Quasi-prime peptides: identification of the shortest peptide sequences unique to a species.准主要肽段:某一物种特有的最短肽段序列的鉴定
NAR Genom Bioinform. 2023 Apr 24;5(2):lqad039. doi: 10.1093/nargab/lqad039. eCollection 2023 Jun.
3
A Catalogue of 59,732 Human-Specific Regulatory Sequences Reveals Unique-to-Human Regulatory Patterns Associated with Virus-Interacting Proteins, Pluripotency, and Brain Development.59732 个人类特异性调控序列目录揭示了与病毒相互作用蛋白、多能性和脑发育相关的人类特有的调控模式。
DNA Cell Biol. 2020 Jan;39(1):126-143. doi: 10.1089/dna.2019.4988. Epub 2019 Nov 15.
4
The complete genome of an individual by massively parallel DNA sequencing.通过大规模平行DNA测序获得个体的完整基因组。
Nature. 2008 Apr 17;452(7189):872-6. doi: 10.1038/nature06884.
5
Molecular diversity and phenotypic pleiotropy of ancient genomic regulatory loci derived from human endogenous retrovirus type H (HERVH) promoter LTR7 and HERVK promoter LTR5_Hs and their contemporary impacts on pathophysiology of Modern Humans.源自人类内源性逆转录病毒 H 型(HERVH)启动子 LTR7 和 HERVK 启动子 LTR5_Hs 的古老基因组调控位点的分子多样性和表型多效性及其对现代人类病理生理学的当代影响。
Mol Genet Genomics. 2022 Nov;297(6):1711-1740. doi: 10.1007/s00438-022-01954-7. Epub 2022 Sep 19.
6
Massively parallel sequencing: the new frontier of hematologic genomics.大规模平行测序:血液基因组学的新前沿。
Blood. 2013 Nov 7;122(19):3268-75. doi: 10.1182/blood-2013-07-460287. Epub 2013 Sep 10.
7
A programmable method for massively parallel targeted sequencing.一种用于大规模平行靶向测序的可编程方法。
Nucleic Acids Res. 2014 Jun;42(10):e88. doi: 10.1093/nar/gku282. Epub 2014 Apr 29.
8
kmerDB: A database encompassing the set of genomic and proteomic sequence information for each species.kmer数据库:一个包含每个物种基因组和蛋白质组序列信息集合的数据库。
Comput Struct Biotechnol J. 2024 Apr 21;23:1919-1928. doi: 10.1016/j.csbj.2024.04.050. eCollection 2024 Dec.
9
Improving the efficiency of genomic loci capture using oligonucleotide arrays for high throughput resequencing.利用寡核苷酸阵列提高基因组基因座捕获效率,实现高通量重测序。
BMC Genomics. 2009 Dec 31;10:646. doi: 10.1186/1471-2164-10-646.
10
Unraveling diversity by isolating peptide sequences specific to distinct taxonomic groups.通过分离不同分类群特有的肽序列来揭示多样性。
bioRxiv. 2025 Feb 10:2025.02.05.636664. doi: 10.1101/2025.02.05.636664.

本文引用的文献

1
Bioframe: operations on genomic intervals in Pandas dataframes.Bioframe:在 Pandas 数据框中操作基因组区间。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae088.
2
Utilizing nullomers in cell-free RNA for early cancer detection.利用无细胞 RNA 中的无义寡核苷酸进行早期癌症检测。
Cancer Gene Ther. 2024 Jun;31(6):861-870. doi: 10.1038/s41417-024-00741-3. Epub 2024 Feb 14.
3
TeloBase: a community-curated database of telomere sequences across the tree of life.TeloBase:一个经过社区策展的端粒序列数据库,涵盖了生命之树的各个分支。
Nucleic Acids Res. 2024 Jan 5;52(D1):D311-D321. doi: 10.1093/nar/gkad672.
4
Annotating and prioritizing human non-coding variants with RegulomeDB v.2.使用RegulomeDB v.2对人类非编码变异进行注释和优先级排序。
Nat Genet. 2023 May;55(5):724-726. doi: 10.1038/s41588-023-01365-3.
5
OrthoVenn3: an integrated platform for exploring and visualizing orthologous data across genomes.OrthoVenn3:一个用于跨基因组探索和可视化同源数据的集成平台。
Nucleic Acids Res. 2023 Jul 5;51(W1):W397-W403. doi: 10.1093/nar/gkad313.
6
A genomic timescale for placental mammal evolution.胎盘哺乳动物进化的基因组时间尺度。
Science. 2023 Apr 28;380(6643):eabl8189. doi: 10.1126/science.abl8189.
7
Quasi-prime peptides: identification of the shortest peptide sequences unique to a species.准主要肽段:某一物种特有的最短肽段序列的鉴定
NAR Genom Bioinform. 2023 Apr 24;5(2):lqad039. doi: 10.1093/nargab/lqad039. eCollection 2023 Jun.
8
DNA methylation QTL mapping across diverse human tissues provides molecular links between genetic variation and complex traits.在不同的人类组织中进行 DNA 甲基化 QTL 图谱绘制为遗传变异与复杂性状之间提供了分子联系。
Nat Genet. 2023 Jan;55(1):112-122. doi: 10.1038/s41588-022-01248-z. Epub 2022 Dec 12.
9
GENCODE: reference annotation for the human and mouse genomes in 2023.GENCODE:2023 年人类和小鼠基因组的参考注释。
Nucleic Acids Res. 2023 Jan 6;51(D1):D942-D949. doi: 10.1093/nar/gkac1071.
10
The UCSC Genome Browser database: 2023 update.UCSC 基因组浏览器数据库:2023 年更新。
Nucleic Acids Res. 2023 Jan 6;51(D1):D1188-D1195. doi: 10.1093/nar/gkac1072.