文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

Pytrf:一个用于从基因组序列中查找串联重复序列的Python软件包。

Pytrf: a python package for finding tandem repeats from genomic sequences.

作者信息

Du Lianming, Sun Dalin, Chen Jiahao, Zhou Xinyi, Zhao Kelei, Zeng Qianglin, Yang Nan

机构信息

Antibiotics Research and Re-Evaluation Key Laboratory of Sichuan Province, Institute for Advanced Study, Chengdu University, Chengdu, 610106, China.

Key Laboratory of Qinghai-Tibetan Plateau Animal Genetic Resource Reservation and Utilization, Sichuan Province and Ministry of Education, Southwest Minzu University, Chengdu, 610225, China.

出版信息

BMC Bioinformatics. 2025 Jun 4;26(1):151. doi: 10.1186/s12859-025-06168-3.


DOI:10.1186/s12859-025-06168-3
PMID:40462000
Abstract

BACKGROUND: Tandem repeats (TRs) are major sources of genetic variation and important genetic markers. Their expansions are not only involved in gene expression regulation but also associated with many nervous system diseases and cancers. However, there is a lack of an efficient tandem repeat identification tool for seamless integration with larger bioinformatics programs developed with the popular Python language. RESULTS: We introduce pytrf, a Python package for identification of both exact and approximate TRs from genomic sequences. It allows seamless embedding into other programs developed by Python or using in Python interactive environment and Jupyter notebooks. It also provides command line tools for assisting users to find tandem repeats from FASTA/Q files. Compared to other tools, the pytrf shows the highest performance in aspect of running time with comparable peak memory usage. CONCLUSIONS: Pytrf provides simple interfaces and command line tools to facilitate identification of tandem repeats from genomic sequences. Pytrf can easily be installed from PyPI ( https://pypi.org/project/pytrf ) and the source code is freely available at https://github.com/lmdu/pytrf .

摘要

背景:串联重复序列(TRs)是遗传变异的主要来源和重要的遗传标记。它们的扩增不仅参与基因表达调控,还与许多神经系统疾病和癌症相关。然而,缺乏一种能与使用流行的Python语言开发的更大的生物信息学程序无缝集成的高效串联重复序列识别工具。 结果:我们引入了pytrf,一个用于从基因组序列中识别精确和近似TRs的Python包。它允许无缝嵌入到由Python开发的其他程序中,或在Python交互式环境和Jupyter笔记本中使用。它还提供命令行工具,以协助用户从FASTA/Q文件中查找串联重复序列。与其他工具相比,pytrf在运行时间方面表现出最高性能,峰值内存使用量相当。 结论:Pytrf提供了简单的接口和命令行工具,便于从基因组序列中识别串联重复序列。Pytrf可以很容易地从PyPI(https://pypi.org/project/pytrf)安装,并且源代码可在https://github.com/lmdu/pytrf免费获取。

相似文献

[1]
Pytrf: a python package for finding tandem repeats from genomic sequences.

BMC Bioinformatics. 2025-6-4

[2]
Pyfastx: a robust Python package for fast random access to sequences from plain and gzipped FASTA/Q files.

Brief Bioinform. 2021-7-20

[3]
PERF: an exhaustive algorithm for ultra-fast and efficient identification of microsatellites from large DNA sequences.

Bioinformatics. 2018-3-15

[4]
Krait2: a versatile software for microsatellite investigation, visualization and marker development.

BMC Genomics. 2025-1-25

[5]
JBrowse Jupyter: a Python interface to JBrowse 2.

Bioinformatics. 2023-1-1

[6]
plotnineSeqSuite: a Python package for visualizing sequence data using ggplot2 style.

BMC Genomics. 2023-10-3

[7]
Visualizing protein big data using Python and Jupyter notebooks.

Biochem Mol Biol Educ. 2022-9

[8]
TRTools: a toolkit for genome-wide analysis of tandem repeats.

Bioinformatics. 2021-5-5

[9]
Pygenprop: a Python library for programmatic exploration and comparison of organism genome properties.

Bioinformatics. 2019-12-1

[10]
Krait: an ultrafast tool for genome-wide survey of microsatellites and primer design.

Bioinformatics. 2018-2-15

本文引用的文献

[1]
ULTRA-effective labeling of tandem repeats in genomic sequence.

Bioinform Adv. 2024-10-9

[2]
Building a catalogue of short tandem repeats in diverse populations.

Nat Rev Genet. 2024-7

[3]
A deep population reference panel of tandem repeat variation.

Nat Commun. 2023-10-23

[4]
Short tandem repeats bind transcription factors to tune eukaryotic gene expression.

Science. 2023-9-22

[5]
A landscape of complex tandem repeats within individual human genomes.

Nat Commun. 2023-9-14

[6]
Genome-wide identification of tandem repeats associated with splicing variation across 49 tissues in humans.

Genome Res. 2023-3

[7]
The motif composition of variable number tandem repeats impacts gene expression.

Genome Res. 2023-4

[8]
Native functions of short tandem repeats.

Elife. 2023-3-20

[9]
RPTRF: A rapid perfect tandem repeat finder tool for DNA sequences.

Biosystems. 2023-4

[10]
Recurrent repeat expansions in human cancer genomes.

Nature. 2023-1

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索