• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

EPIGEN-Brazil Initiative 资源:一个拉丁美洲推断面板和科学工作流程。

EPIGEN-Brazil Initiative resources: a Latin American imputation panel and the Scientific Workflow.

机构信息

Departamento de Biologia Geral, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, 31270-901, Brazil.

Instituto Mario Penna, Núcleo de Ensino e Pesquisa, Belo Horizonte, Minas Gerais, 30380-472, Brazil.

出版信息

Genome Res. 2018 Jul;28(7):1090-1095. doi: 10.1101/gr.225458.117. Epub 2018 Jun 14.

DOI:10.1101/gr.225458.117
PMID:29903722
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6028131/
Abstract

EPIGEN-Brazil is one of the largest Latin American initiatives at the interface of human genomics, public health, and computational biology. Here, we present two resources to address two challenges to the global dissemination of precision medicine and the development of the bioinformatics know-how to support it. To address the underrepresentation of non-European individuals in human genome diversity studies, we present the EPIGEN-5M+1KGP imputation panel-the fusion of the public 1000 Genomes Project (1KGP) Phase 3 imputation panel with haplotypes derived from the EPIGEN-5M data set (a product of the genotyping of 4.3 million SNPs in 265 admixed individuals from the EPIGEN-Brazil Initiative). When we imputed a target SNPs data set (6487 admixed individuals genotyped for 2.2 million SNPs from the EPIGEN-Brazil project) with the EPIGEN-5M+1KGP panel, we gained 140,452 more SNPs in total than when using the 1KGP Phase 3 panel alone and 788,873 additional high confidence SNPs ( ≥ 0.8). Thus, the major effect of the inclusion of the EPIGEN-5M data set in this new imputation panel is not only to gain more SNPs but also to improve the quality of imputation. To address the lack of transparency and reproducibility of bioinformatics protocols, we present a conceptual Scientific Workflow in the form of a website that models the scientific process (by including publications, flowcharts, masterscripts, documents, and bioinformatics protocols), making it accessible and interactive. Its applicability is shown in the context of the development of our EPIGEN-5M+1KGP imputation panel. The Scientific Workflow also serves as a repository of bioinformatics resources.

摘要

EPIGEN-Brazil 是人类基因组学、公共卫生和计算生物学交叉领域中最大的拉丁美洲倡议之一。在这里,我们提出了两个资源,以解决精准医学全球传播和支持它的生物信息学专业知识发展所面临的两个挑战。为了解决人类基因组多样性研究中代表性不足的非欧洲个体的问题,我们提出了 EPIGEN-5M+1KGP imputation 面板——将公共的 1000 基因组计划(1KGP)第三阶段 imputation 面板与来自 EPIGEN-5M 数据集的单倍型融合在一起(EPIGEN-5M 数据集是 265 名混合个体中 430 万个 SNP 基因分型的产物,来自 EPIGEN-Brazil 倡议)。当我们使用 EPIGEN-5M+1KGP 面板对一个目标 SNP 数据集(6487 名混合个体的 220 万个 SNP 进行基因分型,来自 EPIGEN-Brazil 项目)进行 imputation 时,我们总共获得了 140452 个额外的 SNP,比单独使用 1KGP 第三阶段面板时多了 140452 个 SNP,并且还获得了 788873 个额外的高置信度 SNP(≥0.8)。因此,在这个新的 imputation 面板中纳入 EPIGEN-5M 数据集的主要影响不仅是获得更多的 SNP,而且还提高了 imputation 的质量。为了解决生物信息学协议缺乏透明度和可重复性的问题,我们以网站的形式提出了一个概念性的科学工作流程,该流程通过包括出版物、流程图、主脚本、文档和生物信息学协议来模拟科学过程,使其具有可访问性和交互性。它的适用性在我们开发 EPIGEN-5M+1KGP imputation 面板的背景下得到了展示。科学工作流程还充当生物信息学资源的存储库。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f8c/6028131/e7f59ccd4a3f/1090f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f8c/6028131/5b45d073f011/1090f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f8c/6028131/b94decb8852d/1090f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f8c/6028131/e7f59ccd4a3f/1090f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f8c/6028131/5b45d073f011/1090f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f8c/6028131/b94decb8852d/1090f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f8c/6028131/e7f59ccd4a3f/1090f03.jpg

相似文献

1
EPIGEN-Brazil Initiative resources: a Latin American imputation panel and the Scientific Workflow.EPIGEN-Brazil Initiative 资源:一个拉丁美洲推断面板和科学工作流程。
Genome Res. 2018 Jul;28(7):1090-1095. doi: 10.1101/gr.225458.117. Epub 2018 Jun 14.
2
Evaluating the coverage and potential of imputing the exome microarray with next-generation imputation using the 1000 Genomes Project.使用千人基因组计划评估通过新一代插补法对外显子微阵列进行插补的覆盖范围和潜力。
PLoS One. 2014 Sep 9;9(9):e106681. doi: 10.1371/journal.pone.0106681. eCollection 2014.
3
HLA imputation in an admixed population: An assessment of the 1000 Genomes data as a training set.混合人群中的HLA基因填充:以千人基因组数据作为训练集的评估
Hum Immunol. 2016 Mar;77(3):307-312. doi: 10.1016/j.humimm.2015.11.004. Epub 2015 Nov 12.
4
Imputation Performance in Latin American Populations: Improving Rare Variants Representation With the Inclusion of Native American Genomes.拉丁美洲人群中的插补性能:通过纳入美洲原住民基因组改善罕见变异的表征
Front Genet. 2022 Jan 3;12:719791. doi: 10.3389/fgene.2021.719791. eCollection 2021.
5
A genotype imputation reference panel specific for native Southeast Asian populations.一个专门针对东南亚本土人群的基因型填充参考面板。
NPJ Genom Med. 2024 Oct 5;9(1):47. doi: 10.1038/s41525-024-00435-7.
6
Effect of genome-wide genotyping and reference panels on rare variants imputation.全基因组基因分型和参考面板对稀有变异体推断的影响。
J Genet Genomics. 2012 Oct 20;39(10):545-50. doi: 10.1016/j.jgg.2012.07.002. Epub 2012 Jul 24.
7
Evaluating the quality of the 1000 genomes project data.评估 1000 基因组计划数据的质量。
BMC Genomics. 2019 Aug 16;20(1):620. doi: 10.1186/s12864-019-5957-x.
8
A genotype imputation method for de-identified haplotype reference information by using recurrent neural network.基于循环神经网络的匿名单倍型参考信息基因型推断方法。
PLoS Comput Biol. 2020 Oct 1;16(10):e1008207. doi: 10.1371/journal.pcbi.1008207. eCollection 2020 Oct.
9
Evaluation of the imputation performance of the program IMPUTE in an admixed sample from Mexico City using several model designs.评价 IMPUTE 程序在使用多种模型设计的墨西哥城混合样本中的插补性能。
BMC Med Genomics. 2012 May 1;5:12. doi: 10.1186/1755-8794-5-12.
10
DISTMIX: direct imputation of summary statistics for unmeasured SNPs from mixed ethnicity cohorts.DISTMIX:从混合种族队列中直接推算未测量单核苷酸多态性的汇总统计量。
Bioinformatics. 2015 Oct 1;31(19):3099-104. doi: 10.1093/bioinformatics/btv348. Epub 2015 Jun 9.

引用本文的文献

1
The Consortium for Genomic Diversity, Ancestry, and Health in Colombia (CÓDIGO): building local capacity in genomics and bioinformatics.哥伦比亚基因组多样性、血统与健康联盟(CÓDIGO):建设基因组学和生物信息学方面的本地能力。
Commun Biol. 2025 Jul 17;8(1):1062. doi: 10.1038/s42003-025-08496-9.
2
Genome-Wide Insights into Internalizing Symptoms in Admixed Latin American Children.对拉丁裔混血儿童内化症状的全基因组洞察
Genes (Basel). 2025 Jan 8;16(1):63. doi: 10.3390/genes16010063.
3
Population molecular genetics in Brazil: From genomic databases and research to the implementation of precision medicine.

本文引用的文献

1
Evolutionary genomic dynamics of Peruvians before, during, and after the Inca Empire.秘鲁人在印加帝国前后的进化基因组动态。
Proc Natl Acad Sci U S A. 2018 Jul 10;115(28):E6526-E6535. doi: 10.1073/pnas.1720798115. Epub 2018 Jun 26.
2
The Evolution of Science in a Latin-American Country: Genetics and Genomics in Brazil.拉丁美洲国家的科学演进:巴西的遗传学和基因组学。
Genetics. 2018 Mar;208(3):823-832. doi: 10.1534/genetics.118.300690.
3
Inclusion of Population-specific Reference Panel from India to the 1000 Genomes Phase 3 Panel Improves Imputation Accuracy.
巴西的群体分子遗传学:从基因组数据库与研究到精准医学的实施。
J Community Genet. 2024 Nov 19. doi: 10.1007/s12687-024-00752-5.
4
Low Expression of SCN4B Predicts Poor Prognosis in Non-small Cell Lung Cancer.SCN4B低表达预示非小细胞肺癌预后不良。
Curr Cancer Drug Targets. 2025;25(5):445-466. doi: 10.2174/0115680096293516240607071915.
5
Comparing the effect of imputation reference panel composition in four distinct Latin American cohorts.比较四种不同拉丁美洲队列中插补参考面板组成的效果。
bioRxiv. 2024 Apr 15:2024.04.11.589057. doi: 10.1101/2024.04.11.589057.
6
Imputation Performance in Latin American Populations: Improving Rare Variants Representation With the Inclusion of Native American Genomes.拉丁美洲人群中的插补性能:通过纳入美洲原住民基因组改善罕见变异的表征
Front Genet. 2022 Jan 3;12:719791. doi: 10.3389/fgene.2021.719791. eCollection 2021.
7
Association Analysis of Candidate Variants in Admixed Brazilian Patients With Genetic Generalized Epilepsies.巴西混血儿遗传性全身性癫痫患者候选变异的关联分析
Front Genet. 2021 Jul 8;12:672304. doi: 10.3389/fgene.2021.672304. eCollection 2021.
8
Gene Variants Associated With a Higher Risk for Alcohol Dependence in Multiethnic Populations.多族裔人群中与酒精依赖高风险相关的基因变异
Front Psychiatry. 2021 May 31;12:665257. doi: 10.3389/fpsyt.2021.665257. eCollection 2021.
9
Admixture/fine-mapping in Brazilians reveals a West African associated potential regulatory variant (rs114066381) with a strong female-specific effect on body mass and fat mass indexes.巴西人的混合/精细映射揭示了一个与西非相关的潜在调节变异体(rs114066381),它对女性的体重和体脂肪指数有很强的特异性影响。
Int J Obes (Lond). 2021 May;45(5):1017-1029. doi: 10.1038/s41366-021-00761-1. Epub 2021 Feb 26.
10
The genetic structure and adaptation of Andean highlanders and Amazonians are influenced by the interplay between geography and culture.安第斯高地人和亚马逊人的遗传结构和适应受到地理和文化相互作用的影响。
Proc Natl Acad Sci U S A. 2020 Dec 22;117(51):32557-32565. doi: 10.1073/pnas.2013773117. Epub 2020 Dec 4.
纳入来自印度的特定人群参考面板可提高 1000 基因组计划第 3 阶段面板的推断准确性。
Sci Rep. 2017 Jul 27;7(1):6733. doi: 10.1038/s41598-017-06905-6.
4
Improved imputation accuracy of rare and low-frequency variants using population-specific high-coverage WGS-based imputation reference panel.使用基于全基因组测序(WGS)的特定人群高覆盖度插补参考面板提高罕见和低频变异的插补准确性。
Eur J Hum Genet. 2017 Jun;25(7):869-876. doi: 10.1038/ejhg.2017.51. Epub 2017 Apr 12.
5
Suggestive association between variants in IL1RAPL and asthma symptoms in Latin American children.IL1RAPL基因变异与拉丁美洲儿童哮喘症状之间的潜在关联。
Eur J Hum Genet. 2017 Apr;25(4):439-445. doi: 10.1038/ejhg.2016.197. Epub 2017 Jan 25.
6
A radical revision of human genetics.人类遗传学的一次彻底修订。
Nature. 2016 Oct 13;538(7624):154-157. doi: 10.1038/538154a.
7
Genomics is failing on diversity.基因组学在多样性方面表现不佳。
Nature. 2016 Oct 13;538(7624):161-164. doi: 10.1038/538161a.
8
The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update.用于可访问、可重复和协作式生物医学分析的Galaxy平台:2016年更新
Nucleic Acids Res. 2016 Jul 8;44(W1):W3-W10. doi: 10.1093/nar/gkw343. Epub 2016 May 2.
9
Reproducible Research Practices and Transparency across the Biomedical Literature.生物医学文献中的可重复研究实践与透明度
PLoS Biol. 2016 Jan 4;14(1):e1002333. doi: 10.1371/journal.pbio.1002333. eCollection 2016 Jan.
10
Socioeconomic Position, But Not African Genomic Ancestry, Is Associated With Blood Pressure in the Bambui-Epigen (Brazil) Cohort Study of Aging.在班布伊 - 表观遗传学(巴西)衰老队列研究中,社会经济地位而非非洲基因组血统与血压相关。
Hypertension. 2016 Feb;67(2):349-55. doi: 10.1161/HYPERTENSIONAHA.115.06609. Epub 2015 Dec 28.