• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因组学流程与数据整合:研究环境中的挑战与机遇

Genomics pipelines and data integration: challenges and opportunities in the research setting.

作者信息

Davis-Turak Jeremy, Courtney Sean M, Hazard E Starr, Glen W Bailey, da Silveira Willian A, Wesselman Timothy, Harbin Larry P, Wolf Bethany J, Chung Dongjun, Hardiman Gary

机构信息

a OnRamp Bioinformatics, Inc ., San Diego , CA.

b MUSC Bioinformatics , Center for Genomics Medicine, Medical University of South Carolina (MUSC) , Charleston , SC.

出版信息

Expert Rev Mol Diagn. 2017 Mar;17(3):225-237. doi: 10.1080/14737159.2017.1282822. Epub 2017 Jan 25.

DOI:10.1080/14737159.2017.1282822
PMID:28092471
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5580401/
Abstract

The emergence and mass utilization of high-throughput (HT) technologies, including sequencing technologies (genomics) and mass spectrometry (proteomics, metabolomics, lipids), has allowed geneticists, biologists, and biostatisticians to bridge the gap between genotype and phenotype on a massive scale. These new technologies have brought rapid advances in our understanding of cell biology, evolutionary history, microbial environments, and are increasingly providing new insights and applications towards clinical care and personalized medicine. Areas covered: The very success of this industry also translates into daunting big data challenges for researchers and institutions that extend beyond the traditional academic focus of algorithms and tools. The main obstacles revolve around analysis provenance, data management of massive datasets, ease of use of software, interpretability and reproducibility of results. Expert commentary: The authors review the challenges associated with implementing bioinformatics best practices in a large-scale setting, and highlight the opportunity for establishing bioinformatics pipelines that incorporate data tracking and auditing, enabling greater consistency and reproducibility for basic research, translational or clinical settings.

摘要

高通量(HT)技术的出现和大规模应用,包括测序技术(基因组学)和质谱技术(蛋白质组学、代谢组学、脂质组学),使得遗传学家、生物学家和生物统计学家能够在大规模层面上弥合基因型与表型之间的差距。这些新技术在我们对细胞生物学、进化史、微生物环境的理解方面取得了快速进展,并且越来越多地为临床护理和个性化医疗提供新的见解和应用。涵盖领域:该行业的巨大成功也给研究人员和机构带来了令人生畏的大数据挑战,这些挑战超出了算法和工具等传统学术重点。主要障碍围绕分析来源、海量数据集的数据管理、软件的易用性、结果的可解释性和可重复性。专家评论:作者回顾了在大规模环境中实施生物信息学最佳实践所面临的挑战,并强调了建立纳入数据跟踪和审计的生物信息学管道的机会,从而为基础研究、转化或临床环境实现更高的一致性和可重复性。

相似文献

1
Genomics pipelines and data integration: challenges and opportunities in the research setting.基因组学流程与数据整合:研究环境中的挑战与机遇
Expert Rev Mol Diagn. 2017 Mar;17(3):225-237. doi: 10.1080/14737159.2017.1282822. Epub 2017 Jan 25.
2
New trends in bioinformatics: from genome sequence to personalized medicine.生物信息学的新趋势:从基因组序列到个性化医疗。
Exp Gerontol. 2003 Oct;38(10):1031-6. doi: 10.1016/s0531-5565(03)00168-2.
3
Big Data Analysis in Computational Biology and Bioinformatics.计算生物学与生物信息学中的大数据分析
Methods Mol Biol. 2024;2719:181-197. doi: 10.1007/978-1-0716-3461-5_11.
4
Developing reproducible bioinformatics analysis workflows for heterogeneous computing environments to support African genomics.为异构计算环境开发可重现的生物信息学分析工作流程,以支持非洲基因组学。
BMC Bioinformatics. 2018 Nov 29;19(1):457. doi: 10.1186/s12859-018-2446-1.
5
Advances in omics and bioinformatics tools for systems analyses of plant functions.组学和生物信息学工具在植物功能系统分析中的进展。
Plant Cell Physiol. 2011 Dec;52(12):2017-38. doi: 10.1093/pcp/pcr153.
6
Where are we in genomics?我们在基因组学领域处于什么位置?
J Physiol Pharmacol. 2005 Jun;56 Suppl 3:37-70.
7
SnakeLines: integrated set of computational pipelines for sequencing reads.SnakeLines:一套用于测序读取的集成计算管道。
J Integr Bioinform. 2023 Aug 21;20(3). doi: 10.1515/jib-2022-0059. eCollection 2023 Sep 1.
8
Design considerations for workflow management systems use in production genomics research and the clinic.用于生产基因组学研究和临床的工作流管理系统的设计考虑因素。
Sci Rep. 2021 Nov 4;11(1):21680. doi: 10.1038/s41598-021-99288-8.
9
Accelerating Bioinformatics Applications via Emerging Parallel Computing Systems.通过新兴并行计算系统加速生物信息学应用
IEEE/ACM Trans Comput Biol Bioinform. 2015 Sep-Oct;12(5):971-2. doi: 10.1109/tcbb.2015.2457736.
10
Bioinformatics: bringing it all together.生物信息学:整合一切。
Nature. 2002 Oct 17;419(6908):751, 753, 755 passim. doi: 10.1038/419751a.

引用本文的文献

1
Bayesian Optimization for Modular Black-Box Systems with Switching Costs.具有切换成本的模块化黑箱系统的贝叶斯优化
Proc Mach Learn Res. 2021 Jul;161:1024-1034.
2
Transforming Clinical Research: The Power of High-Throughput Omics Integration.变革临床研究:高通量组学整合的力量
Proteomes. 2024 Sep 6;12(3):25. doi: 10.3390/proteomes12030025.
3
Next-Generation Sequencing (NGS): Platforms and Applications.下一代测序(NGS):平台与应用

本文引用的文献

1
CIViC is a community knowledgebase for expert crowdsourcing the clinical interpretation of variants in cancer.CIViC 是一个社区知识库,用于专家众包对癌症变异的临床解释。
Nat Genet. 2017 Jan 31;49(2):170-174. doi: 10.1038/ng.3774.
2
Toward a Shared Vision for Cancer Genomic Data.迈向癌症基因组数据的共同愿景。
N Engl J Med. 2016 Sep 22;375(12):1109-12. doi: 10.1056/NEJMp1607591.
3
Erratum to: A survey of best practices for RNA-seq data analysis.《RNA测序数据分析最佳实践调查》勘误
J Pharm Bioallied Sci. 2024 Feb;16(Suppl 1):S41-S45. doi: 10.4103/jpbs.jpbs_838_23. Epub 2024 Feb 29.
4
Next-Generation Sequencing Informatic Architecture Considerations.下一代测序信息学架构的考虑因素。
Methods Mol Biol. 2023;2621:27-37. doi: 10.1007/978-1-0716-2950-5_3.
5
The Power of Clinical Diagnosis for Deciphering Complex Genetic Mechanisms in Rare Diseases.临床诊断在破译罕见病复杂遗传机制中的威力。
Genes (Basel). 2023 Jan 12;14(1):196. doi: 10.3390/genes14010196.
6
Multiscale modeling in the framework of biological systems and its potential for spaceflight biology studies.生物系统框架下的多尺度建模及其在航天生物学研究中的潜力。
iScience. 2022 Oct 26;25(11):105421. doi: 10.1016/j.isci.2022.105421. eCollection 2022 Nov 18.
7
Local data commons: the sleeping beauty in the community of data commons.本地数据共享:数据共享社区中的睡美人。
BMC Bioinformatics. 2022 Sep 23;23(Suppl 12):386. doi: 10.1186/s12859-022-04922-5.
8
A Method for Variant Agnostic Detection of SARS-CoV-2, Rapid Monitoring of Circulating Variants, and Early Detection of Emergent Variants Such as Omicron.一种用于 SARS-CoV-2 的变异不可知检测、快速监测循环变异以及早期检测奥密克戎等新兴变异的方法。
J Clin Microbiol. 2022 Jul 20;60(7):e0034222. doi: 10.1128/jcm.00342-22. Epub 2022 Jun 29.
9
Whole Genome Sequencing Contributions and Challenges in Disease Reduction Focused on Malaria.全基因组测序在以疟疾为重点的疾病防控中的贡献与挑战
Biology (Basel). 2022 Apr 13;11(4):587. doi: 10.3390/biology11040587.
10
miRNA and lncRNA Expression Networks Modulate Cell Cycle and DNA Repair Inhibition in Senescent Prostate Cells.miRNA 和 lncRNA 表达网络调节衰老前列腺细胞中的细胞周期和 DNA 修复抑制。
Genes (Basel). 2022 Jan 24;13(2):208. doi: 10.3390/genes13020208.
Genome Biol. 2016 Aug 26;17(1):181. doi: 10.1186/s13059-016-1047-4.
4
Analysis of protein-coding genetic variation in 60,706 humans.对60706名人类的蛋白质编码基因变异进行分析。
Nature. 2016 Aug 18;536(7616):285-91. doi: 10.1038/nature19057.
5
The Ensembl Variant Effect Predictor.Ensembl变异效应预测器。
Genome Biol. 2016 Jun 6;17(1):122. doi: 10.1186/s13059-016-0974-4.
6
Amplification of WHSC1L1 regulates expression and estrogen-independent activation of ERα in SUM-44 breast cancer cells and is associated with ERα over-expression in breast cancer.WHSC1L1 的扩增调节 SUM-44 乳腺癌细胞中 ERα 的表达和雌激素非依赖性激活,并与乳腺癌中 ERα 的过表达相关。
Mol Oncol. 2016 Jun;10(6):850-65. doi: 10.1016/j.molonc.2016.02.003. Epub 2016 Feb 27.
7
Trimming of sequence reads alters RNA-Seq gene expression estimates.序列 reads 的修剪会改变 RNA-Seq 基因表达估计值。
BMC Bioinformatics. 2016 Feb 25;17:103. doi: 10.1186/s12859-016-0956-2.
8
Cancer statistics for African Americans, 2016: Progress and opportunities in reducing racial disparities.2016 年非裔美国人癌症统计数据:减少种族差异方面的进展和机会。
CA Cancer J Clin. 2016 Jul;66(4):290-308. doi: 10.3322/caac.21340. Epub 2016 Feb 22.
9
The clinical trial landscape in oncology and connectivity of somatic mutational profiles to targeted therapies.肿瘤学中的临床试验概况以及体细胞突变谱与靶向治疗的关联性。
Hum Genomics. 2016 Jan 16;10:4. doi: 10.1186/s40246-016-0061-7.
10
The 2016 database issue of Nucleic Acids Research and an updated molecular biology database collection.《核酸研究》2016年数据库特刊及更新的分子生物学数据库合集。
Nucleic Acids Res. 2016 Jan 4;44(D1):D1-6. doi: 10.1093/nar/gkv1356.