Pemmasani Sandhya Kiran, Raman Rasika, Mohapatra Rajkishore, Vidyasagar Mathukumalli, Acharya Anuradha
Research and Development Division, Mapmygenome India Limited, Hyderabad, India.
Front Genet. 2020 Jul 22;11:753. doi: 10.3389/fgene.2020.00753. eCollection 2020.
Today, genomic data holds great potential to improve healthcare strategies across various dimensions - be it disease prevention, enhanced diagnosis, or optimized treatment. The biggest hurdle faced by the medical and research community in India is the lack of genotype-phenotype correlations for Indians at a population-wide and an individual level. This leads to inefficient translation of genomic information during clinical decision making. Population-wide sequencing projects for Indian genomes help overcome hurdles and enable us to unearth and validate the genetic markers for different health conditions. Machine learning algorithms are essential to analyze huge amounts of genotype data in synergy with gene expression, demographic, clinical, and pathological data. Predictive models developed through these algorithms help in classifying the individuals into different risk groups, so that preventive measures and personalized therapies can be designed. They also help in identifying the impact of each genetic marker with the associated condition, from a clinical perspective. In India, genome sequencing technologies have now become more accessible to the general population. However, information on variants associated with several major diseases is not available in publicly-accessible databases. Creating a centralized database of variants facilitates early detection and mitigation of health risks in individuals. In this article, we discuss the challenges faced by genetic researchers and genomic testing facilities in India, in terms of dearth of public databases, people with knowledge on machine learning algorithms, computational resources and awareness in the medical community in interpreting genetic variants. Potential solutions to enhance genomic research in India, are also discussed.
如今,基因组数据在改善各个层面的医疗保健策略方面具有巨大潜力——无论是疾病预防、强化诊断还是优化治疗。印度医学和研究界面临的最大障碍是,在全人群和个体层面上,印度人缺乏基因型与表型的相关性。这导致在临床决策过程中基因组信息的转化效率低下。针对印度人基因组的全人群测序项目有助于克服障碍,使我们能够发掘并验证针对不同健康状况的遗传标记。机器学习算法对于协同分析大量基因型数据与基因表达、人口统计学、临床和病理数据至关重要。通过这些算法开发的预测模型有助于将个体分类到不同的风险组,从而能够设计预防措施和个性化治疗方案。从临床角度来看,它们还有助于确定每个遗传标记对相关疾病的影响。在印度,基因组测序技术现在已更易于普通人群使用。然而,公开可访问的数据库中没有与几种主要疾病相关的变异信息。创建一个集中的变异数据库有助于早期发现和降低个体的健康风险。在本文中,我们讨论了印度遗传研究人员和基因组检测机构在公共数据库匮乏、缺乏熟悉机器学习算法的人员、计算资源以及医学界对解读基因变异的认识等方面所面临的挑战。我们还讨论了加强印度基因组研究的潜在解决方案。