• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

鉴定人群中的中等大小缺失并推断其对基因表达的影响。

Identification of intermediate-sized deletions and inference of their impact on gene expression in a human population.

机构信息

Department of Drug Discovery Medicine, Kyoto University Graduate School of Medicine, Kyoto, Japan.

Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.

出版信息

Genome Med. 2019 Jul 24;11(1):44. doi: 10.1186/s13073-019-0656-4.

DOI:10.1186/s13073-019-0656-4
PMID:31340865
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6657090/
Abstract

BACKGROUND

Next-generation sequencing has allowed for the identification of different genetic variations, which are known to contribute to diseases. Of these, insertions and deletions are the second most abundant type of variations in the genome, but their biological importance or disease association is not well-studied, especially for deletions of intermediate sizes.

METHODS

We identified intermediate-sized deletions from whole-genome sequencing (WGS) data of Japanese samples (n = 174) with a novel deletion calling method which considered multiple samples. These deletions were used to construct a reference panel for use in imputation. Imputation was then conducted using the reference panel and data from 82 publically available Japanese samples with gene expression data. The accuracy of the deletion calling and imputation was examined with Nanopore long-read sequencing technology. We also conducted an expression quantitative trait loci (eQTL) association analysis using the deletions to infer their functional impacts on genes, before characterizing the deletions causal for gene expression level changes.

RESULTS

We obtained a set of polymorphic 4378 high-confidence deletions and constructed a reference panel. The deletions were successfully imputed into the Japanese samples with high accuracy (97.3%). The eQTL analysis identified 181 deletions (4.1%) suggested as causal for gene expression level changes. The causal deletion candidates were significantly enriched in promoters, super-enhancers, and transcription elongation chromatin states. Generation of deletions in a cell line with the CRISPR-Cas9 system confirmed that they were indeed causative variants for gene expression change. Furthermore, one of the deletions was observed to affect the gene expression levels of a gene it was not located in.

CONCLUSIONS

This paper reports an accurate deletion calling method for genotype imputation at the whole genome level and shows the importance of intermediate-sized deletions in the human population.

摘要

背景

下一代测序技术已经能够鉴定出不同的遗传变异,这些变异已知会导致疾病。其中,插入和缺失是基因组中第二丰富的变异类型,但它们的生物学重要性或与疾病的关联尚未得到充分研究,尤其是对于中等大小的缺失。

方法

我们使用一种新的考虑多个样本的缺失调用方法,从日本样本的全基因组测序(WGS)数据中鉴定出中等大小的缺失。这些缺失被用来构建一个参考面板,用于进行基因分型。然后,使用参考面板和来自 82 个公开的日本样本的基因表达数据进行基因分型。使用纳米孔长读测序技术检查缺失调用和基因分型的准确性。我们还使用这些缺失进行了表达数量性状基因座(eQTL)关联分析,以推断它们对基因的功能影响,然后对导致基因表达水平变化的缺失进行特征描述。

结果

我们获得了一组多态性的 4378 个高可信度缺失,并构建了一个参考面板。这些缺失被成功地高精度(97.3%)地基因分型到日本样本中。eQTL 分析确定了 181 个缺失(4.1%)被认为是导致基因表达水平变化的原因。候选的因果缺失在启动子、超级增强子和转录延伸染色质状态中显著富集。使用 CRISPR-Cas9 系统在细胞系中产生缺失,证实了它们确实是导致基因表达变化的变异。此外,一个缺失被观察到影响了它不在的基因的表达水平。

结论

本文报道了一种用于全基因组水平基因分型的准确缺失调用方法,并展示了中等大小缺失在人类群体中的重要性。

相似文献

1
Identification of intermediate-sized deletions and inference of their impact on gene expression in a human population.鉴定人群中的中等大小缺失并推断其对基因表达的影响。
Genome Med. 2019 Jul 24;11(1):44. doi: 10.1186/s13073-019-0656-4.
2
Characterization of intermediate-sized insertions using whole-genome sequencing data and analysis of their functional impact on gene expression.利用全基因组测序数据对中等大小插入进行特征分析,并分析它们对基因表达的功能影响。
Hum Genet. 2021 Aug;140(8):1201-1216. doi: 10.1007/s00439-021-02291-2. Epub 2021 May 12.
3
Copy number variations in the genome of the Qatari population.卡塔尔人群基因组中的拷贝数变异
BMC Genomics. 2015 Oct 22;16:834. doi: 10.1186/s12864-015-1991-5.
4
Functional regression method for whole genome eQTL epistasis analysis with sequencing data.用于基于测序数据的全基因组eQTL上位性分析的功能回归方法。
BMC Genomics. 2017 May 18;18(1):385. doi: 10.1186/s12864-017-3777-4.
5
Multiple Functional Variants at 13q14 Risk Locus for Osteoporosis Regulate RANKL Expression Through Long-Range Super-Enhancer.13q14 骨质疏松症风险位点的多个功能变异通过长距离超级增强子调节 RANKL 表达。
J Bone Miner Res. 2018 Jul;33(7):1335-1346. doi: 10.1002/jbmr.3419. Epub 2018 May 17.
6
Genome-wide analysis of deletions in maize population reveals abundant genetic diversity and functional impact.玉米群体中缺失的全基因组分析揭示了丰富的遗传多样性和功能影响。
Theor Appl Genet. 2022 Jan;135(1):273-290. doi: 10.1007/s00122-021-03965-1. Epub 2021 Oct 18.
7
iSVP: an integrated structural variant calling pipeline from high-throughput sequencing data.iSVP:一种基于高通量测序数据的整合结构变异检测流程
BMC Syst Biol. 2013;7 Suppl 6(Suppl 6):S8. doi: 10.1186/1752-0509-7-S6-S8. Epub 2013 Dec 13.
8
Comparison of genotype imputation strategies using a combined reference panel for chicken population.利用鸡群体的组合参考面板比较基因型推断策略。
Animal. 2019 Jun;13(6):1119-1126. doi: 10.1017/S1751731118002860. Epub 2018 Oct 29.
9
Whole-genome characterization in pedigreed non-human primates using genotyping-by-sequencing (GBS) and imputation.利用简化基因组测序(GBS)和填充技术对圈养非人灵长类动物进行全基因组特征分析。
BMC Genomics. 2016 Aug 24;17(1):676. doi: 10.1186/s12864-016-2966-x.
10
Integration of Multi-omics Data for Expression Quantitative Trait Loci (eQTL) Analysis and eQTL Epistasis.整合多组学数据用于表达数量性状位点(eQTL)分析和eQTL上位性分析。
Methods Mol Biol. 2020;2082:157-171. doi: 10.1007/978-1-0716-0026-9_11.

引用本文的文献

1
Long-read sequencing reveals novel isoform-specific eQTLs and regulatory mechanisms of isoform expression in human B cells.长读长测序揭示了人类B细胞中新型异构体特异性eQTL以及异构体表达的调控机制。
Genome Biol. 2025 May 8;26(1):110. doi: 10.1186/s13059-025-03583-w.
2
Mapping crossover events of mouse meiotic recombination by restriction fragment ligation-based Refresh-seq.通过基于限制性片段连接的Refresh-seq技术对小鼠减数分裂重组的交叉事件进行定位。
Cell Discov. 2024 Mar 5;10(1):26. doi: 10.1038/s41421-023-00638-9.
3
Long-read-based single sperm genome sequencing for chromosome-wide haplotype phasing of both SNPs and SVs.

本文引用的文献

1
Comparative Analyses of Copy-Number Variation in Autism Spectrum Disorder and Schizophrenia Reveal Etiological Overlap and Biological Insights.自闭症谱系障碍和精神分裂症的拷贝数变异比较分析揭示了病因重叠和生物学见解。
Cell Rep. 2018 Sep 11;24(11):2838-2856. doi: 10.1016/j.celrep.2018.08.022.
2
Long reads: their purpose and place.长读序列:它们的用途和位置。
Hum Mol Genet. 2018 Aug 1;27(R2):R234-R241. doi: 10.1093/hmg/ddy177.
3
Minimap2: pairwise alignment for nucleotide sequences.Minimap2:核苷酸序列的两两比对。
基于长读长测序的单精子基因组测序技术,可对 SNP 和 SV 进行全染色体范围的单倍型相位分析。
Nucleic Acids Res. 2023 Aug 25;51(15):8020-8034. doi: 10.1093/nar/gkad532.
4
Targeted deletion of ecto-5'-nucleotidase results in retention of inosine monophosphate content in postmortem muscle of medaka (Oryzias latipes).靶向敲除外核苷酸酶导致斑马鱼(Oryzias latipes)死后肌肉中肌苷单磷酸含量的滞留。
Sci Rep. 2022 Nov 3;12(1):18588. doi: 10.1038/s41598-022-22029-y.
5
Expression and Genotype Are Possible Discriminators in Different Forms of Dementia.表达和基因型可能是不同形式痴呆症的鉴别因素。
Front Aging Neurosci. 2022 Mar 14;14:858162. doi: 10.3389/fnagi.2022.858162. eCollection 2022.
6
Compound genetic etiology in a patient with a syndrome including diabetes, intellectual deficiency and distichiasis.患者同时患有糖尿病、智力缺陷和睫毛乱生综合征,存在复合遗传病因。
Orphanet J Rare Dis. 2022 Feb 28;17(1):86. doi: 10.1186/s13023-022-02248-2.
7
Characterization of intermediate-sized insertions using whole-genome sequencing data and analysis of their functional impact on gene expression.利用全基因组测序数据对中等大小插入进行特征分析,并分析它们对基因表达的功能影响。
Hum Genet. 2021 Aug;140(8):1201-1216. doi: 10.1007/s00439-021-02291-2. Epub 2021 May 12.
8
Whole-genome sequencing with long reads reveals complex structure and origin of structural variation in human genetic variations and somatic mutations in cancer.全基因组测序与长读长揭示了人类遗传变异和癌症体细胞突变中结构变异的复杂结构和起源。
Genome Med. 2021 Apr 29;13(1):65. doi: 10.1186/s13073-021-00883-1.
Bioinformatics. 2018 Sep 15;34(18):3094-3100. doi: 10.1093/bioinformatics/bty191.
4
Accurate detection of complex structural variations using single-molecule sequencing.利用单分子测序技术准确检测复杂结构变异。
Nat Methods. 2018 Jun;15(6):461-468. doi: 10.1038/s41592-018-0001-7. Epub 2018 Apr 30.
5
IMSindel: An accurate intermediate-size indel detection tool incorporating de novo assembly and gapped global-local alignment with split read analysis.IMSindel:一种准确的中等大小插入缺失检测工具,结合从头组装和缺口全局-局部比对以及拆分读分析。
Sci Rep. 2018 Apr 4;8(1):5608. doi: 10.1038/s41598-018-23978-z.
6
Nanopore sequencing and assembly of a human genome with ultra-long reads.纳米孔测序和超长读长组装人类基因组。
Nat Biotechnol. 2018 Apr;36(4):338-345. doi: 10.1038/nbt.4060. Epub 2018 Jan 29.
7
Use of deep whole-genome sequencing data to identify structure risk variants in breast cancer susceptibility genes.利用深度全基因组测序数据鉴定乳腺癌易感基因中的结构风险变异。
Hum Mol Genet. 2018 Mar 1;27(5):853-859. doi: 10.1093/hmg/ddy005.
8
The UCSC Genome Browser database: 2018 update.UCSC 基因组浏览器数据库:2018 年更新。
Nucleic Acids Res. 2018 Jan 4;46(D1):D762-D769. doi: 10.1093/nar/gkx1020.
9
Structure, mechanism, and regulation of polycomb-repressive complex 2.多梳抑制复合物 2 的结构、机制与调控
J Biol Chem. 2018 Sep 7;293(36):13805-13814. doi: 10.1074/jbc.R117.800367. Epub 2017 Sep 14.
10
GeneHancer: genome-wide integration of enhancers and target genes in GeneCards.基因增强子:基因卡片中增强子与靶基因的全基因组整合
Database (Oxford). 2017 Jan 1;2017. doi: 10.1093/database/bax028.