• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

果蝇胚胎基因表达模式的自动图像分析

Automatic image analysis for gene expression patterns of fly embryos.

作者信息

Peng Hanchuan, Long Fuhui, Zhou Jie, Leung Garmay, Eisen Michael B, Myers Eugene W

机构信息

Janelia Farm Research Campus, Howard Hughes Medical Institute, Ashburn, VA 20147, USA.

出版信息

BMC Cell Biol. 2007 Jul 10;8 Suppl 1(Suppl 1):S7. doi: 10.1186/1471-2121-8-S1-S7.

DOI:10.1186/1471-2121-8-S1-S7
PMID:17634097
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1924512/
Abstract

BACKGROUND

Staining the mRNA of a gene via in situ hybridization (ISH) during the development of a D. melanogaster embryo delivers the detailed spatio-temporal pattern of expression of the gene. Many biological problems such as the detection of co-expressed genes, co-regulated genes, and transcription factor binding motifs rely heavily on the analyses of these image patterns. The increasing availability of ISH image data motivates the development of automated computational approaches to the analysis of gene expression patterns.

RESULTS

We have developed algorithms and associated software that extracts a feature representation of a gene expression pattern from an ISH image, that clusters genes sharing the same spatio-temporal pattern of expression, that suggests transcription factor binding (TFB) site motifs for genes that appear to be co-regulated (based on the clustering), and that automatically identifies the anatomical regions that express a gene given a training set of annotations. In fact, we developed three different feature representations, based on Gaussian Mixture Models (GMM), Principal Component Analysis (PCA), and wavelet functions, each having different merits with respect to the tasks above. For clustering image patterns, we developed a minimum spanning tree method (MSTCUT), and for proposing TFB sites we used standard motif finders on clustered/co-expressed genes with the added twist of requiring conservation across the genomes of 8 related fly species. Lastly, we trained a suite of binary-classifiers, one for each anatomical annotation term in a controlled vocabulary or ontology that operate on the wavelet feature representation. We report the results of applying these methods to the Berkeley Drosophila Genome Project (BDGP) gene expression database.

CONCLUSION

Our automatic image analysis methods recapitulate known co-regulated genes and give correct developmental-stage classifications with 99+% accuracy, despite variations in morphology, orientation, and focal plane suggesting that these techniques form a set of useful tools for the large-scale computational analysis of fly embryonic gene expression patterns.

摘要

背景

在黑腹果蝇胚胎发育过程中,通过原位杂交(ISH)对基因的mRNA进行染色,可呈现该基因详细的时空表达模式。许多生物学问题,如共表达基因、共调控基因的检测以及转录因子结合基序,都严重依赖于对这些图像模式的分析。ISH图像数据的日益丰富,促使人们开发自动化计算方法来分析基因表达模式。

结果

我们开发了算法及相关软件,可从ISH图像中提取基因表达模式的特征表示,对具有相同时空表达模式的基因进行聚类,为基于聚类显示似乎共调控的基因推测转录因子结合(TFB)位点基序,并在给定注释训练集的情况下自动识别表达某一基因的解剖区域。事实上,我们基于高斯混合模型(GMM)、主成分分析(PCA)和小波函数开发了三种不同的特征表示,每种在上述任务方面都有不同优点。对于聚类图像模式,我们开发了一种最小生成树方法(MSTCUT),对于推测TFB位点,我们在聚类/共表达基因上使用标准基序查找器,并增加了在8种相关果蝇基因组中保守性的要求。最后,我们训练了一组二元分类器,针对受控词汇表或本体中的每个解剖注释术语各有一个,这些分类器基于小波特征表示进行操作。我们报告了将这些方法应用于伯克利果蝇基因组计划(BDGP)基因表达数据库的结果。

结论

我们的自动图像分析方法概括了已知的共调控基因,并以99%以上的准确率给出正确的发育阶段分类,尽管存在形态、方向和焦平面的变化,这表明这些技术构成了一组用于果蝇胚胎基因表达模式大规模计算分析的有用工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/1924512/f32b2aa44a7f/1471-2121-8-S1-S7-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/1924512/14bc9eed864e/1471-2121-8-S1-S7-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/1924512/be591df238f6/1471-2121-8-S1-S7-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/1924512/378412f4a43e/1471-2121-8-S1-S7-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/1924512/f32b2aa44a7f/1471-2121-8-S1-S7-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/1924512/14bc9eed864e/1471-2121-8-S1-S7-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/1924512/be591df238f6/1471-2121-8-S1-S7-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/1924512/378412f4a43e/1471-2121-8-S1-S7-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f87c/1924512/f32b2aa44a7f/1471-2121-8-S1-S7-4.jpg

相似文献

1
Automatic image analysis for gene expression patterns of fly embryos.果蝇胚胎基因表达模式的自动图像分析
BMC Cell Biol. 2007 Jul 10;8 Suppl 1(Suppl 1):S7. doi: 10.1186/1471-2121-8-S1-S7.
2
Joint stage recognition and anatomical annotation of Drosophila gene expression patterns.果蝇基因表达模式的联合阶段识别和解剖注释。
Bioinformatics. 2012 Jun 15;28(12):i16-24. doi: 10.1093/bioinformatics/bts220.
3
SPEX2: automated concise extraction of spatial gene expression patterns from Fly embryo ISH images.SPEX2:从果蝇胚胎原位杂交图像中自动简洁地提取空间基因表达模式。
Bioinformatics. 2010 Jun 15;26(12):i47-56. doi: 10.1093/bioinformatics/btq172.
4
Automatic recognition and annotation of gene expression patterns of fly embryos.果蝇胚胎基因表达模式的自动识别与注释。
Bioinformatics. 2007 Mar 1;23(5):589-96. doi: 10.1093/bioinformatics/btl680. Epub 2007 Jan 19.
5
GINI: from ISH images to gene interaction networks.GINI:从图像到基因交互网络。
PLoS Comput Biol. 2013;9(10):e1003227. doi: 10.1371/journal.pcbi.1003227. Epub 2013 Oct 10.
6
Automated annotation of developmental stages of Drosophila embryos in images containing spatial patterns of expression.在含有表达空间模式的图像中自动注释果蝇胚胎的发育阶段。
Bioinformatics. 2014 Jan 15;30(2):266-73. doi: 10.1093/bioinformatics/btt648. Epub 2013 Dec 3.
7
A mesh generation and machine learning framework for Drosophila gene expression pattern image analysis.用于果蝇基因表达模式图像分析的网格生成和机器学习框架。
BMC Bioinformatics. 2013 Dec 28;14:372. doi: 10.1186/1471-2105-14-372.
8
FlyIT: Drosophila Embryogenesis Image Annotation based on Image Tiling and Convolutional Neural Networks.FlyIT:基于图像分块和卷积神经网络的果蝇胚胎发育图像标注
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):194-204. doi: 10.1109/TCBB.2019.2935723. Epub 2021 Feb 3.
9
Classification of Drosophila embryonic developmental stage range based on gene expression pattern images.基于基因表达模式图像的果蝇胚胎发育阶段范围分类
Comput Syst Bioinformatics Conf. 2006:293-8.
10
Medium-throughput processing of whole mount in situ hybridisation experiments into gene expression domains.高通量处理整体原位杂交实验以获得基因表达图谱。
PLoS One. 2012;7(9):e46658. doi: 10.1371/journal.pone.0046658. Epub 2012 Sep 28.

引用本文的文献

1
New methods for computational decomposition of whole-mount in situ images enable effective curation of a large, highly redundant collection of Xenopus images.新的计算分解全胚胎原位图像的方法使得对大量高度冗余的爪蟾图像进行有效整理成为可能。
PLoS Comput Biol. 2018 Aug 29;14(8):e1006077. doi: 10.1371/journal.pcbi.1006077. eCollection 2018 Aug.
2
Stability-driven nonnegative matrix factorization to interpret spatial gene expression and build local gene networks.基于稳定性驱动的非负矩阵分解用于解释空间基因表达并构建局部基因网络。
Proc Natl Acad Sci U S A. 2016 Apr 19;113(16):4290-5. doi: 10.1073/pnas.1521171113. Epub 2016 Apr 6.
3

本文引用的文献

1
Eigenfaces for recognition.特征脸识别。
J Cogn Neurosci. 1991 Winter;3(1):71-86. doi: 10.1162/jocn.1991.3.1.71.
2
Automatic recognition and annotation of gene expression patterns of fly embryos.果蝇胚胎基因表达模式的自动识别与注释。
Bioinformatics. 2007 Mar 1;23(5):589-96. doi: 10.1093/bioinformatics/btl680. Epub 2007 Jan 19.
3
Genome-wide atlas of gene expression in the adult mouse brain.成年小鼠大脑基因表达的全基因组图谱。
Towards 3D in silico modeling of the sea urchin embryonic development.
迈向海胆胚胎发育的三维计算机模拟建模
J Chem Biol. 2013 Sep 13;7(1):17-28. doi: 10.1007/s12154-013-0101-x.
4
Segmentation and quantitative analysis of individual cells in developmental tissues.发育组织中单个细胞的分割与定量分析。
Methods Mol Biol. 2014;1092:235-53. doi: 10.1007/978-1-60327-292-6_16.
5
Automated annotation of developmental stages of Drosophila embryos in images containing spatial patterns of expression.在含有表达空间模式的图像中自动注释果蝇胚胎的发育阶段。
Bioinformatics. 2014 Jan 15;30(2):266-73. doi: 10.1093/bioinformatics/btt648. Epub 2013 Dec 3.
6
BIOCAT: a pattern recognition platform for customizable biological image classification and annotation.BIOCAT:一个用于可定制生物图像分类和标注的模式识别平台。
BMC Bioinformatics. 2013 Oct 4;14:291. doi: 10.1186/1471-2105-14-291.
7
FuncISH: learning a functional representation of neural ISH images.FuncISH:学习神经原位杂交图像的功能表示。
Bioinformatics. 2013 Jul 1;29(13):i36-43. doi: 10.1093/bioinformatics/btt207.
8
Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields.通过非参数因子分析和条件随机场对基因表达图像序列进行自动注释。
Bioinformatics. 2013 Jul 1;29(13):i27-35. doi: 10.1093/bioinformatics/btt206.
9
Unsupervised clustering of subcellular protein expression patterns in high-throughput microscopy images reveals protein complexes and functional relationships between proteins.无监督的亚细胞蛋白质表达模式聚类在高通量显微镜图像中揭示了蛋白质复合物和蛋白质之间的功能关系。
PLoS Comput Biol. 2013;9(6):e1003085. doi: 10.1371/journal.pcbi.1003085. Epub 2013 Jun 13.
10
A high-fidelity cell lineage tracing method for obtaining systematic spatiotemporal gene expression patterns in Caenorhabditis elegans.一种高保真的细胞谱系追踪方法,用于获取秀丽隐杆线虫中系统的时空基因表达模式。
G3 (Bethesda). 2013 May 20;3(5):851-63. doi: 10.1534/g3.113.005918.
Nature. 2007 Jan 11;445(7124):168-76. doi: 10.1038/nature05453. Epub 2006 Dec 6.
4
Quantitative and predictive model of transcriptional control of the Drosophila melanogaster even skipped gene.果蝇even skipped基因转录调控的定量和预测模型。
Nat Genet. 2006 Oct;38(10):1159-65. doi: 10.1038/ng1886. Epub 2006 Sep 17.
5
Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy.基于互信息的特征选择:最大依赖、最大相关和最小冗余准则。
IEEE Trans Pattern Anal Mach Intell. 2005 Aug;27(8):1226-38. doi: 10.1109/TPAMI.2005.159.
6
Identifying spatially similar gene expression patterns in early stage fruit fly embryo images: binary feature versus invariant moment digital representations.识别早期果蝇胚胎图像中空间相似的基因表达模式:二元特征与不变矩数字表示法
BMC Bioinformatics. 2004 Dec 16;5:202. doi: 10.1186/1471-2105-5-202.
7
PhyME: a probabilistic algorithm for finding motifs in sets of orthologous sequences.PhyME:一种用于在直系同源序列集中寻找基序的概率算法。
BMC Bioinformatics. 2004 Oct 28;5:170. doi: 10.1186/1471-2105-5-170.
8
Dynamic control of positional information in the early Drosophila embryo.果蝇早期胚胎中位置信息的动态控制。
Nature. 2004 Jul 15;430(6997):368-71. doi: 10.1038/nature02678.
9
Combining phylogenetic data with co-regulated genes to identify regulatory motifs.结合系统发育数据与共调控基因以识别调控基序。
Bioinformatics. 2003 Dec 12;19(18):2369-80. doi: 10.1093/bioinformatics/btg329.
10
A gene-coexpression network for global discovery of conserved genetic modules.用于全面发现保守遗传模块的基因共表达网络。
Science. 2003 Oct 10;302(5643):249-55. doi: 10.1126/science.1087447. Epub 2003 Aug 21.