Lou Ronghui, Tang Pan, Ding Kang, Li Shanshan, Tian Cuiping, Li Yunxia, Zhao Suwen, Zhang Yaoyang, Shui Wenqing
iHuman Institute, ShanghaiTech University, Shanghai 201210, China; School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China; University of Chinese Academy of Sciences, Beijing 100049, China.
iHuman Institute, ShanghaiTech University, Shanghai 201210, China.
iScience. 2020 Mar 27;23(3):100903. doi: 10.1016/j.isci.2020.100903. Epub 2020 Feb 12.
Data-independent acquisition mass spectrometry (DIA-MS) is a powerful technique that enables relatively deep proteomic profiling with superior quantification reproducibility. DIA data mining predominantly relies on a spectral library of sufficient proteome coverage that, in most cases, is built on data-dependent acquisition-based analysis of the same sample. To expand the proteome coverage for a pre-determined protein family, we report herein on the construction of a hybrid spectral library that supplements a DIA experiment-derived library with a protein family-targeted virtual library predicted by deep learning. Leveraging this DIA hybrid library substantially deepens the coverage of three transmembrane protein families (G protein-coupled receptors, ion channels, and transporters) in mouse brain tissues with increases in protein identification of 37%-87% and peptide identification of 58%-161%. Moreover, of the 412 novel GPCR peptides exclusively identified with the DIA hybrid library strategy, 53.6% were validated as present in mouse brain tissues based on orthogonal experimental measurement.
数据非依赖采集质谱法(DIA-MS)是一种强大的技术,能够实现相对深入的蛋白质组分析,具有卓越的定量重现性。DIA数据挖掘主要依赖于具有足够蛋白质组覆盖率的谱图库,在大多数情况下,该谱图库是基于对同一样本的依赖数据采集分析构建的。为了扩大对预定蛋白质家族的蛋白质组覆盖率,我们在此报告构建一种混合谱图库,该谱图库用深度学习预测的靶向蛋白质家族的虚拟库补充DIA实验衍生的库。利用这个DIA混合库可大幅加深小鼠脑组织中三个跨膜蛋白家族(G蛋白偶联受体、离子通道和转运蛋白)的覆盖率,蛋白质鉴定增加37%-87%,肽段鉴定增加58%-161%。此外,基于DIA混合库策略专门鉴定出的412种新型GPCR肽段中,53.6%经正交实验测量验证存在于小鼠脑组织中。