Open-pFind:新一代开放式蛋白质鉴定搜索引擎

报告时间:2018年12月26日(星期三)上午9:00

报告地点:生物楼学术报告厅

报告人:迟浩副研究员,中国科学院计算技术研究所

  报告人简介:

  迟浩,中国科学院计算技术研究所副研究员,2013年毕业于中科院计算所并获得工学博士学位,曾获2013年度中国科学院院长特别奖。自2006年以来,致力于蛋白质组海量质谱数据的深度解析研究,是蛋白质鉴定搜索引擎pFind系统的主要设计者和开发者,在数据库搜索领域提出新一代开放式数据库搜索引擎Open-pFind,在从头测序领域提出并发展了pNovo系列算法,速度和精度均达到当时国际最佳水平。相关研究工作发表于Nature BiotechnologyJournal of Proteome ResearchJournal of Proteomics等期刊。pFind软件目前在国内外注册下载达2000余套,支持领域内200余项研究工作发表,在蛋白质组学科研和教学中发挥着积极作用。

  报告摘要:

  Shotgun proteomics has grown rapidly in recent decades, especially for peptide and protein identification. However, more than 50% of MS/MS data acquired in shotgun proteomics have not been successfully identified. As shown in a number of studies, unexpected modifications is a major reason underlying the low identification rate, and several other factors also hinder precise peptide identification, e.g., semi- and non-specific digestion, in-source fragmentation and co-eluting peptides in mixed spectra. We have developed a novel database search algorithm, Open-pFind, to efficiently identify peptides even in an ultra-large search space which takes into account unexpected modifications, amino acid mutations, semi- or non-specific digestion and co-eluting peptides. We re-analyzed an entire human proteome dataset consisting of ~25 million spectra. It took Open-pFind ~5 hours on a 64-core workstation to search all of these spectra. More than one million peptides were identified, which were 86.7% more than those reported previously. The results obtained with Open-pFind demonstrated that the characteristics of MS/MS data vary according to different methods for sample preparation and LC-MS/MS. Open search strategies, as made practical by Open-pFind, will most likely be the preferred tools for large-scale MS/MS data analyses in the future.

  联系人:1810组 张丽华

  联系电话:89720

版权所有 © 中国科学院大连化学物理研究所 本站内容如涉及知识产权问题请联系我们 备案号:辽ICP备05000861号-1 辽公网安备21020402000367号