Automatic annotation of GPI sstructures using grid computing

Clemente Aguilar-Bonavides, Gerardo A. Cardenas, Ernesto S. Nakayasu, Felipe Gazos-Lopes, Igor C. Almeida, Ming Ying Leung

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Glycosylphosphatidylinositol (GPI)-anchored proteins are involved in many biological processes and are of medical importance. The identification and analysis of the entire collection of free and protein-linked GPIs within an organism (i.e., GPIomics) requires highly sensitive instruments. At present, liquid chromatography-tandem mass spectrometry (LC-MS/MS or -MSn) is the most efficient laboratory technique for these tasks. As a typical MS n experiment produces hundreds of thousands of spectra, the data analysis creates a major bottleneck in high-throughput GPIomic projects. Yet, no computational tool for characterizing the chemical structures of GPI is available to date. We propose a library-search algorithm to identify GPIs by matching fragment peaks in the spectra with molecular masses derived from a collection of theoretical GPI structures constructed based on properties of all currently known GPIs. A theoretically possible GPI structure is assessed by a scoring scheme that incorporates its fitness values for individual observed spectra as well as its frequency of being considered as a good fit. The algorithm has been tested on a set of experimentally confirmed GPIs for the protozoan parasite Trypanosoma cruzi, the causative agent of Chagas disease. The final list of 4686 predicted GPI candidates contains 76 out of the 78 known structures in the test set. Over 70% of the known structures have fitness values among the top 647. The first version of this tool runs on a single computer, the results are obtained within 10 days. A second version has been developed using the Condor high throughput computing distributed environment; for the same amount of data results are obtained within 3 days, 19 hours, 38 minutes. This computational tool is expected to quicken the discovery and characterization of GPI molecules, increasing the number of experimentally confirmed GPI structures.

Original languageEnglish (US)
Title of host publication5th International Conference on Bioinformatics and Computational Biology 2013, BICoB 2013
Pages219-224
Number of pages6
StatePublished - 2013
Event5th International Conference on Bioinformatics and Computational Biology 2013, BICoB 2013 - Honolulu, HI, United States
Duration: Mar 4 2013Mar 6 2013

Publication series

Name5th International Conference on Bioinformatics and Computational Biology 2013, BICoB 2013

Other

Other5th International Conference on Bioinformatics and Computational Biology 2013, BICoB 2013
Country/TerritoryUnited States
CityHonolulu, HI
Period3/4/133/6/13

ASJC Scopus subject areas

  • Biomedical Engineering
  • Health Information Management

Fingerprint

Dive into the research topics of 'Automatic annotation of GPI sstructures using grid computing'. Together they form a unique fingerprint.

Cite this