A large peptidome dataset improves HLA class I epitope prediction across most of the human population

Siranush Sarkizova, Susan Klaeger, Phuong M. Le, Letitia W. Li, Giacomo Oliveira, Hasmik Keshishian, Christina R. Hartigan, Wandi Zhang, David A. Braun, Keith L. Ligon, Pavan Bachireddy, Ioannis K. Zervantonakis, Jennifer M. Rosenbluth, Tamara Ouspenskaia, Travis Law, Sune Justesen, Jonathan Stevens, William J. Lane, Thomas Eisenhaure, Guang Lan ZhangKarl R. Clauser, Nir Hacohen, Steven A. Carr, Catherine J. Wu, Derin B. Keskin

Research output: Contribution to journalArticlepeer-review

247 Scopus citations

Abstract

Prediction of HLA epitopes is important for the development of cancer immunotherapies and vaccines. However, current prediction algorithms have limited predictive power, in part because they were not trained on high-quality epitope datasets covering a broad range of HLA alleles. To enable prediction of endogenous HLA class I-associated peptides across a large fraction of the human population, we used mass spectrometry to profile >185,000 peptides eluted from 95 HLA-A, -B, -C and -G mono-allelic cell lines. We identified canonical peptide motifs per HLA allele, unique and shared binding submotifs across alleles and distinct motifs associated with different peptide lengths. By integrating these data with transcript abundance and peptide processing, we developed HLAthena, providing allele-and-length-specific and pan-allele-pan-length prediction models for endogenous peptide presentation. These models predicted endogenous HLA class I-associated ligands with 1.5-fold improvement in positive predictive value compared with existing tools and correctly identified >75% of HLA-bound peptides that were observed experimentally in 11 patient-derived tumor cell lines.

Original languageEnglish (US)
Pages (from-to)199-209
Number of pages11
JournalNature biotechnology
Volume38
Issue number2
DOIs
StatePublished - Feb 1 2020
Externally publishedYes

ASJC Scopus subject areas

  • Biotechnology
  • Bioengineering
  • Applied Microbiology and Biotechnology
  • Molecular Medicine
  • Biomedical Engineering

Fingerprint

Dive into the research topics of 'A large peptidome dataset improves HLA class I epitope prediction across most of the human population'. Together they form a unique fingerprint.

Cite this