TSAFinder: exhaustive tumor-specific antigen detection with RNAseq

dc.contributor.authorSharpnack, Michael F.
dc.contributor.authorJohnson, Travis S.
dc.contributor.authorChalkley, Robert
dc.contributor.authorHan, Zhi
dc.contributor.authorCarbone, David
dc.contributor.authorHuang, Kun
dc.contributor.authorHe, Kai
dc.contributor.departmentBiostatistics and Health Data Science, Richard M. Fairbanks School of Public Health
dc.date.accessioned2024-07-11T08:46:37Z
dc.date.available2024-07-11T08:46:37Z
dc.date.issued2022
dc.description.abstractMotivation: Tumor-specific antigen (TSA) identification in human cancer predicts response to immunotherapy and provides targets for cancer vaccine and adoptive T-cell therapies with curative potential, and TSAs that are highly expressed at the RNA level are more likely to be presented on major histocompatibility complex (MHC)-I. Direct measurements of the RNA expression of peptides would allow for generalized prediction of TSAs. Human leukocyte antigen (HLA)-I genotypes were predicted with seq2HLA. RNA sequencing (RNAseq) fastq files were translated into all possible peptides of length 8-11, and peptides with high and low expressions in the tumor and control samples, respectively, were tested for their MHC-I binding potential with netMHCpan-4.0. Results: A novel pipeline for TSA prediction from RNAseq was used to predict all possible unique peptides size 8-11 on previously published murine and human lung and lymphoma tumors and validated on matched tumor and control lung adenocarcinoma (LUAD) samples. We show that neoantigens predicted by exomeSeq are typically poorly expressed at the RNA level, and a fraction is expressed in matched normal samples. TSAs presented in the proteomics data have higher RNA abundance and lower MHC-I binding percentile, and these attributes are used to discover high confidence TSAs within the validation cohort. Finally, a subset of these high confidence TSAs is expressed in a majority of LUAD tumors and represents attractive vaccine targets. Availability and implementation: The datasets were derived from sources in the public domain as follows: TSAFinder is open-source software written in python and R. It is licensed under CC-BY-NC-SA and can be downloaded at https://github.com/RNAseqTSA.
dc.eprint.versionFinal published version
dc.identifier.citationSharpnack MF, Johnson TS, Chalkley R, et al. TSAFinder: exhaustive tumor-specific antigen detection with RNAseq. Bioinformatics. 2022;38(9):2422-2427. doi:10.1093/bioinformatics/btac116
dc.identifier.urihttps://hdl.handle.net/1805/42104
dc.language.isoen_US
dc.publisherOxford University Press
dc.relation.isversionof10.1093/bioinformatics/btac116
dc.relation.journalBioinformatics
dc.rightsPublisher Policy
dc.sourcePMC
dc.subjectAdenocarcinoma of lung
dc.subjectNeoplasm antigens
dc.subjectLung neoplasms
dc.subjectPeptides
dc.titleTSAFinder: exhaustive tumor-specific antigen detection with RNAseq
dc.typeArticle
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Sharpnack2022TSAFinder-PP.pdf
Size:
327.38 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.04 KB
Format:
Item-specific license agreed upon to submission
Description: