CASowary: CRISPR-Cas13 guide RNA predictor for transcript depletion

dc.contributor.authorKrohannon, Alexander
dc.contributor.authorSrivastava, Mansi
dc.contributor.authorRauch, Simone
dc.contributor.authorSrivastava, Rajneesh
dc.contributor.authorDickinson, Bryan C.
dc.contributor.authorJanga, Sarath Chandra
dc.contributor.departmentBioHealth Informatics, School of Informatics and Computingen_US
dc.date.accessioned2023-05-16T13:02:13Z
dc.date.available2023-05-16T13:02:13Z
dc.date.issued2022
dc.description.abstractBackground: Recent discovery of the gene editing system - CRISPR (Clustered Regularly Interspersed Short Palindromic Repeats) associated proteins (Cas), has resulted in its widespread use for improved understanding of a variety of biological systems. Cas13, a lesser studied Cas protein, has been repurposed to allow for efficient and precise editing of RNA molecules. The Cas13 system utilizes base complementarity between a crRNA/sgRNA (crispr RNA or single guide RNA) and a target RNA transcript, to preferentially bind to only the target transcript. Unlike targeting the upstream regulatory regions of protein coding genes on the genome, the transcriptome is significantly more redundant, leading to many transcripts having wide stretches of identical nucleotide sequences. Transcripts also exhibit complex three-dimensional structures and interact with an array of RBPs (RNA Binding Proteins), both of which may impact the effectiveness of transcript depletion of target sequences. However, our understanding of the features and corresponding methods which can predict whether a specific sgRNA will effectively knockdown a transcript is very limited. Results: Here we present a novel machine learning and computational tool, CASowary, to predict the efficacy of a sgRNA. We used publicly available RNA knockdown data from Cas13 characterization experiments for 555 sgRNAs targeting the transcriptome in HEK293 cells, in conjunction with transcriptome-wide protein occupancy information. Our model utilizes a Decision Tree architecture with a set of 112 sequence and target availability features, to classify sgRNA efficacy into one of four classes, based upon expected level of target transcript knockdown. After accounting for noise in the training data set, the noise-normalized accuracy exceeds 70%. Additionally, highly effective sgRNA predictions have been experimentally validated using an independent RNA targeting Cas system - CIRTS, confirming the robustness and reproducibility of our model's sgRNA predictions. Utilizing transcriptome wide protein occupancy map generated using POP-seq in HeLa cells against publicly available protein-RNA interaction map in Hek293 cells, we show that CASowary can predict high quality guides for numerous transcripts in a cell line specific manner. Conclusions: Application of CASowary to whole transcriptomes should enable rapid deployment of CRISPR/Cas13 systems, facilitating the development of therapeutic interventions linked with aberrations in RNA regulatory processes.en_US
dc.eprint.versionFinal published versionen_US
dc.identifier.citationKrohannon A, Srivastava M, Rauch S, Srivastava R, Dickinson BC, Janga SC. CASowary: CRISPR-Cas13 guide RNA predictor for transcript depletion. BMC Genomics. 2022;23(1):172. Published 2022 Mar 2. doi:10.1186/s12864-022-08366-2en_US
dc.identifier.urihttps://hdl.handle.net/1805/33009
dc.language.isoen_USen_US
dc.publisherBMCen_US
dc.relation.isversionof10.1186/s12864-022-08366-2en_US
dc.relation.journalBMC Genomicsen_US
dc.rightsAttribution 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.sourcePMCen_US
dc.subjectCRISPRen_US
dc.subjectCas13en_US
dc.subjectmRNA regulationen_US
dc.subjectGene editingen_US
dc.subjectFunctional genomicsen_US
dc.subjectMachine learningen_US
dc.subjectProtein expressionen_US
dc.titleCASowary: CRISPR-Cas13 guide RNA predictor for transcript depletionen_US
dc.typeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
12864_2022_Article_8366.pdf
Size:
2.08 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: