Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework

dc.contributor.authorYang, Jinyu
dc.contributor.authorMa, Anjun
dc.contributor.authorHoppe, Adam D.
dc.contributor.authorWang, Cankun
dc.contributor.authorLi, Yang
dc.contributor.authorZhang, Chi
dc.contributor.authorWang, Yan
dc.contributor.authorLiu, Bingqiang
dc.contributor.authorMa, Qin
dc.contributor.departmentMedical and Molecular Genetics, School of Medicineen_US
dc.date.accessioned2019-12-26T18:19:28Z
dc.date.available2019-12-26T18:19:28Z
dc.date.issued2019-09-05
dc.description.abstractThe identification of transcription factor binding sites and cis-regulatory motifs is a frontier whereupon the rules governing protein-DNA binding are being revealed. Here, we developed a new method (DEep Sequence and Shape mOtif or DESSO) for cis-regulatory motif prediction using deep neural networks and the binomial distribution model. DESSO outperformed existing tools, including DeepBind, in predicting motifs in 690 human ENCODE ChIP-sequencing datasets. Furthermore, the deep-learning framework of DESSO expanded motif discovery beyond the state-of-the-art by allowing the identification of known and new protein-protein-DNA tethering interactions in human transcription factors (TFs). Specifically, 61 putative tethering interactions were identified among the 100 TFs expressed in the K562 cell line. In this work, the power of DESSO was further expanded by integrating the detection of DNA shape features. We found that shape information has strong predictive power for TF-DNA binding and provides new putative shape motif information for human TFs. Thus, DESSO improves in the identification and structural analysis of TF binding sites, by integrating the complexities of DNA binding into a deep-learning framework.en_US
dc.identifier.citationYang, J., Ma, A., Hoppe, A. D., Wang, C., Li, Y., Zhang, C., … Ma, Q. (2019). Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework. Nucleic acids research, 47(15), 7809–7824. doi:10.1093/nar/gkz672en_US
dc.identifier.urihttps://hdl.handle.net/1805/21586
dc.language.isoen_USen_US
dc.publisherOxford University Pressen_US
dc.relation.isversionof10.1093/nar/gkz672en_US
dc.relation.journalNucleic Acids Researchen_US
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.sourcePMCen_US
dc.subjectComputational Biologyen_US
dc.subjectDNAen_US
dc.subjectGene Expression Regulationen_US
dc.subjectTranscription Factorsen_US
dc.titlePrediction of regulatory motifs from human Chip-sequencing data using a deep learning frameworken_US
dc.typeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
gkz672.pdf
Size:
6.4 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: