MOLECULAR PROFILING IN BREAST CANCER AND TOXICOGENOMICS

dc.contributor.advisorZhou, Yaoqi
dc.contributor.authorLiu, Jiangang
dc.contributor.otherDunker, A. Keith
dc.contributor.otherChen, Jake
dc.contributor.otherUversky, Vladimir N.
dc.contributor.otherLiu, Yunlong
dc.contributor.otherLi, Dan S.
dc.date.accessioned2011-08-23T16:00:56Z
dc.date.available2011-08-23T16:00:56Z
dc.date.issued2011-08-23
dc.degree.date2010en_US
dc.degree.grantorIndiana Universityen_US
dc.degree.levelPh.D.en_US
dc.descriptionIndiana University-Purdue University Indianapolis (IUPUI)en_US
dc.description.abstractThis dissertation presents a body of research that attempts to tackle the ‘overfitting’ problem for gene signature and biomarker development in two different aspects (mechanistically and computationally). In achievement of a deeper understanding of cancer molecular mechanisms, this study presents new approaches to derive gene signatures for various biological phenotypes, including breast cancer, in the context of well-defined and mechanistically associated biological pathways. We identified the pattern of gene expression in the cell cycle pathway can indeed serve as a powerful biomarker for breast cancer prognosis. We further built a predictive model for prognosis based on the cell cycle gene signature, and found our model to be more accurate than the Amsterdam 70-gene signature when tested with multiple gene expression datasets generated from several patient populations. Aside from demonstrating the effectiveness of dimensionality reduction, phenotypic dissection, and prognostic or diagnostic prediction, this approach also provides an alternative to the current methodology of identifying gene expression markers that links to biological mechanism. This dissertation also presents the development of a novel feature selection algorithm called Predictive Power Estimate Analysis (PPEA) to computationally tackle on overfitting. The algorithm iteratively apply a two-way bootstrapping procedure to estimate predictive power of each individual gene, and make it possible to construct a predictive model from a much smaller set of genes with the highest predictive power. Using DrugMatrix™ rat liver data, we identified genomic biomarkers of hepatic specific injury for inflammation, cell death, and bile duct hyperplasia. We demonstrated that the signature genes were mechanistically related to the phenotype the signature intended to predict (e.g. 17 out of top 20 genes for inflammation selected by PPEA were members of NF-kB pathway, which is a key pre-inflammatory pathway for a xenobiotic response). The top 4 gene signature for BDH has been further validated by QPCR in a toxicology lab. This is important because our results suggest that the PPEA model not largely deters the over-fitting problem, but also has the capability to elucidate mechanism(s) of drug action and / or of toxicity.en_US
dc.identifier.urihttps://hdl.handle.net/1805/2636
dc.identifier.urihttp://dx.doi.org/10.7912/C2/916
dc.language.isoen_USen_US
dc.subjectmolecular profiling,breast cancer, toxicogenomics, feature selectionen_US
dc.subject.lcshBiochemical markersen_US
dc.subject.lcshGene expressionen_US
dc.subject.lcshBreast -- Canceren_US
dc.subject.lcshGenetic toxicologyen_US
dc.subject.lcshPhenotypeen_US
dc.titleMOLECULAR PROFILING IN BREAST CANCER AND TOXICOGENOMICSen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Dissertation_PhD_LiuJ_Informatic_school_Dec21.pdf
Size:
1.32 MB
Format:
Adobe Portable Document Format
Description:
Dissertation
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.88 KB
Format:
Item-specific license agreed upon to submission
Description: