Multivariate semiparametric regression models for longitudinal data
dc.contributor.advisor | Tu, Wanzhu | |
dc.contributor.author | Li, Zhuokai | |
dc.contributor.other | Liu, Hai | |
dc.contributor.other | Katz, Barry P. | |
dc.contributor.other | Fortenberry, J. Dennis | |
dc.date.accessioned | 2015-05-29T19:15:37Z | |
dc.date.available | 2015-05-29T19:15:37Z | |
dc.date.issued | 2014 | |
dc.degree.date | 2014 | en_US |
dc.degree.discipline | Biostatistics | en |
dc.degree.grantor | Indiana University | en_US |
dc.degree.level | Ph.D. | en_US |
dc.description.abstract | Multiple-outcome longitudinal data are abundant in clinical investigations. For example, infections with different pathogenic organisms are often tested concurrently, and assessments are usually taken repeatedly over time. It is therefore natural to consider a multivariate modeling approach to accommodate the underlying interrelationship among the multiple longitudinally measured outcomes. This dissertation proposes a multivariate semiparametric modeling framework for such data. Relevant estimation and inference procedures as well as model selection tools are discussed within this modeling framework. The first part of this research focuses on the analytical issues concerning binary data. The second part extends the binary model to a more general situation for data from the exponential family of distributions. The proposed model accounts for the correlations across the outcomes as well as the temporal dependency among the repeated measures of each outcome within an individual. An important feature of the proposed model is the addition of a bivariate smooth function for the depiction of concurrent nonlinear and possibly interacting influences of two independent variables on each outcome. For model implementation, a general approach for parameter estimation is developed by using the maximum penalized likelihood method. For statistical inference, a likelihood-based resampling procedure is proposed to compare the bivariate nonlinear effect surfaces across the outcomes. The final part of the dissertation presents a variable selection tool to facilitate model development in practical data analysis. Using the adaptive least absolute shrinkage and selection operator (LASSO) penalty, the variable selection tool simultaneously identifies important fixed effects and random effects, determines the correlation structure of the outcomes, and selects the interaction effects in the bivariate smooth functions. Model selection and estimation are performed through a two-stage procedure based on an expectation-maximization (EM) algorithm. Simulation studies are conducted to evaluate the performance of the proposed methods. The utility of the methods is demonstrated through several clinical applications. | en_US |
dc.identifier.uri | https://hdl.handle.net/1805/6462 | |
dc.identifier.uri | http://dx.doi.org/10.7912/C2/2781 | |
dc.language.iso | en_US | en_US |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 United States | |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/us/ | |
dc.subject | Biostatistics | en_US |
dc.subject.lcsh | Regression analysis -- Data processing -- Research -- Methodology | en_US |
dc.subject.lcsh | Mathematical statistics -- Longitudinal studies -- Research | en_US |
dc.subject.lcsh | Multivariate analysis -- Research -- Methodology | en_US |
dc.subject.lcsh | Estimation theory -- Research | en_US |
dc.subject.lcsh | Biometry -- Methodology -- Research | en_US |
dc.subject.lcsh | Clinical trials -- Statistical methods -- Research | en_US |
dc.subject.lcsh | Expectation-maximization algorithms -- Research | en_US |
dc.subject.lcsh | Binary system (Mathematics) -- Research | en_US |
dc.subject.lcsh | Nonparametric statistics -- Research | en_US |
dc.subject.lcsh | Probabilities -- Data processing | en_US |
dc.subject.lcsh | Real-time data processing -- Research | en_US |
dc.subject.lcsh | Parameter estimation -- Research | en_US |
dc.subject.lcsh | Latent variables -- Research | en_US |
dc.subject.lcsh | Meta-analysis -- Research -- Methodology | en_US |
dc.subject.lcsh | Stochastic processes -- Research | en_US |
dc.subject.lcsh | Least squares -- Research | en_US |
dc.title | Multivariate semiparametric regression models for longitudinal data | en_US |
dc.type | Thesis | en |