- Browse by Subject
Browsing by Subject "Endogeneity"
Now showing 1 - 3 of 3
Results Per Page
Sort Options
Item Avoiding Bad Control in Regression for Partially Qualitative Outcomes, and Correcting for Endogeneity Bias in Two-Part Models: Causal Inference from the Potential Outcomes Perspective(2021-05) Asfaw, Daniel Abebe; Terza, Joseph; Ottoni-Wilhelm, Mark; Tennekoon, Vidhura; Tan, FeiThe general potential outcomes framework (GPOF) is an essential structure that facilitates clear and coherent specification, identification, and estimation of causal effects. This dissertation utilizes and extends the GPOF, to specify, identify, and estimate causally interpretable (CI) effect parameter (EP) for an outcome of interest that manifests as either a value in a specified subset of the real line or a qualitative event -- a partially qualitative outcome (PQO). The limitations of the conventional GPOF for casting a regression model for a PQO is discussed. The GPOF is only capable of delivering an EP that is subject to a bias due to bad control. The dissertation proposes an outcome measure that maintains all of the essential features of a PQO that is entirely real-valued and is not subject to the bad control critique; the P-weighted outcome – the outcome weighted by the probability that it manifests as a quantitative (real) value. I detail a regression-based estimation method for such EP and, using simulated data, demonstrate its implementation and validate its consistency for the targeted EP. The practicality of the proposed approach is demonstrated by estimating the causal effect of a fully effective policy that bans pregnant women from smoking during pregnancy on a new measure of birth weight. The dissertation also proposes a Generalized Control Function (GCF) approach for modeling and estimating a CI parameter in the context of a fully parametric two-part model (2PM) for a continuous outcome in which the causal variable of interest is continuous and endogenous. The proposed approach is cast within the GPOF. Given a fully parametric specification for the causal variable and under regular Instrumental Variables (IV) assumptions, the approach is shown to satisfy the conditional independence assumption that is often difficult to hold under alternative approaches. Using simulated data, a full information maximum likelihood (FIML) estimator is derived for estimating the “deep” parameters of the model. The Average Incremental Effect (AIE) estimator based on these deep parameter estimates is shown to outperform other conventional estimators. I apply the method for estimating the medical care cost of obesity in youth in the US.Item Specification, estimation and testing of treatment effects in multinomial outcome models : accommodating endogeneity and inter-category covariance(2018-06-18) Tang, Shichao; Terza, Joseph V.; Carlin, Paul; Lin, Hsien-Chang; Morrison, Gwendolyn; Seo, BoyoungIn this dissertation, a potential outcomes (PO) based framework is developed for causally interpretable treatment effect parameters in the multinomial dependent variable regression framework. The specification of the relevant data generating process (DGP) is also derived. This new framework simultaneously accounts for the potential endogeneity of the treatment and loosens inter-category covariance restrictions on the multinomial outcome model (e.g., the independence from irrelevant alternatives restriction). Corresponding consistent estimators for the “deep parameters” of the DGP and the treatment effect parameters are developed and implemented (in Stata). A novel approach is proposed for assessing the inter-category covariance flexibility afforded by a particular multinomial modeling specification [e.g. multinomial logit (MNL), multinomial probit (MNP), and nested multinomial logit (NMNL)] in the context of our general framework. This assessment technique can serve as a useful tool for model selection. The new modeling/estimation approach developed in this dissertation is quite general. I focus here, however, on the NMNL model because, among the three modeling specifications under consideration (MNL, MNP and NMNL), it is the only one that is both computationally feasible and is relatively unrestrictive with regard to inter-category covariance. Moreover, as a logical starting point, I restrict my analyses to the simplest version of the model – the trinomial (three-category) NMNL with an endogenous treatment (ET) variable conditioned on individual-specific covariates only. To identify potential computational issues and to assess the statistical accuracy of my proposed NMNL-ET estimator and its implementation (in Stata), I conducted a thorough simulation analysis. I found that conventional optimization techniques are, in this context, generally fraught with convergence problems. To overcome this, I implement a systematic line search algorithm that successfully resolves this issue. The simulation results suggest that it is important to accommodate both endogeneity and inter-category covariance simultaneously in model design and estimation. As an illustration and as a basis for comparing alternative parametric specifications with respect to ease of implementation, computational efficiency and statistical performance, the proposed model and estimation method are used to analyze the impact of substance abuse/dependence on the employment status using the National Epidemiologic Survey on Alcohol and Related Conditions (NESARC) data.Item Two-Stage Residual Inclusion Estimation in Health Services Research and Health Economics(Wiley, 2018-06) Terza, Joseph V.; Economics, School of Liberal ArtsOBJECTIVES: Empirical analyses in health services research and health economics often require implementation of nonlinear models whose regressors include one or more endogenous variables-regressors that are correlated with the unobserved random component of the model. In such cases, implementation of conventional regression methods that ignore endogeneity will likely produce results that are biased and not causally interpretable. Terza et al. (2008) discuss a relatively simple estimation method that avoids endogeneity bias and is applicable in a wide variety of nonlinear regression contexts. They call this method two-stage residual inclusion (2SRI). In the present paper, I offer a 2SRI how-to guide for practitioners and a step-by-step protocol that can be implemented with any of the popular statistical or econometric software packages. STUDY DESIGN: We introduce the protocol and its Stata implementation in the context of a real data example. Implementation of 2SRI for a very broad class of nonlinear models is then discussed. Additional examples are given. EMPIRICAL APPLICATION: We analyze cigarette smoking as a determinant of infant birthweight using data from Mullahy (1997). CONCLUSION: It is hoped that the discussion will serve as a practical guide to implementation of the 2SRI protocol for applied researchers.