Hu, Mei-Chen
(Department of Psychiatry, Columbia University, New York, NY, USA)
,
Pavlicova, Martina
(Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, NY, USA)
,
Nunes, Edward V.
(Department of Psychiatry, Columbia University, New York, NY, USA)
Background: In clinical trials of behavioral health interventions, outcome variables often take the form of counts, such as days using substances or episodes of unprotected sex. Classically, count data follow a Poisson distribution; however, in practice such data often display greater heterogeneity ...
Background: In clinical trials of behavioral health interventions, outcome variables often take the form of counts, such as days using substances or episodes of unprotected sex. Classically, count data follow a Poisson distribution; however, in practice such data often display greater heterogeneity in the form of excess zeros (zero-inflation) or greater spread in the values (overdispersion) or both. Greater sample heterogeneity may be especially common in community-based effectiveness trials, where broad eligibility criteria are implemented to achieve a generalizable sample. Objectives: This article reviews the characteristics of Poisson model and the related models that have been developed to handle overdispersion (negative binomial (NB) model) or zero-inflation (zero-inflated Poisson (ZIP) and Poisson hurdle (PH) models) or both (zero-inflated negative binomial (ZINB) and negative binomial hurdle (NBH) models). Methods: All six models were used to model the effect of an HIV-risk reduction intervention on the count of unprotected sexual occasions (USOs), using data from a previously completed clinical trial among female patients (N == 515) participating in community-based substance abuse treatment (Tross et al. Effectiveness of HIV/AIDS sexual risk reduction groups for women in substance abuse treatment programs: Results of NIDA Clinical Trials Network Trial. J Acquir Immune Defic Syndr 2008; 48(5):581-589). Goodness of fit and the estimates of treatment effect derived from each model were compared. Results: The ZINB model provided the best fit, yielding a medium-sized effect of intervention. Conclusions and Scientific Significance: This article illustrates the consequences of applying models with different distribution assumptions on the data. If a model used does not closely fit the shape of the data distribution, the estimate of the effect of the intervention may be biased, either over- or underestimating the intervention effect.
Background: In clinical trials of behavioral health interventions, outcome variables often take the form of counts, such as days using substances or episodes of unprotected sex. Classically, count data follow a Poisson distribution; however, in practice such data often display greater heterogeneity in the form of excess zeros (zero-inflation) or greater spread in the values (overdispersion) or both. Greater sample heterogeneity may be especially common in community-based effectiveness trials, where broad eligibility criteria are implemented to achieve a generalizable sample. Objectives: This article reviews the characteristics of Poisson model and the related models that have been developed to handle overdispersion (negative binomial (NB) model) or zero-inflation (zero-inflated Poisson (ZIP) and Poisson hurdle (PH) models) or both (zero-inflated negative binomial (ZINB) and negative binomial hurdle (NBH) models). Methods: All six models were used to model the effect of an HIV-risk reduction intervention on the count of unprotected sexual occasions (USOs), using data from a previously completed clinical trial among female patients (N == 515) participating in community-based substance abuse treatment (Tross et al. Effectiveness of HIV/AIDS sexual risk reduction groups for women in substance abuse treatment programs: Results of NIDA Clinical Trials Network Trial. J Acquir Immune Defic Syndr 2008; 48(5):581-589). Goodness of fit and the estimates of treatment effect derived from each model were compared. Results: The ZINB model provided the best fit, yielding a medium-sized effect of intervention. Conclusions and Scientific Significance: This article illustrates the consequences of applying models with different distribution assumptions on the data. If a model used does not closely fit the shape of the data distribution, the estimate of the effect of the intervention may be biased, either over- or underestimating the intervention effect.
Rose, C. E., Martin, S. W., Wannemuehler, K. A., Plikaytis, B. D..
On the Use of Zero-Inflated and Hurdle Models for Modeling Vaccine Adverse Event Count Data.
Journal of biopharmaceutical statistics,
vol.16,
no.4,
463-481.
Yau, Kelvin K. W., Wang, Kui, Lee, Andy H..
Zero-Inflated Negative Binomial Mixed Regression Modeling of Over-Dispersed Count Data with Extra Zeros.
Biometrical journal: Biometrische Zeitschrift,
vol.45,
no.4,
437-452.
Liu W, Cela J. Count data models in SAS. Paper Presented at SAS Global Forum 2008, 16-19 March 2008, San Antonio, TX, No. 371-2008. Available at: http://www2.sas.com/proceedings/forum2008/371-2008.pdf. Last accessed on July 11, 2011.
Chin, Hoong Chor, Quddus, Mohammed Abdul.
Modeling Count Data with Excess Zeroes : An Empirical Application to Traffic Accidents.
Sociological methods & research,
vol.32,
no.1,
90-116.
Version 9.2. SAS Software System for Windows [computer program] 2009
10.1002/0470023589
10.1002/9781119013563
Econometric Analysis Greene W. 2002 5
Second International Symposium on Information Theory Akaike H 267 1973
Joshua, Sarath C., Garber, Nicholas J..
Estimating truck accident rate and involvements using linear and Poisson regression models.
Transportation planning and technology,
vol.15,
no.1,
41-58.
Vuong, Quang H..
Likelihood Ratio Tests for Model Selection and Non-Nested Hypotheses.
Econometrica : journal of the Econometric Society,
vol.57,
no.2,
307-.
※ AI-Helper는 부적절한 답변을 할 수 있습니다.