eCite Digital Repository

Summary goodness-of-fit statistics for binary generalized linear models with noncanonical link functions

Citation

Canary, JD and Blizzard, L and Barry, RP and Hosmer, DW and Quinn, SJ, Summary goodness-of-fit statistics for binary generalized linear models with noncanonical link functions, Biometrical Journal, 58, (3) pp. 674-690. ISSN 0323-3847 (2016) [Refereed Article]

Copyright Statement

Copyright 2015 John Wiley & Sons Ltd/London School of Economics

DOI: doi:10.1002/bimj.201400079

Abstract

Generalized linear models (GLM) with a canonical logit link function are the primary modeling technique used to relate a binary outcome to predictor variables. However, noncanonical links can offer more flexibility, producing convenient analytical quantities (e.g., probit GLMs in toxicology) and desired measures of effect (e.g., relative risk from log GLMs). Many summary goodness-of-fit (GOF) statistics exist for logistic GLM. Their properties make the development of GOF statistics relatively straightforward, but it can be more difficult under noncanonical links. Although GOF tests for logistic GLM with continuous covariates (GLMCC) have been applied to GLMCCs with log links, we know of no GOF tests in the literature specifically developed for GLMCCs that can be applied regardless of link function chosen. We generalize the Tsiatis GOF statistic originally developed for logistic GLMCCs, (TG), so that it can be applied under any link function. Further, we show that the algebraically related Hosmer–Lemeshow (HL) and Pigeon–Heyse (J2) statistics can be applied directly. In a simulation study, TG, HL, and J2 were used to evaluate the fit of probit, log–log, complementary log–log, and log models, all calculated with a common grouping method. The TG statistic consistently maintained Type I error rates, while those of HL and J2 were often lower than expected if terms with little influence were included. Generally, the statistics had similar power to detect an incorrect model. An exception occurred when a log GLMCC was incorrectly fit to data generated from a logistic GLMCC. In this case, TG had more power than HL or J2.

Item Details

Item Type:Refereed Article
Keywords:goodness-of-fit, Hosmer–Lemeshow, noncanonical generalized linear models, Pigeon–Heyse, Tsiatis
Research Division:Mathematical Sciences
Research Group:Statistics
Research Field:Statistics not elsewhere classified
Objective Division:Health
Objective Group:Other health
Objective Field:Other health not elsewhere classified
UTAS Author:Canary, JD (Dr Jana Canary)
UTAS Author:Blizzard, L (Professor Leigh Blizzard)
ID Code:105244
Year Published:2016 (online first 2015)
Web of Science® Times Cited:11
Deposited By:Menzies Institute for Medical Research
Deposited On:2015-12-16
Last Modified:2017-11-01
Downloads:0

Repository Staff Only: item control page