Riley, RD ORCID: https://orcid.org/0000-0001-8699-0735, Van Calster, B and Collins, GS (2020) A note on estimating the Cox-Snell R2 from a reported C statistic (AUROC) to inform sample size calculations for developing a prediction model with a binary outcome. Statistics in Medicine.

[img]
Preview
Text
sim.8806.pdf - Published Version
Available under License Creative Commons Attribution.

Download (864kB) | Preview

Abstract

In 2019 we published a pair of articles in Statistics in Medicine that describe how to calculate the minimum sample size for developing a multivariable prediction model with a continuous outcome, or with a binary or time-to-event outcome. As for any sample size calculation, the approach requires the user to specify anticipated values for key parameters. In particular, for a prediction model with a binary outcome, the outcome proportion and a conservative estimate for the overall fit of the developed model as measured by the Cox-Snell R2 (proportion of variance explained) must be specified. This proposal raises the question of how to identify a plausible value for R2 in advance of model development. Our articles suggest researchers should identify R2 from closely related models already published in their field. In this letter, we present details on how to derive R2 using the reported C statistic (AUROC) for such existing prediction models with a binary outcome. The C statistic is commonly reported, and so our approach allows researchers to obtain R2 for subsequent sample size calculations for new models. Stata and R code is provided, and a small simulation study.

Item Type: Article
Uncontrolled Keywords: C statistic (AUROC), R squared, clinical prediction model, sample size
Subjects: R Medicine > RC Internal medicine > RC0254 Neoplasms. Tumors. Oncology (including Cancer)
Related URLs:
Depositing User: Symplectic
Date Deposited: 22 Dec 2020 14:56
Last Modified: 22 Dec 2020 14:58
URI: https://eprints.keele.ac.uk/id/eprint/9038

Actions (login required)

View Item View Item