Systematic review of statistical methods for analyzing healthcare cost in administrative data

Aún no traducido Aún no traducido
Autores
Categoría Revisión sistemática
Pre-printResearchSquare
Año 2019
Cargando información sobre las referencias
Abstract BACKGROUND Healthcare costs are increasing at alarming rates in the United States (US) putting a heavy burden on the healthcare reimbursement system. Cost and cost savings have become an important focus as health policy administrators are tasked with determining the most effective allocation of limited resources. The availability of large databases, such as administrative data, comes with many challenges for analyses, including: skewed data, inflated zero counts, and potential selection bias among comparison groups. Thus, it is imperative that they are evaluated correctly. There are many different methods currently being used to estimate costs including: generalized linear models with a log link, natural logarithm transformed costs, gamma distribution, median regression, two-part models, and Bayesian models. This systematic review will identify which methods are statistically and mathematically appropriate for large claims data. METHODS Scopus and Ovid were searched for potential statistical method papers using multivariable modelling of cost that were published up to the end of February 2018. Inclusion criteria required either a comparison of two or more statistical methods to analyze cost or one statistical method performed on two or more different types of cost data. This systematic review follows the guidelines according to Preferred Reposting Items for Systematic Reviews and Meta-Analysis (PRISMA). RESULTS The review identified 1,048 potential papers, of which, 80 met the inclusion criteria for a full article review. There was a total of 9 papers included in the systematic review; one paper included simulations and eight papers assessed real cost data. There were 28 models assessed across the nine papers with ordinary least squares (OLS) and generalized linear models (GLM) being the most common. CONCLUSIONS GLM using the gamma distribution was included in all but two of the comparisons. Most other models that were compared to the GLM Gamma distribution with log link found it to be the superior model in both simulated data and real administrative data.
Epistemonikos ID: 2cbe42c7d037a54c7c74da4bd7ab8a9a83f3a138
First added on: Jan 26, 2021