Loading...

JOURNAL OF EMERGING AND RARE DISEASES (ISSN:2517-7397)

Estimating the Growth Rate and Doubling Time for Short-Term Prediction and Monitoring Trend During the COVID-19 Pandemic with a SAS Macro

Stanley Xu1,2 , Christina Clarke1, Susan Shetterly1, Komal Narwaney1

1 The Institute for Health Research, Kaiser Permanente Colorado, Kaiser Permanente Colorado, 2550 S. Parker Road, Aurora, CO, United States
2 School of Public Health, University of Colorado, 13001 E 17th Pl, Aurora, United States

CitationCitation COPIED

Xu S, Clarke C, Shetterly S, Narwaney K. Estimating the Growth Rate and Doubling Time for Short-Term Prediction and Monitoring Trend During the COVID-19 Pandemic with a SAS Macro. J Emerg Rare Dis. 2020 May;3(1):121.

© 2020 Xu S, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 international License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Abstract

Coronavirus disease (COVID-19) has spread around the world causing tremendous stress to the US health care system. Knowing the trend of the COVID-19 pandemic is critical for the federal and local governments and health care system to prepare plans. Our aim was to develop an approach and create a SAS macro to estimate the growth rate and doubling time in days if growth rate is positive or half time in days if growth rate is negative. We fit a series of growth curves using a rolling approach. This approach was applied to the hospitalization data of Colorado State during March 13th and April 13th. The growth rate was 0.18 (95% CI=(0.11, 0.24)) and the doubling time was 5 days (95% CI= (4, 7)) for the period of March 13th-March 19th; the growth rate reached to the minimum -0.19 (95% CI= (-0.29, -0.10)) and the half time was 4 days (95% CI= (2, 6)) for the period of April 2nd – April 8th. This approach can be used for regional short-term prediction and monitoring the regional trend of the COVID-19 pandemic.

Background

In December 2019, an outbreak of coronavirus disease (COVID-19) caused by the novel coronavirus (SARS-CoV-2) began in Wuhan, China and has now spread across the world [1,2]. In the United States, the cumulative number of identified COVID-19 cases was 186,101 as of March 31st, 2020; among the identified cases, 3603 died [3]. To slow the spread of COVID-19, federal and local governments have issued mitigation measures such as case isolation, quarantine, school closures and closing non-essential businesses. The COVID-19 pandemic imposes tremendous challenges to the US health care system, particularly given concerns that the need for hospital beds and ICU beds could exceed capacity [4-6]. Predicting the future numbers of COVID-19 cases and healthcare utilization is critical for governments and health care systems preparation plans [4,6,7]. Two useful and critical quantities for prediction are the growth rate [8] and the doubling time of number of events [9]. The growth rate is the percent change of daily events (e.g, COVID-19 cases, number of patients hospitalized or number of deaths). The doubling time is the length of time required to double the number of daily events.

Our goal was to develop an approach and create a SAS macro using observed data to estimate the growth rate and doubling time in days for short-term prediction.

Methods

A rolling growth curve approach (RGCA)

In the United States, there are several barriers for testing people for COVID-19 such as shortages of swabs and testing kits and restrictions on who should get tested. Therefore, the number of COVID-19 cases is often under-identified and under-reported. However, the number of hospitalized COVID-19 patients (hospitalizations) and number of deaths due to COVID-19 are more reliable than the reported number of COVID-19 cases [10]. In this paper, we used the number of daily hospitalized COVID-19 patients to calculate the growth rate and doubling time in days.

We assumed a growth curve of daily hospitalizations over a period of n days from day t (start day) to day (t n + −1). Let y (t + j -1) denote the daily hospitalizations at day ( t + j - 1) 1≤ ≤j n . Based on the growth model, we have

where t y is the number of hospitalizations at the start day t ; r is the growth rate. When the growth rate r > 0 , the number of daily hospitalizations increases. For example, if r =0.4, the growth rate of hospitalizations is 40% more for each day. When growth rate r = 0 , the number of daily hospitalizations has no change. When growth rate r < 0 , the number of daily hospitalizations declines. When the number of hospitalizations doubles at j D= ,  that is y ( t + D - 1 ) = 2 y we have 2 y t = y t ( 1 + r )D-1 Further, it can be shown that


We fit two models: a) using equation (1) which estimates the growth rate r ; b) using equation (1) with r substituted with

from equation (2). The second model estimates the doubling time in days D , meaning that it takes D days from the start day t for the number of daily hospitalizations to double. We used SAS PROC NLIN [11] to fit these two nonlinear models. Note that equation


estimated D represents the days required to reduce the number of hospitalizations by half (half time).

Because the growth rate and doubling time may change over time, we used a rolling growth curve approach (RGCA). For example, we set the length of the period to be 7 days ( n 7 = days). We estimated the growth rate and the doubling time in days for the following periods for hospitalization data from Colorado State from March 13th – April 13th [12]: March 13th-19th,14th-20th, 15th-21st,…, April 7th-April 13th.

Short-term prediction

The estimated growth rate from the last period of the RGCA approach (e.g., April 7th-April 13th) can be used for future short-term prediction of hospitalizations. Let denote the last day of the last period, k y is the number of hospitalizations on this day. For the Colorado hospitalization data in this analysis, k is April 13th, 36 k y = .

Let m denote the date after date k , then the predicted my is  


where  rk is the estimated growth rate from the last period. As the growth rate changes over time, the prediction is only appropriate for short-term prediction (e.g., within 7 days) and updated growth rates should be used.

Results

We estimated a series of growth rates using RGCA with a length of 7 days. The estimated growth rates and 95% CIs were plotted over time using the mid-day of a 7 day period (Figure 1). The growth rate peaked with a value of 56.2% at the mid-day of March 18th for the period March 15th and March 21st. Between March 18th and April 1st, although the growth rate continuously decreased, the daily number of hospitalizations increased because of positive growth rates. We started to observe negative growth rates after April 1st, except for a positive growth rate on April 9th. The growth rate reached its minimum at the mid-day of April 5th (period April 2nd-April 8th) with a value of -19.2%. The growth rate then increased after April 5th. Note that a negative growth rate represents a reduction in number of hospitalizations.

The doubling time (growth rate>0) and half time (growth rate<0) in days over time are displayed in Figure 2. Before April 1st, the y-axis represents the doubling time in days because of positive growth rates. After April 1st, except for a positive growth rate on April 9th, the y-axis represents a half-time because of negative growth rates. On April 1st, the reduction rate was very small (0.5%) which resulted in a high half-time, 128 days with very wide 95% CIs (not shown in the figure). On April 8th, there was a small reduction rate (1.2%) resulting in 55 days of half time.

Using the estimated growth rate from the last period April 7thApril 13th,  r k = 0.027, the predicted numbers of daily hospitalizations for April 14th and 15th were 35 and 34, respectively. SAS programs are available for conducting these analyses (Appendix A and Appendix B).


Figure 1: Estimated growth rate with 95% CIs over time using hospitalization data from Colorado State

Figure 2: Estimated doubling (or half) time in days with 95% CIs over time using hospitalization data from Colorado State

Discussion

These models can be similarly applied to death data if they are available and not sparse. When COVID-19 testing is widely available to the public and the number of COVID-19 testing is less selective, these models can also be used to directly estimate the growth rate and the doubling time for COVID-19 cases. Due to a lag in reporting hospitalization, it is recommended to exclude the recent 1-2 days’ hospitalization data in fitting the growth curves. This paper illustrates that hospitalization data can be used to estimate the growth rate and doubling (or half) time to aid predicting future hospitalizations, deaths and COVID-19 cases. Because a series of growth curves were fit, the RGCA approach can also be used for real-time monitoring of the epidemic trend as shown in Figure 1.

Colorado state issued three social distancing guidelines: a voluntary social distancing on March 17th, closing educational facilities on March 23rd, and closing non-essential services on March 26th (Figure 1). It takes some time (e.g., 2 weeks) for these mitigation measures to have impact. Although the effectiveness of these mitigation measures has not been investigated formally, it is believed that they helped to slow the spread of COVID-19 and reduced the number of hospitalizations and death in Colorado and across the United States. 

Acknowledgements

This research was supported by the Institute for Health Research, Kaiser Permanente Colorado. Xu was also supported by NIH/NCRR Colorado CTSI Grant Number UL1 RR025780.

APPENDIX A


APPENDIX B