
Document explaining model structures and development plans #114

Merged: 7 commits into main, Feb 5, 2025

Conversation

eschrom
Collaborator

@eschrom eschrom commented Feb 1, 2025

There are many directions we could take in refining existing models and/or introducing new model structures. I have written a document summarizing where we are now, some of the issues observed, and ideas for model development.

@eschrom eschrom requested review from swo and Fuhan-Yang February 1, 2025 01:27
Collaborator

@swo swo left a comment


I'm only half-way through, but wanted to get some thoughts started!

docs/model_development.md (outdated; resolved)
There are two broad types of models under consideration: autoregressive (AR) and full-curve (FC) models.

- AR: Incident uptake at time $t$, $u_t$, is a function of previous incident uptake value(s), among other predictors.
- FC: Cumulative uptake at time $t$, $c_t$, is a function of time, among other predictors.
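A minimal sketch of the two forms (function and parameter names are hypothetical; a Hill curve stands in for a generic FC model):

```python
import numpy as np

def ar_step(u_prev, alpha, beta, eps=0.0):
    """AR: incident uptake at time t is a (noisy) function of uptake at t-1."""
    return alpha + beta * u_prev + eps

def fc_hill(t, A, H, n):
    """FC: cumulative uptake at time t is an explicit function of time."""
    return A * t**n / (H**n + t**n)

# Roll an AR model forward three steps (noise omitted for clarity):
u = [0.05]
for _ in range(3):
    u.append(ar_step(u[-1], alpha=0.01, beta=0.8))

# Evaluate an FC model at arbitrary times:
c = fc_hill(np.array([30.0, 120.0, 300.0]), A=0.5, H=120.0, n=4.0)
```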
Collaborator

I think the more typical phrasing is "parametric curve fitting," but that might be jargon I made up

Collaborator Author

Well I definitely made up the "full curve" jargon haha. As long as it is clear what we are talking about, and how it differs from AR, the specific jargon doesn't matter to me.

docs/model_development.md (three resolved comment threads)
@swo swo self-requested a review February 4, 2025 14:55
Contributor

@Fuhan-Yang Fuhan-Yang left a comment


Thanks Ed for coming up with model ideas! My comments are research questions, which should not gate the model development.

docs/model_development.md (resolved)
## Forecast Uncertainty

Forecasting with AR models naturally produces a cone of uncertainty that expands into the future. Each draw from the posterior distribution is a unique combination of parameter values that defines a trajectory of uptake going forward (still with some stochastic influence on observations, from $\sigma$). All these trajectories sprout from the last observed data point and diverge as they move into the future.

Contributor

I'm still not clear on how forecast uncertainty compounds in the AR model. When the model is fitted, we get posterior matrices for the parameters $[\alpha, \beta_x, v]$. We draw X rows (random indices) from the posterior matrices to calculate $f_{t+1}$, giving X values of $f_{t+1}$. We then forecast $f_{t+2}$ using the parameter estimates (X samples? or the posterior mean) together with the X values of $f_{t+1}$. If each $f_{t+1}$ is paired with another X samples of $[\alpha, \beta_x, v]$, the number of samples increases exponentially as we forecast sequentially. Not sure whether the uncertainty will explode.

Collaborator

See my note above about distinguishing the observed uptake data from the latent uptakes. The cone of uncertainty expands because of the combinations of uncertainties on $\hat{u}_0$, $\beta_i$, and $\epsilon_t$

Collaborator Author

Even before we distinguish true latent uptake from observed uptake, it is worth clarifying how uncertainty compounds into the future when forecasting with our current AR model. Each sample $[\alpha, \beta_x, \sigma]$ from the posterior works like this: $[\alpha, \beta_x]$ defines a deterministic trajectory into the future, but at each timepoint along the way there is still uncertainty thanks to $\sigma$. So at each timepoint, one value is sampled to serve as the basis for projecting the next timepoint, and so on. There is no branching into X new samples at each step, so the number of trajectories stays constant while their spread grows.
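That sequential sampling can be sketched as follows (one trajectory per posterior draw; all names and the fake posterior draws are hypothetical):

```python
import numpy as np

def forecast_ar(u_last, alphas, betas, sigmas, horizon, seed=0):
    """For each posterior draw (alpha, beta, sigma), roll one trajectory forward.
    At every step, noise is sampled from N(0, sigma) and the sampled value
    becomes the basis for the next step, so uncertainty compounds over time
    without the number of trajectories ever growing."""
    rng = np.random.default_rng(seed)
    n_draws = len(alphas)
    traj = np.empty((n_draws, horizon))
    u = np.full(n_draws, u_last)   # every trajectory starts at the last observation
    for t in range(horizon):
        u = alphas + betas * u + rng.normal(0.0, sigmas)
        traj[:, t] = u
    return traj

# Fake posterior draws, just to illustrate the widening cone:
rng = np.random.default_rng(1)
alphas = rng.normal(0.01, 0.002, 500)
betas = rng.normal(0.9, 0.02, 500)
sigmas = np.abs(rng.normal(0.01, 0.002, 500))
traj = forecast_ar(0.05, alphas, betas, sigmas, horizon=10)
spread = traj.std(axis=0)   # grows with the horizon
```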

This diagram may help: [two diagram images, not reproduced]

$$
\begin{align*}
&\ldots \\
&A,~H,~n,~\sigma_A,~\sigma_H,~\sigma_n,~\sigma \sim \text{prior distributions}
\end{align*}
$$

Contributor

How do we ensure that $c_t$ is bounded by $[0,1]$ without bounding $A$, $H$, $n$, $\sigma_A$, $\sigma_H$, $\sigma_n$, and $\sigma$? Without any constraints, the support of these hyperparameters is all of $\mathbb{R}$.

Collaborator Author
@eschrom eschrom Feb 4, 2025

Good point! In principle, you are correct. In practice, I imagine that thin enough prior distributions for $A$ and $\sigma_A$ would prevent predicted values of $c_t$ from escaping their bounds.
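As a sanity check of this point: the Hill curve is bounded above by $A$, so $c_t$ stays in $[0,1]$ whenever the sampled $A$ does. A sketch (assuming, hypothetically, a logit-normal prior on $A$, which guarantees the bound rather than merely making violations rare):

```python
import numpy as np

def hill(t, A, H, n):
    """Hill curve for cumulative uptake: rises from 0 toward the plateau A."""
    return A * t**n / (H**n + t**n)

# The curve never exceeds A, so c_t is in [0, 1] exactly when A is.
# A "thin" Gaussian prior makes draws of A > 1 vanishingly rare;
# a transformed prior like the logit-normal below rules them out entirely.
rng = np.random.default_rng(0)
A_draws = 1.0 / (1.0 + np.exp(-rng.normal(0.0, 1.0, 1000)))  # in (0, 1)
t = np.linspace(1.0, 365.0, 50)
curves = hill(t[None, :], A_draws[:, None], 120.0, 4.0)
```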

$$
\begin{align*}
&u_t \sim N(\mu, \sigma) \\
&\mu = \alpha_s + \beta_{u,s}u_{t-1} + \beta_{t,s}t + \beta_{tu,s}tu_{t-1} \\
&\alpha_s \sim N(\alpha,~\sigma_{\alpha}) \\
&\ldots
\end{align*}
$$
Collaborator

I think this is conceptually equivalent to deciding when $t=0$ is?

Like, either you say "t=0 is always Sep 1, and there might have been some amount of uptake by that point, and we fit for that amount" or you say "there is some time, around Sep 1, before which there was practically no uptake, and we fit for that date"

Personally, I like having the t=0 date be the thing we fit for, because then you can have a wider prior (e.g., for normal flu seasons) or a very tight prior (e.g., when we know the exact rollout date for a Covid vaccine)
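A sketch of what fitting the t=0 date could look like (all dates, prior widths, and names hypothetical): the onset $t_0$ becomes a parameter, with a wide prior for a typical flu season and a tight prior when the rollout date is known.

```python
import numpy as np

def uptake_with_onset(t, t0, A, H, n):
    """Cumulative uptake that is exactly zero before the fitted onset date t0."""
    s = np.clip(t - t0, 0.0, None)   # time since rollout; zero before rollout
    return A * s**n / (H**n + s**n)

rng = np.random.default_rng(0)
# Wide prior on onset for a normal flu season (~early September, sd two weeks)...
t0_flu = rng.normal(244.0, 14.0, 1000)
# ...versus a tight prior when a Covid rollout date is known almost exactly.
t0_covid = rng.normal(258.0, 1.0, 1000)
```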

Collaborator

Another reason to prefer t=0 being the fitted variable is that you decouple the temporal question (when does the season start) from overall uptake. A season with higher overall uptake probably also has higher uptake at t=0 (i.e., $\alpha$), so you would get correlations there?

Collaborator

This is the same as #30 , I think

Collaborator Author

Yep! That is #30.

Collaborator Author
@eschrom eschrom Feb 4, 2025

On closer inspection, your comments are about #30 here, but this section is not. This section is about allowing the model parameters to differ by season, to account for different seasons having slightly different curve shapes. But I think some discussion on how rollout is handled is important to add.


And again, factors other than season, such as geographic area or demographic group, could be used to group the data.

## Forecasting Uncertainty
Collaborator

I think parametric curve fitting should still work... you get the greatest uncertainty on $\hat{u}$ in places where you don't have data, i.e., in the future. It's like any regression: error bars are largest as you project further away from the data.

Collaborator Author

I disagree. Think of the Hill model parameters: A = maximum uptake, H = half-maximal time, and n = steepness. Uncertainty at the end of the season depends mostly on uncertainty in A. Uncertainty at the middle of the season depends on uncertainty in A, H, and n.

Moreover, if we are fitting on 15 past seasons of data and the first half of this season, that means we have 16 data points per timepoint in early and midseason, and 15 data points per timepoint at the end of season. That's nearly the same amount of data to fit with, no matter where you are along the curve.

This is an old figure, but... The red curve here is a Hill model, fit to the flu data before ~Oct 2024, and then projecting ~Nov 2024 onward. (There are no hyperparameters for season here.) Note that the 95% credible interval starts somewhat wide, even right where the observations leave off, and it does not expand much into the future.

Collaborator

Good point: for a linear regression, the cone of uncertainty continues to grow as you move away from where you have data, but for vaccine uptake, which we know will be a sigmoid that plateaus at some value, we really only have some finite uncertainty about that final uptake

So maybe this actually is desirable behavior? The posterior uncertainty in A is really the thing that matters. And we actually do have, as you point out, for flu, a lot of information about what that's going to be.

Maybe another way I would read this is: the LIUM has more uncertainty about H (half-way point). Like, the LIUM cone (and even point estimates) suggest that uptake is still rising at day 365, while CHM is confident it will have turned over by day 200. So the "cone of uncertainty" isn't so much about A but about H?

I'm not sure if the shaded areas are uncertainty on the latent (i.e., predicted "true" uptake) or the predictions (i.e., including measurement error). Maybe what this is saying is that measurement error is large compared to uncertainty in the latent curve?

Collaborator Author
@eschrom eschrom Feb 5, 2025

I do not see this as desirable behavior. Even if the Hill model matched the data perfectly at the junction between data and forecast (i.e. at the forecast date), the wide uncertainty there says that cumulative uptake could plausibly go down ~5% in the week after the forecast date. That's clearly not right.

Remember, there is no distinction between latent true uptake and noisy observed uptake in these models, yet. And I believe that might be the root of the problem.

  • Because the autoregressive model only estimates the relationship between pairs of successive weeks, when forecasting, it trusts the last data point before the forecast date absolutely. Uncertainty compounds into the future from there.
  • Because the Hill model estimates the shape of a full curve, when forecasting, it does not trust the last data point before the forecast date any more than it trusts any other historical data point. Uncertainty captures variations across the historical curves, even on the forecast date itself.

I am soon to submit another PR which I hope will take a step toward illustrating this a bit better.

@eschrom
Collaborator Author

eschrom commented Feb 5, 2025

I think this doc has already done its job, which is to stimulate thoughtful discussion about the next steps in model development. In particular, I think the autoregressive model that @swo outlined, separating latent true uptake from noisy observed uptake, should be the next step.

So I plan to merge this doc and to edit it periodically in the future, whenever we need a catalyst for more model development brainstorming.

@eschrom eschrom merged commit a45956e into main Feb 5, 2025
2 checks passed
@eschrom eschrom deleted the ecs_modeldoc branch February 5, 2025 22:40
Fuhan-Yang pushed a commit that referenced this pull request Feb 7, 2025