Option to run as sequence-to-one and sequence-to-sequence #13

jmframe · 2021-09-28T14:08:46Z

Short description explaining the high-level reason for the new issue.

Current behavior

The model state space persists throughout the simulation time.

Expected behavior

NeuralHydrology trains the LSTM to reset the state space and run a full sequence length of simulation at each time step.

Steps to replicate behavior (include URLs)

Either

Have an option to pass the full sequence to the LSTM at each time step, which means also re-setting the state space at each time step. This option would increase run time by a factor equal to the sequence length. For example, if the training sequence length is 336 timesteps, this would require 336 calls the the model for each time step of prediction.
Train the NeuralHydrology CustomLSTM and eliminate the state re-setting.

Screenshots

madMatchstick · 2022-08-16T22:29:17Z

@jmframe,

Are you able to provide more details on this one? Specially, are you suggesting to modify the forward pass via neuralhydrology's customlstm as defined here?

Thank you in advance, cheers!

jmframe · 2022-08-17T13:54:15Z

I think the two options are 1) change the way the forward pass is made, yes, but not through NH, do that through the BMI, and 2) change the way the model is trained with NH to match the way we have implemented it. I think option 1 is much more reasonable, for the short term. In the long term, the thing to do is develop a BMI directly in the NH code, so there is no potential for conflict between training and forward ngen predictions.

SnowHydrology · 2024-01-26T15:41:03Z

@jmframe I'm digging up old issues today. What are the advantages to resetting the state space each time step? It seems unlikely we'd want to multiply the current execution time by the sequence length unless some dramatic performance increases (i.e. better streamflow prediction) resulted. At some point, given time resources, we could retrain LSTM to eliminate the state resetting per your second recommendation.

jmframe · 2024-01-26T16:36:30Z

Well. The advantage of resetting the state space and passing in the full sequence is that is how the model is trained, and that is what the weights of the model are trained to respond to. But, I did quite a bit of experimenting, and didn't find that it was much different, results-wise. The issue of computation and time constraints is an import one. It would be good to evaluate the performance of the large runs to see what this means in terms of extra costs.

There is a funded CIROH project to do all this, which was supposed to start in 2022, but you know the story...

SnowHydrology · 2024-02-06T20:28:56Z

That's good to know @jmframe. I'll leave this issue open.

madMatchstick added the enhancement New feature or request label Aug 5, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Option to run as sequence-to-one and sequence-to-sequence #13

Option to run as sequence-to-one and sequence-to-sequence #13

jmframe commented Sep 28, 2021

madMatchstick commented Aug 16, 2022

jmframe commented Aug 17, 2022

SnowHydrology commented Jan 26, 2024

jmframe commented Jan 26, 2024

SnowHydrology commented Feb 6, 2024

Option to run as sequence-to-one and sequence-to-sequence #13

Option to run as sequence-to-one and sequence-to-sequence #13

Comments

jmframe commented Sep 28, 2021

Current behavior

Expected behavior

Steps to replicate behavior (include URLs)

Screenshots

madMatchstick commented Aug 16, 2022

jmframe commented Aug 17, 2022

SnowHydrology commented Jan 26, 2024

jmframe commented Jan 26, 2024

SnowHydrology commented Feb 6, 2024