Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[NCO Bug] Add restart capability for GFS wave post processing jobs #1251

Open
KateFriedman-NOAA opened this issue Jan 18, 2023 · 9 comments
Open
Assignees
Labels
nco-bug Something isn't working in Ops.

Comments

@KateFriedman-NOAA
Copy link
Member

KateFriedman-NOAA commented Jan 18, 2023

Bugzilla #1368

Details from NCO in bugzilla:

Please add the restart capability for GFS wave postprocessing jobs, 

-  gfs_wave_postsbs
-  gfs_wave_post_bndpnt
-  gfs_wave_post_bndpntbll

referred to the "NCEP Central Operations WCOSS Implementation Standards, version 11.0.0", page 13, 
https://www.nco.ncep.noaa.gov/idsb/implementation_standards/ImplementationStandards.v11.0.0.pdf?

During the catchup process after system issue, gfs atmos/wave post processing jobs could fail on waiting upstream 
files. Each time the gfs atmos/wave post processing jobs will processed from the very beginning. 

For the restart capability, suggest to add checking if the product is produced then skip logic in the jobs to avoid 
multiple alerts in rerun. Also it will be best use of the system resource and delivery product efficiently.
@KateFriedman-NOAA KateFriedman-NOAA added the nco-bug Something isn't working in Ops. label Jan 18, 2023
@KateFriedman-NOAA KateFriedman-NOAA added this to the GFSv17 milestone Jan 18, 2023
@HuiyaChuang-NOAA
Copy link

@KateFriedman-NOAA @JessicaMeixner-NOAA I believe Jessica has agreed to the the POC for this

@JessicaMeixner-NOAA
Copy link
Contributor

I can be the POC for this. At some point we'll need to find someone who can help with the non-wave job: gfs_atmos_postsnd as I am not the right person to do that technical work but can certainly serve as a POC.

@HuiyaChuang-NOAA
Copy link

HuiyaChuang-NOAA commented Jan 19, 2023 via email

@KateFriedman-NOAA
Copy link
Member Author

The other bugzilla assigned to Bo (#1245) is unrelated to adding restart capability to the bufr sounding job.

We could assign both @BoCui-NOAA and @JessicaMeixner-NOAA to this issue or I could split the bufr sounding aspect into it's own global-workflow issue. Thoughts from folks?

Either way, I'll need to go through @ShelleyMelchior-NOAA before I can assign @BoCui-NOAA to this or a new one split from this. :)

@WalterKolczynski-NOAA
Copy link
Contributor

Soundings should be a separate issue

@KateFriedman-NOAA KateFriedman-NOAA changed the title [NCO Bug] Add restart capability for GFS atmos/wave post processing jobs [NCO Bug] Add restart capability for GFS wave post processing jobs Jan 19, 2023
@KateFriedman-NOAA
Copy link
Member Author

Alrighty, I split the bufr sounding aspect of this bugzilla into a new issue #1257 .

@WalterKolczynski-NOAA
Copy link
Contributor

#2290 will render this one moot

@WalterKolczynski-NOAA
Copy link
Contributor

Misspoke, #2290 will only update the gridded job. The point jobs will need similar updates.

@JessicaMeixner-NOAA
Copy link
Contributor

There's a plan in place to completely reingineer the point post job. We will ensure in the design it's either under 15 minutes or has a restart capability.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
nco-bug Something isn't working in Ops.
Projects
None yet
Development

No branches or pull requests

4 participants