-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
desi_pipe sync doesn't sync redrock output #563
Comments
Also: oddly none of the tasks are in the "running" state, even though many of them did run and finish and others were mid-stream with running when the job got killed. The command I ran was
(note the 15:00 minutes, which I should have made longer) |
Redshift and spectra tasks are “special”, but I agree that their state should move from ready to running. On the other hand I am not sure they are ever “done”. There is a separate DB table that tracks which pixels are touched for a given exposure, and that is how we can schedule redshift tasks. Thanks for opening this to track the question. |
Agreed about the ambiguity of "done" since we don't know if a future observation will contribute more spectra to that same healpix. At the same time, if a job times out and leaves some pixels unfinished, we don't want to have to start over again from the beginning. Suggestion: after redrock finishes a pixel containing all spectra to date, mark it as "done". When Related on the redrock side: if the output zbest file already exists, only refit targets that have new data in the input (desihub/redrock#105). That will avoid wasting CPU time if the pipeline does try to rerun a pixel that doesn't need to be updated. |
This is related to #558, and I'll fix them both in the same PR. |
My redrock job timed out leaving me partially done on the redshift tasks.
desi_pipe sync
doesn't recognize that and still leaves all of my tasks in the "ready" state instead of moving those with file output into the "done" state.Optimistically assigning this to 18.3 'cause it would be really really nice to have this working for this major release.
The text was updated successfully, but these errors were encountered: