You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In our use case, we are working on integrating Argo retries with Metaflow’s retried Argo workflows. This environment variable would allow us to set a limit on how many times a user can retry an Argo workflow.
That said, beyond our specific use case, adding this configuration flexibility would be generally useful.
Current Behaviour
importpandasaspdfrommetaflowimport (
FlowSpec,
Parameter,
card,
project,
step,
retry
)
@project(name="dummy_project")classHelloWorld(FlowSpec):
force_error=Parameter("force-error", type=bool, default=False)
@card@stepdefstart(self):
print("something")
self.next(self.end)
@card@retry(times=10)@stepdefend(self):
ifself.force_error:
raiseException("Testing errors in metaflow")
print(f"the data artifact is: {self.my_var}")
if__name__=="__main__":
HelloWorld()
Running the above flow locally via python hello_world.py run throws the following exception
Metaflow 2.14.0 executing HelloWorld for user:j.kollipara
Project: dummy_project, Branch: user.j.kollipara
Validating your flow...
The graph looks good!
Running pylint...
Pylint is happy!
Flow failed:
The maximum number of retries is @retry(times=4).
error: Recipe `_poetry-run` failed with exit code 1
Currently, a user can attempt to run a specific task up to a maximum of 6 times. It would be beneficial to make this value configurable.
In our use case, we are working on integrating Argo retries with Metaflow’s retried Argo workflows. This environment variable would allow us to set a limit on how many times a user can retry an Argo workflow.
That said, beyond our specific use case, adding this configuration flexibility would be generally useful.
Current Behaviour
python hello_world.py run
throws the following exceptionSource code of the above error:
metaflow/metaflow/plugins/retry_decorator.py
Lines 30 to 37 in 5c960ea
Proposed Behaviour
Setting
METAFLOW_MAX_ATTEMPTS=12
would allow users to run the above flow.The text was updated successfully, but these errors were encountered: