Add timeout parameter as optional for isvc creation and it corresponding change #135

tarukumar · 2025-02-10T07:50:16Z

I observed that when testing a larger model, which takes more time to download, the default timeout parameter isn't sufficient, causing the model deployment to fail. As a result, I modified the timeout parameter to be an option in the inference function, allowing for easier control.

Signed-off-by: Tarun Kumar <[email protected]>

github-actions · 2025-02-10T07:50:32Z

The following are automatically added/executed:

PR size label.
Run pre-commit
Run tox

Available user actions:

To mark a PR as WIP, add /wip in a comment. To remove it from the PR comment /wip cancel to the PR.
To block merging of a PR, add /hold in a comment. To un-block merging of PR comment /hold cancel.
To mark a PR as approved, add /lgtm in a comment. To remove, add /lgtm cancel.
lgtm label removed on each new commit push.
To mark PR as verified comment /verified to the PR, to un-verify comment /verified cancel to the PR.
verified label removed on each new commit push.

Supported labels

{'/wip', '/lgtm', '/hold', '/verified'}

for more information, see https://pre-commit.ci

tarukumar · 2025-02-10T10:02:30Z

/verified

rnetser · 2025-02-10T14:17:20Z

tests/model_serving/model_server/utils.py

@@ -34,22 +33,25 @@
 LOGGER = get_logger(name=__name__)


-def verify_no_failed_pods(client: DynamicClient, isvc: InferenceService, runtime_name: str | None) -> None:
+def verify_no_failed_pods(
+    client: DynamicClient, isvc: InferenceService, runtime_name: str | None, timeout: int = 5 * 60


please use TIMEOUT_5MIN from constants

rnetser · 2025-02-10T14:19:25Z

utilities/infra.py

@@ -88,6 +88,7 @@ def wait_for_inference_deployment_replicas(
    isvc: InferenceService,
    runtime_name: str | None,
    expected_num_deployments: int = 1,
+    timeout: int = 4 * 60,


please save to constants and re-use (also in create_ns)

adolfo-ab · 2025-02-10T15:16:49Z

tests/model_serving/model_server/utils.py

@@ -34,22 +33,25 @@
 LOGGER = get_logger(name=__name__)


-def verify_no_failed_pods(client: DynamicClient, isvc: InferenceService, runtime_name: str | None) -> None:
+def verify_no_failed_pods(
+    client: DynamicClient, isvc: InferenceService, runtime_name: str | None, timeout: int = 5 * 60


adolfo-ab · 2025-02-10T15:18:32Z

utilities/infra.py

@@ -88,6 +88,7 @@ def wait_for_inference_deployment_replicas(
    isvc: InferenceService,
    runtime_name: str | None,
    expected_num_deployments: int = 1,
+    timeout: int = 4 * 60,


adolfo-ab · 2025-02-10T15:21:38Z

tests/model_serving/model_server/utils.py

@@ -125,6 +127,7 @@ def create_isvc(
    wait_for_predictor_pods: bool = True,
    autoscaler_mode: Optional[str] = None,
    multi_node_worker_spec: Optional[dict[str, int]] = None,
+    timeout: int = 15 * 60,


use constant

Signed-off-by: Tarun Kumar <[email protected]>

tarukumar · 2025-02-10T16:18:28Z

closing this the branch is messed up

Add timeout option

5dfbc90

Signed-off-by: Tarun Kumar <[email protected]>

github-actions bot added the size/s label Feb 10, 2025

tarukumar and others added 2 commits February 10, 2025 14:20

Merge branch 'main' into timeout

fa4ecf1

[pre-commit.ci] auto fixes from pre-commit.com hooks

97406a1

for more information, see https://pre-commit.ci

github-actions bot added the Verified Verified pr in Jenkins label Feb 10, 2025

rnetser requested changes Feb 10, 2025

View reviewed changes

adolfo-ab previously approved these changes Feb 10, 2025

View reviewed changes

utilities/infra.py

4a6b885

Signed-off-by: Tarun Kumar <[email protected]>

tarukumar dismissed adolfo-ab’s stale review via 4a6b885 February 10, 2025 16:12

github-actions bot added size/l and removed size/s Verified Verified pr in Jenkins labels Feb 10, 2025

tarukumar closed this Feb 10, 2025

github-actions bot added the commented-by-tarukumar label Feb 10, 2025

tarukumar mentioned this pull request Feb 10, 2025

Add timeout parameter as optional for isvc creation and it corresponding change #136

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add timeout parameter as optional for isvc creation and it corresponding change #135

Add timeout parameter as optional for isvc creation and it corresponding change #135

tarukumar commented Feb 10, 2025

github-actions bot commented Feb 10, 2025

tarukumar commented Feb 10, 2025

rnetser Feb 10, 2025

adolfo-ab Feb 10, 2025

rnetser Feb 10, 2025

adolfo-ab Feb 10, 2025

adolfo-ab Feb 10, 2025

adolfo-ab Feb 10, 2025

adolfo-ab Feb 10, 2025

tarukumar commented Feb 10, 2025

Add timeout parameter as optional for isvc creation and it corresponding change #135

Add timeout parameter as optional for isvc creation and it corresponding change #135

Conversation

tarukumar commented Feb 10, 2025

github-actions bot commented Feb 10, 2025

tarukumar commented Feb 10, 2025

rnetser Feb 10, 2025

Choose a reason for hiding this comment

adolfo-ab Feb 10, 2025

Choose a reason for hiding this comment

rnetser Feb 10, 2025

Choose a reason for hiding this comment

adolfo-ab Feb 10, 2025

Choose a reason for hiding this comment

adolfo-ab Feb 10, 2025

Choose a reason for hiding this comment

adolfo-ab Feb 10, 2025

Choose a reason for hiding this comment

adolfo-ab Feb 10, 2025

Choose a reason for hiding this comment

tarukumar commented Feb 10, 2025