Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add timeout parameter as optional for isvc creation and it corresponding change #135

Closed
wants to merge 4 commits into from

Conversation

tarukumar
Copy link
Contributor

I observed that when testing a larger model, which takes more time to download, the default timeout parameter isn't sufficient, causing the model deployment to fail. As a result, I modified the timeout parameter to be an option in the inference function, allowing for easier control.

Signed-off-by: Tarun Kumar <[email protected]>
Copy link

The following are automatically added/executed:

Available user actions:

  • To mark a PR as WIP, add /wip in a comment. To remove it from the PR comment /wip cancel to the PR.
  • To block merging of a PR, add /hold in a comment. To un-block merging of PR comment /hold cancel.
  • To mark a PR as approved, add /lgtm in a comment. To remove, add /lgtm cancel.
    lgtm label removed on each new commit push.
  • To mark PR as verified comment /verified to the PR, to un-verify comment /verified cancel to the PR.
    verified label removed on each new commit push.
Supported labels

{'/wip', '/lgtm', '/hold', '/verified'}

@tarukumar
Copy link
Contributor Author

/verified

@github-actions github-actions bot added the Verified Verified pr in Jenkins label Feb 10, 2025
@@ -34,22 +33,25 @@
LOGGER = get_logger(name=__name__)


def verify_no_failed_pods(client: DynamicClient, isvc: InferenceService, runtime_name: str | None) -> None:
def verify_no_failed_pods(
client: DynamicClient, isvc: InferenceService, runtime_name: str | None, timeout: int = 5 * 60
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please use TIMEOUT_5MIN from constants

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

@@ -88,6 +88,7 @@ def wait_for_inference_deployment_replicas(
isvc: InferenceService,
runtime_name: str | None,
expected_num_deployments: int = 1,
timeout: int = 4 * 60,
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please save to constants and re-use (also in create_ns)

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

adolfo-ab
adolfo-ab previously approved these changes Feb 10, 2025
@@ -34,22 +33,25 @@
LOGGER = get_logger(name=__name__)


def verify_no_failed_pods(client: DynamicClient, isvc: InferenceService, runtime_name: str | None) -> None:
def verify_no_failed_pods(
client: DynamicClient, isvc: InferenceService, runtime_name: str | None, timeout: int = 5 * 60
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

@@ -88,6 +88,7 @@ def wait_for_inference_deployment_replicas(
isvc: InferenceService,
runtime_name: str | None,
expected_num_deployments: int = 1,
timeout: int = 4 * 60,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

@@ -125,6 +127,7 @@ def create_isvc(
wait_for_predictor_pods: bool = True,
autoscaler_mode: Optional[str] = None,
multi_node_worker_spec: Optional[dict[str, int]] = None,
timeout: int = 15 * 60,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

use constant

Signed-off-by: Tarun Kumar <[email protected]>
@github-actions github-actions bot added size/l and removed size/s Verified Verified pr in Jenkins labels Feb 10, 2025
@tarukumar
Copy link
Contributor Author

closing this the branch is messed up

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants