Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Determine number of workers and blocks to set for parsl config in GKE #43

Open
julietcohen opened this issue Jul 29, 2024 · 0 comments
Open

Comments

@julietcohen
Copy link
Collaborator

Within the Kubernetes workflow, regardless if deploying on the ADC cluster or on GKE, the parsl config contains parameters for:

  • max_blocks = 4,
  • min_blocks = 1,
  • init_blocks = 1,
  • max_workers = 4,
  • cores_per_worker = 1,

These parameters determine how the work is distributed across workers. Before re-building and pushing the image, the user should adjust these as needed depending on the size of the input dataset. Parsl's documentation for blocks is here.

@shishichen noted in the GKE documentation: "During the script execution, parsl will bring up additional worker pods as needed."

I confirmed this during my first small run of the GKE workflow. I set the parameters to the values above, and initially there were 4 connected parsl workers, then that jumped to 8. This is not an explicit problem, but ideally we would control this number closely with the config. We should add more documentation about when to use blocks versus workers.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant