Skip to content

Commit

Permalink
Update docs to reference sbatch location (#385)
Browse files Browse the repository at this point in the history
Signed-off-by: Igor Gitman <[email protected]>
Co-authored-by: Shubham Toshniwal <[email protected]>
  • Loading branch information
Kipok and shtoshni authored Feb 13, 2025
1 parent ca597f6 commit 26e8d3b
Show file tree
Hide file tree
Showing 3 changed files with 8 additions and 1 deletion.
2 changes: 1 addition & 1 deletion cluster_configs/example-slurm.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -29,7 +29,7 @@ job_name_prefix: "nemo_skills:"
# ssh_tunnel:
# host: <slurm host>
# user: <username>
# job_dir: <some location on slurm cluster to keep job metadata>
# job_dir: <some location on slurm cluster to keep job metadata, uploaded code and generated sbatch files>
# identity: <can specify ssh key to avoid entering password>


Expand Down
6 changes: 6 additions & 0 deletions docs/basics/prerequisites.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,7 @@
# Prerequisites and Getting Started

<!-- TODO: this grew too long, we should split it into multiple pages -->

## Installation

To get started first install the repo (python 3.10+). Either clone and run `pip install -e .` or install directly with
Expand Down Expand Up @@ -107,6 +109,10 @@ periodically remove local and cluster nemo-run folders to free up space. There i
be automatically recreated) as long as you don't have any running jobs when you remove the folders.
If you want to have more fine-grained control over code reuse, you can directly specify `--reuse_code_exp` argument when submitting jobs

## Inspecting generated sbatch file
While our job submission is somewhat complicated and goes through NeMo-Run, at the end, we simply execute a particular sbatch file
that is uploaded to the cluster. It is helpful sometimes to see what's in it and modify directly. You can find sbatch file(s)
for each job inside `ssh_tunnel.job_dir` cluster folder that is defined in your cluster config.

## Running pipelines

Expand Down
1 change: 1 addition & 0 deletions docs/pipelines/decontamination.md
Original file line number Diff line number Diff line change
Expand Up @@ -45,6 +45,7 @@ cmd = (
run_cmd(
cluster="local",
container="nemo",
num_gpus=1, # can increase this if you have more gpus
ctx=wrap_arguments(cmd),
)
```
Expand Down

0 comments on commit 26e8d3b

Please sign in to comment.