-
Notifications
You must be signed in to change notification settings - Fork 154
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
run_dev.sh failing #163
Comments
Thanks for raising this. We had seen this on pre-release builds of Jetpack 6..0 and VPI 3.2.1 which required running the pre-requisite steps listed here. These steps should not have been necessary with a Jetpack 6.1 machine, however, and we had not seen it since. Could you run these steps on your Jetson and let us know if that resolves it for you? If so, we'll update the instructions and the troubleshooting while we try to reproduce the issue on our end here too. |
Thanks for your feedback. I was not successful in running the steps in the provided link. Here is what I have done so far: Setting up the Jetson from scratchSince my initial post, I reinstalled Jetpack. I did this because I was afraid that some other installations I had done on the system could somehow be interfering with building the image. I have done the following:
These are the only steps I have done in terms of setting up the system which are not directly related to Isaac Isaac setupI set up Isaac according to this: https://nvidia-isaac-ros.github.io/getting_started/dev_env_setup.html . I skipped parts of Step 1, related to how to move docker over to the SSD, since my installation already is on the SSD. However, after my first failure of building ros2-isaac-common on the new JetPack installation I have since added I followed step 2 - 4 and sat up the workspace under isaac_ros_commonI cloned isaac_ros_common repo into This yielded the following error:
Running pre-requisite steps:nvidia-smi returns the following:
According to this it seems like the CUDA driver is recognized, considering that CUDA Version says 12.6. I found it worrisome that no stats are available for the GPU. However, according to some googling, it seems like this is to be expected on Jetson devices? Checking nvidia-container yields:
To me, this looks good. However, running nvidia-ctk gives:
Since this did not work I did check whether a yaml file already existed for cdi and found the following:
I therefore proceeded with the next step with the hope that previous yaml file would be sufficient, however, I was met with another error:
ConclusionI was not able to fix the issue using the steps from your previous comment. Is there something else that I'm missing? Thank you for your assistance |
I was able to start the container after commenting out the below line in run_dev.sh (line 243) as a temporary workaround.
|
+1, I'm experiencing exactly same steps and same issue w/ @Flipsack Setup: |
I was able to build the image when commenting out the line that @mickey13 was suggesting |
Able to replicate.
Able to fix by removing CDI by following @mickey13 |
@Flipsack I followed your steps as #163 (comment) and encoutered the same errors.
This issue was reported and solved here: https://forums.developer.nvidia.com/t/podman-gpu-on-jetson-agx-orin/297734/10?u=development7. The fix is to force csv format: sudo nvidia-ctk cdi generate --output=/etc/cdi/nvidia.yaml --mode=csv Is still got the same error: $ sudo nvidia-ctk runtime configure --runtime=docker --cdi.enabled=true
Incorrect Usage: flag provided but not defined: -cdi.enabled on the next step but this time |
This may be because of an older version of NVIDIA Container Toolkit (see here on how to update to at least 1.16). It is possible the JetPack upgrade from 6.0 to 6.1 did not update the NCT for you I suppose. That should resolve this without any workarounds and keep PVA accessible within the dev container as intended. Alternatively, I had run into the same issues listed here on pre-release JP6.0 and was able to get things mostly working on NVIDIA Container Toolkit 1.14 (could not upgrade to 1.16 because the update list was too extensive) using the
|
Same issue here on Jetson Orin Nano Devkit. @hemalshahNV I was unable to install anything newer than NCT 14.2 on my Jetson following your link, even after configuring experimental packages. @beniaminopozzan After following your fix |
I'm also unable to update NCT above 14.2 on a Jetson Orin NX flashed with JP6.1(rev1) |
I'm trying to use isaac_ros_common
Upon executing ./run_dev.sh I get the follwing error:
Here is the full, verbose, output:
Setup:
Thanks for any help with this issue
The text was updated successfully, but these errors were encountered: