-
Notifications
You must be signed in to change notification settings - Fork 56
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TimeOut error when attempting to run pre-trained RoboThor model checkpoint #363
Comments
Hi @dtch1997, Thanks for the bug report! Can you show me the output from sudo $(which ai2thor-xorg) start There is a way to run |
Hi @Lucaweihs , Sorry to meet you again. I also met the same problem in training models for objectnav.
The error message is:
Meanwhile, here is the
When I tried to run
Furthermore, I found lots of processes in the
I have to use
to kill those processes. Additionally, I also met another error when I run the command:
Here is the error:
Could you help me on these matters? |
@Lucaweihs thanks for the prompt response! I tried running
Is there a way to check whether the necessary displays are on? Also, here's
|
Hi @dtch1997, If I understood it correctly, you might be running from a terminal emulator on a computer with a screen attached (so xserver is probably already up for you, and therefore you shouldn't need to call With your virtual environment active, if you start a python console and then run
, does a window pop up? |
Hi @jordis-ai2 , to test the inference, I am running Ubuntu 22.04 on my laptop with a monitor attached. I am using the built-in Terminal application. When I ran:
There was no window pop-up |
I might be missing some detail, but can you try again after doing something like the first answer in this thread? I'm guessing you're using wayland instead of Xorg xserver. |
@jordis-ai2 I made the change in
And verified that I'm now using
Didn't fix the previous issues |
Hi @jordis-ai2 , I'm experiencing a TimeOut issue with the 'VectorSampledTask' that is similar to the issue you previously helped with. However, I'm using a headless workstation with Xorgs running. I have provided more details, including the error message and commands used, in my previous comments. Would you be able to suggest a solution or provide guidance on how to address this issue? Thank you. |
Hi @dtch1997, I've also encountered this issue while training with AllenAct, but I found a temporary solution that may help you as well. I was able to run the code using 'CloudRendering' by adding I hope this workaround works for you too. |
thanks @xiaobaishu0097 , I tried that but it didn't work due to the following error:
Actually, as far as I understand, I am running the checkpoint without visualization, so I do not understand why rendering is needed or would cause issues. @Lucaweihs would you be able to comment on that? Also, @jordis-ai2 I tried your previous suggestion, which didn't work. Is there possibly some simple diagnostic commands I can run which would help us recognize the problem? |
Hi @dtch1997, I assume there was no error message when you ran my last suggestion (despite no window showing up). I guess I'd try to look at In any case, my general advice is to start running a standalone AI2THOR Let me know if that helps. |
Problem
Unable to run pre-trained RoboThor model checkpoint
Steps to reproduce
Followed all instructions at https://allenact.org/tutorials/running-inference-on-a-pretrained-model/
Then ran:
Got the error:
Expected behavior
Able to run inference and save metrics to tensorboard.
Desktop
Please add the following information:
Additional context
Running on Python 3.8 in Anaconda
The text was updated successfully, but these errors were encountered: