Encoding using multiple-GPUs #541
I am doing a task where I am recommending products based on their reviews. To do that I am using your library, and when I am creating the index, I would like to know whether there is a way to utilize several GPUs, because the time it takes to encode the review text is huge and I have a multi-GPU environment.

Also, the task expects explanations, so I would like to know whether the same can be done when I execute the explain method.

Thank you.
This has been brought up from time to time and is a good reminder that better integrated support should be added. One less-than-pretty workaround that has been used in the past:

```python
import torch

embeddings.model.model.model = torch.nn.DataParallel(embeddings.model.model.model)
```

Might be worth trying that to see if it improves performance at all.
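For illustration, a minimal end-to-end sketch of that workaround. It assumes the default sentence-transformers vector backend (so the nested `model.model.model` attribute path is valid) and that the vector model is loaded at construction; the model name and documents are placeholders.

```python
import torch
from txtai.embeddings import Embeddings

# Placeholder model; any sentence-transformers model works the same way
embeddings = Embeddings({"path": "sentence-transformers/all-MiniLM-L6-v2"})

# DataParallel splits each encoding batch across all visible GPUs
if torch.cuda.device_count() > 1:
    embeddings.model.model.model = torch.nn.DataParallel(embeddings.model.model.model)

# Index as usual; encoding batches now fan out across GPUs
documents = ["great product, works as advertised", "battery drains too quickly"]
embeddings.index([(uid, text, None) for uid, text in enumerate(documents)])
```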
Let's keep this open and I'll take a look at better multi-GPU support in an upcoming release. Ultimately, the best solution given Python's GIL is to spawn a pool of encoding processes, one per GPU.
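As a sketch of that process-per-GPU idea, sentence-transformers (the default encoder behind txtai) already ships a multi-process encoding pool that spawns one worker per CUDA device; the model name here is a placeholder:

```python
from sentence_transformers import SentenceTransformer

def encode(texts):
    model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")

    # Spawns one worker process per visible CUDA device by default
    pool = model.start_multi_process_pool()

    # Input is chunked and distributed across the worker processes
    vectors = model.encode_multi_process(texts, pool)

    model.stop_multi_process_pool(pool)
    return vectors

if __name__ == "__main__":
    # Guard is required because the pool spawns child processes
    print(encode(["first review", "second review"] * 1000).shape)
```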
Thank you for the response.
But this seems to underutilise the two GPUs in the Kaggle free tier. Any suggestions?
If you're going to use the multi-process pool, make sure to undo the torch DataParallel wrapper first. This most likely needs a dedicated effort to optimize encoding for 2 GPUs; I only develop with 1 GPU, so it's not a use case I've prioritized.
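For reference, undoing the wrapper means restoring the module that DataParallel stores under its `.module` attribute; a sketch, assuming the `embeddings` instance and attribute path from the earlier snippet:

```python
import torch

inner = embeddings.model.model.model

# DataParallel keeps the original module under .module
if isinstance(inner, torch.nn.DataParallel):
    embeddings.model.model.model = inner.module
```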
I was able to significantly speed up the process using this external encoding handling feature. An encoding that took around 1 hour in a single-GPU environment was reduced to 18 minutes. But it would be nice if this were a built-in feature of your library.
Glad to hear it! Nice to see you were able to use an external transform function to solve this. Will keep this open to add a similar method directly to txtai.
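In the meantime, a sketch of the external transform approach, assuming txtai's "external" vectorization method accepts a callable that maps a list of texts to an array of vectors (model name and documents are placeholders):

```python
from sentence_transformers import SentenceTransformer
from txtai.embeddings import Embeddings

if __name__ == "__main__":
    model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")

    # One encoding worker process per GPU
    pool = model.start_multi_process_pool()

    def transform(inputs):
        # txtai calls this with batches of text; encoding fans out across GPUs
        return model.encode_multi_process(list(inputs), pool)

    # "external" delegates vectorization to the supplied callable
    embeddings = Embeddings({"method": "external", "transform": transform})
    embeddings.index([(uid, text, None) for uid, text in enumerate(["doc one", "doc two"])])

    model.stop_multi_process_pool(pool)
```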