Train SpaCy custom pipelines in AWS Trainium #13092
-
Hello SpaCy community! First of all, I want to express my gratitude for all the incredible work you've done. Currently, I'm utilizing an AWS instance for training multiple models. Recently, I came across a new PyTorch-Neuron-based instance called AWS Trainium (you can find more information here: https://aws.amazon.com/machine-learning/trainium/). I'm curious to know whether it's feasible to use SpaCy to train custom pipelines on this specific instance or if there are any plans to implement a SpaCy version that is compatible with it. Thanks for your insights.
Replies: 2 comments 1 reply
-
Hi @Shanoa00, we don't have any plans to add support specifically for Trainium. You should be able to train spaCy models on Trainium instances though (if you run into problems, let us know).
-
Thanks for your response @rmitsch. I am running into errors related to GPU usage. Since AWS Trainium uses PyTorch-Neuron, the CUDA and CuPy libraries are not required. According to this FAQ (https://awsdocs-neuron.readthedocs-hosted.com/en/latest/general/faq/training/neuron-training.html#can-i-use-cuda-libraries-with-aws-trainium), you need to strip the CUDA-specific code out, and from that point the Neuron compiler will take the model as is and optimize it at the hardware level.
=================================== train ===================================
The instance I am using has:
Thanks for bringing this up! We'll discuss AWS Trainium support internally. For now, I can only recommend using non-Trainium instances.
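For anyone hitting the same GPU errors: spaCy's GPU path depends on CuPy and CUDA, so one possible workaround (not confirmed anywhere in this thread) is to run training on the CPU, which is what `--gpu-id -1` selects in `spacy train`. Below is a minimal sketch under that assumption. The helper names `build_train_command` and `run_training`, and the paths `config.cfg` and `./output`, are placeholders for illustration, not part of spaCy.

```python
import subprocess
import sys


def build_train_command(config_path, output_dir, gpu_id=-1):
    """Build the argument list for `python -m spacy train`.

    gpu_id=-1 tells spaCy to train on the CPU, so no CUDA/CuPy is needed.
    """
    return [
        sys.executable, "-m", "spacy", "train",
        str(config_path),
        "--output", str(output_dir),
        "--gpu-id", str(gpu_id),
    ]


def run_training(config_path, output_dir):
    """Run CPU-only training; raises CalledProcessError if spaCy exits non-zero."""
    subprocess.run(build_train_command(config_path, output_dir), check=True)


# Hypothetical usage (paths are placeholders):
# run_training("config.cfg", "./output")
```

This only sidesteps the CUDA dependency. It does not use the Trainium accelerators, so it may be slower than a GPU instance.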