identify languages doesn't work with Telugu in v0.9 #57
Comments
@Shubhamjain27 Will you be able to take this up?
Please help me. I would like to train the Telugu model. I can see the language model file in NLP for Telugu, but where is the separate model located?
@goru001 If no one is working on this, I can take it up. We can use pycld2 or pycld3, which identify all the supported languages except Oriya, Bengali, and Sanskrit. I have used the same in my own projects, and it is also used by polyglot's language detection. What do you think?
@lordzuko That'll be great! Feel free to raise a PR for this.
@goru001 Can I take this issue up if it is still unresolved?
@nitkannen Yes, sure. This is still unresolved, and it'll be great if you can contribute!
Sure @goru001
@goru001 Can you give me some guidance on where to start retraining the Telugu model? Any notebooks or scripts used for other languages, along with the data, would be really helpful.
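For reference, the pycld2 route suggested in the comments above might look roughly like the sketch below. This is only an illustration, not this project's implementation: `identify_language` and `_cld2_detect` are hypothetical names, and the sketch assumes that `pycld2.detect(text)` returns an `(is_reliable, bytes_found, details)` tuple whose `details` entries are `(name, code, percent, score)`. The detector is injectable so the wrapper can be exercised without pycld2 installed.

```python
from typing import Callable, Optional


def _cld2_detect(text: str):
    """Call pycld2.detect; imported lazily so this module loads without pycld2."""
    import pycld2  # third-party dependency (assumed installed: pip install pycld2)
    return pycld2.detect(text)


def identify_language(text: str, detect: Callable = _cld2_detect) -> Optional[str]:
    """Return the top language code (e.g. "te" for Telugu), or None if unsure.

    Hypothetical helper: `detect` must return a tuple shaped like
    (is_reliable, bytes_found, details), with details[0] being
    (name, code, percent, score) for the best guess.
    """
    is_reliable, _bytes_found, details = detect(text)
    if not is_reliable or not details:
        return None
    _name, code, _percent, _score = details[0]
    # "un" is the code reported when the language is unknown.
    return None if code == "un" else code
```

With pycld2 installed, calling `identify_language(some_text)` uses the default detector. As noted in the comment above, Oriya, Bengali, and Sanskrit would still need separate handling under this approach.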
The identify languages function, which uses a separate model to identify languages, hasn't been retrained on Telugu in v0.9.
It needs to be retrained to support Telugu.