Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

identify languages doesn't work with Telugu in v0.9 #57

Open
goru001 opened this issue Oct 11, 2020 · 8 comments
Open

identify languages doesn't work with Telugu in v0.9 #57

goru001 opened this issue Oct 11, 2020 · 8 comments
Labels
bug Something isn't working help wanted Extra attention is needed

Comments

@goru001
Copy link
Owner

goru001 commented Oct 11, 2020

identify languages function which uses separate model for identifying the languages hasn't been retrained on Telugu in v0.9.
Need to retrain it to support Telugu.

@goru001
Copy link
Owner Author

goru001 commented Oct 11, 2020

@Shubhamjain27 Will you be able to take this up?

@goru001 goru001 added bug Something isn't working help wanted Extra attention is needed labels Oct 11, 2020
@chaitusvk
Copy link

Please help me i will train Telugu model .. I can see Language model file in NLP for Telugu ...where is seperate model located
I am Telugu Speaking Guy..

@lordzuko
Copy link

@goru001 If someone isn't working on this, I can take this up. We can use pycld2, pycld3 , it identifies all the supported language except: oriya, bengali and sanskrit.

I have used the same in my own projects and it's also used by polyglot's language detection.
https://github.com/aboSamoor/polyglot/blob/d0d2aa8/polyglot/detect/base.py#L72

What do you think ?

@goru001
Copy link
Owner Author

goru001 commented Nov 30, 2020

@lordzuko That'll be great! Feel free to raise a PR for this.

@nitkannen
Copy link

@goru001 can I take this issue up if it is still unresolved?

@goru001
Copy link
Owner Author

goru001 commented Aug 24, 2021

@nitkannen Yes sure, this is still unresolved and it'll be great if you can contribute!

@nitkannen
Copy link

Sure @goru001

@nitkannen
Copy link

@goru001 can you give me some guidance as to from where I can start to retrain the Telugu model. Any notebooks or scripts used for other languages and data can be really helpful

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

4 participants