Skip to content

Releases: goru001/inltk

Code-Mixed Languages support

11 Oct 17:16
Compare
Choose a tag to compare
  • Adds support for Hinglish, Tanglish, Manglish in Latin script
  • Adds support for Telugu

Add English support

18 Jan 08:36
c39d9ba
Compare
Choose a tag to compare
  1. English support has been added
  2. Runtime performance of get_similar_sentences has been added

Get Similar Sentences - Augment and Multiply your data

14 Dec 11:30
7c55095
Compare
Choose a tag to compare

This contains addition of new feature - get_similar_sentences - with which you can augment and multiply your data in supported languages

Sentence encodings + Sentence similarity

10 Nov 18:03
Compare
Choose a tag to compare

New Features:

  • You can now get 400 dimensional encoding for sentences using get_sentence_encoding - supported for all languages in iNLTK
  • You can now get similarity score (cosine similarity) between 2 sentences using get_sentence_similarity - supported for all languages in iNLTK.

New Model:

  • The above features will not work for punjabi language with the old model. Please execute the following code-snippet before using them
from inltk.inltk import reset_models

>> reset_models('pa')
>> setup('pa')

Urdu support + Windows support

26 May 09:03
Compare
Choose a tag to compare

Added Urdu support to iNLTK - thanks to @anuragshas contributions
Added Windows 10 support - thanks to @ibrahiminfinite contributions

Make embeddings available

17 May 17:11
Compare
Choose a tag to compare

Added get_embedding_vectors function to allow users to get embedding vectors for their words/sentences/documents

Add Tamil support

17 May 02:17
Compare
Choose a tag to compare

Added tamil support