Skip to content

Character based n-gram models for detecting different languages.

Notifications You must be signed in to change notification settings

dodoels/n_gram_language_detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

n_gram_language_detection

To train unigram model with English corpus

python -m main train -n 1 -l en

To train unigram model with French corpus

python -m main train -n 1 -l fr

To train unigram model with German corpus

python -m main train -n 1 -l de

To train bigram model with English corpus

python -m main train -n 2 -l en

To train bigram model with French corpus

python -m main train -n 2 -l fr

To train bigram model with German corpus

python -m main train -n 2 -l de

To detect the language used as sentences with unigram

python -m main predict -n 1

To detect the language used as sentences with bigram

python -m main predict -n 2

About

Character based n-gram models for detecting different languages.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages