Sinhala News Analysis Using Text Mining and Machine Learning

This study analysed Sinhala news reports published online, using text mining and machine learning techniques to extract important features. The extracted information is then presented in a form that makes it easier for readers to follow the news or to research reports published in the past.

As a contribution to future research on Sinhala NLP, most of the resources developed under this project, such as code snippets, datasets and other lexical resources, are publicly available in this repository.

The study was presented at the Ruhuna International Science and Technology Conference (RISTCON 2018) on 15th February 2018.

Keywords: Sinhala language, Natural language processing, Sinhala NLP, Feature selection, Text classification, Text clustering

Folders and files:
data-scrappers
sinhala-document-classifier
sinhala-document-clustering
sinhala-news-corpus
sinhala-plagiarism-checker
sinhala-stemmer
sinhala-text-preprocessing
README.md
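
The overview mentions extracting important features from Sinhala news text. As a rough illustration of what that step can look like, the sketch below tokenizes Sinhala text and ranks terms by TF-IDF weight using only the Python standard library. It is not taken from this repository: the function names (`tokenize`, `tf_idf`, `top_terms`), the sample phrases and the choice of TF-IDF are all assumptions made for this example, and the project's own preprocessing and feature selection may work differently.

```python
import math
import re
from collections import Counter

# Assumption: a token is a run of characters from the Sinhala Unicode block
# (U+0D80..U+0DFF), plus the zero-width joiner (U+200D) used inside some
# conjunct letters. Latin text, digits and punctuation are dropped.
TOKEN_RE = re.compile(r"[\u0D80-\u0DFF\u200D]+")


def tokenize(text):
    """Split text into Sinhala word tokens."""
    return TOKEN_RE.findall(text)


def tf_idf(documents):
    """Return one {term: weight} dict per document.

    weight = (term count / document length) * log(N / document frequency)
    """
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)

    # Number of documents in which each term appears at least once.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # avoid division by zero for empty documents
        weights.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


def top_terms(doc_weights, k=3):
    """Return the k highest-weighted terms, ties broken alphabetically."""
    ranked = sorted(doc_weights.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


# Invented two-word sample documents, for illustration only.
sample_docs = ["ලංකා පුවත්", "ලංකා රජය"]
weights = tf_idf(sample_docs)
```

A term that occurs in every document gets weight zero under this formula, so in the sample only the word unique to each document is ranked as a distinguishing feature. Real news text would normally also need stop-word removal and stemming before weighting.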

rksk/sinhala-news-analysis