This project uses classifiers from the scikit-learn library to classify articles based on their content. It also uses the wordcloud library to create an image for each different category of the articles also based on the content.
To install all the dependencies on your virtual environment, run:
- pip install -r requirements.txt
To run all the classifiers and the WorldCloud script with python 3, run:
- python main.py
If you are running on Windows, with the standard python library, you will need the Microsoft Visual C++ 14.0
, for more information click here.
Since the above dependency requires many GBs to be installed, you can skip it and let the project run without the WordCloud (it automatically detects the missing dependency and skips running the WordCloud).