Telegram Data Clustering contest solution by Mindful Squirrel

evengard88/tgcontest

 
 

TGNews

Demo

Install

Prerequisites: CMake, Boost

$ sudo apt-get install cmake libboost-all-dev build-essential

If you received the code as a zip archive, skip ahead to building the binary.

To download the code and models:

$ git clone https://github.com/IlyaGusev/tgcontest
$ cd tgcontest
$ git submodule init
$ git submodule update
$ bash download_models.sh

To build the binary (from the "tgcontest" dir):

$ mkdir build && cd build && cmake -DCMAKE_BUILD_TYPE=Release ..
$ make

To download the datasets (from the "tgcontest" dir, not "build"):

$ bash download_data.sh

To run on a sample (from the "tgcontest" dir):

$ ./build/tgnews top data --ndocs 10000

Training

Models

Data

Markup

Misc

Links

TODO:

  • Framework for complex NN
  • Proper clustering markup
  • Error analysis for categories classifiers
  • Alternatives for PageRank
  • "Ugly" titles
