Scripts for processing data from arXiv.org, analyzing text data and building topic models for research project.