This is a simple vector space recommender system based on tf-idf that retrieves the most recent AI/machine learning papers from the arXiv website and sorts them by their similarity to the papers that you've liked. Project inspired by Andrej Karpathy's arxiv-sanity-lite and this paper.
The profile vectors are computed from the PDFs of the papers rather than just their abstracts. The background scheduler sends requests to the arXiv API every 24 hours, bacause the review process operates in daily cycles. Users' profile vectors are updated with a fixed constant when a new paper is liked. The website is powered by Flask, and uses scikit-learn for td-idf computations.
- Email newsletter
- Summarizing papers with generative AI
- Finding similar articles
- Reading lists