- Brooklyn, NY
- https://dnpdata.com
Stars
Eligibility screener for Good Cause Eviction protections in NYC
Follow the cryptocurrency industry’s influence on 2024 elections in the United States.
https://dl.acm.org/doi/10.1145/3657281
Dockerized cluster architecture for OpenSearch with compose.
🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.
This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
Always know what to expect from your data.
Docker image with Jupyter, Pytorch and CUDA GPUs supports.
An API Client package to access the APIs for NBA.com
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Linux using TensorRT-LLM
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
An Open Source YouTube app for privacy
Think fearlessly with end-to-end encrypted notes and files. For issues, visit https://standardnotes.com/forum or https://standardnotes.com/help.
A (PyTorch) imbalanced dataset sampler for oversampling low frequent classes and undersampling high frequent ones.
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
A data platform for criminal justice reform [public mirror]
🚀 Extremely fast fuzzy matcher & spelling checker in Python!
A biased barometer for gauging the relative speed of some regex engines on a curated set of tasks.
Fuzzy matching and more functionality for spaCy.
Named Entity Recognition (NER) Annotation tool for SpaCy. Generates Traning Data as a JSON which can be readily used.