-
Baps Patil
- Northern Hemisphere, Earth
- https://www.bapspatil.com
- @baps_patil
- in/bapspatil
Highlights
A.I.
Drag & drop UI to build your customized LLM flow
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer
real time face swap and one-click video deepfake with only a single image
Automate Creation of YouTube Shorts using MoviePy.
A Community-Driven Mapping of AI Development Tools
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.
Make websites accessible for AI agents
Riona 🌸 is built using Node.js and TypeScript 🛠️, designed for seamless job execution 📸. It's lightweight, efficient, and still evolving 🚧—exciting new features coming soon! 🌟
Run local LLMs like llama, deepseek-distill, kokoro and more inside your browser
This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.
Examples and guides for using the OpenAI API
Agno is a lightweight library for building multi-modal Agents
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Gradio WebUI for audio processing, powered by Whisper (OpenAI-Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer(RVC), zero-shot Voice Cloning (E2, F5-TTS, CosyVoice), YouTube do…
deep seek & o1 auto coders which write python code from a simple description and iteratively improvesit and fix errors
On-device Diffusion Models for Apple Silicon
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
A powerful coding assistant application that integrates with the DeepSeek API to process user conversations and generate structured JSON responses. Through an intuitive command-line interface, it c…
A GUI Agent application based on UI-TARS(Vision-Lanuage Model) that allows you to control your computer using natural language.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
A free + OSS logo generator powered by Flux on Together AI
A react-based starter app for using the Multimodal Live API over websockets with Gemini
Lightpanda: the headless browser designed for AI and automation
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
Enable AI models for video production in the browser