Speech-To-Phrase
A fast and local speech-to-text system that is personalized with your Home Assistant device and area names.
piper
Piper(https://github.com/rhasspy/wyoming-piper) is a fast, local neural text to speech system that sounds great and is optimized for the Raspberry Pi 4.
Paperless-AI
An automated document analyzer for Paperless-ngx using OpenAI API and Ollama (Mistral, llama, phi 3, gemma 2) to automatically analyze and tag your documents.
Crawl4AI
Open-source web crawler and scraper tailored for LLMs, AI agents, and data pipelines.
Qdrant
Qdrant (read: quadrant) is a vector similarity search engine and vector database.
AnythingLLM
The all-in-one AI app for any LLM with full RAG and AI Agent capabilities.
Flowise
Open source low-code tool for developers to build customized LLM orchestration flow and AI agents.
lobe-chat
LobeChat is an open-source, extensible (Function Calling) high-performance chatbot framework.
MCP-Searxng
An MCP server implementation that integrates the SearXNG API, providing web search capabilities.
ParkPow-Plate-Recognizer---Snapshot
Read a license plate from a vehicle picture, powered by ParkPow.
faster-whisper
Faster-whisper(https://github.com/SYSTRAN/faster-whisper) is a reimplementation of OpenAI's Whisper model using CTranslate2, which is a fast inference engine for Transformer models.
Lingarr
Lingarr is an application that leverages translation technologies to automatically translate subtitle files to your desired target language.
Paperless-GPT
paperless-gpt seamlessly pairs with paperless-ngx to generate AI-powered document titles and tags, saving you hours of manual sorting.
CodeProject.AI_ServerGPU
Fast, free, self-hosted Artificial Intelligence Server for any platform, any language.
CodeProject.AI_Server
Fast, free, self-hosted Artificial Intelligence Server for any platform, any language.
ebook2audiobook
CPU/GPU Converter from eBooks to audiobooks with chapters and metadata using Calibre, ffmpeg, XTTSv2, Fairseq and more.
ComfyUI-Nvidia-Docker
ComfyUI WebUI Dockerfile with Nvidia support, installing ComfyUI from GitHub.
koboldcpp
KoboldCpp is a lightweight but powerful AI backend, bundled with KoboldAI Lite frontend.
Whishper
Transcribe any audio to text, translate and edit subtitles completely locally with a web UI.
Refact
Refact WebUI for fine-tuning and self-hosting of code models, that you can later use inside Refact plugins for code completion and chat.
AUTOMATIC1111-Stable-Diffusion-Web-UI
A web interface for Stable Diffusion Integrates with Open WebUI: https://docs.openwebui.com/tutorial/images/#configuring-open-webui Add custom models: https://github.com/AbdBarho/stable-diffusion-webui-docker/wiki/Usage#custom-models
ParkPow-Plate-Recognizer---Stream---GPU
Read a license plate from a live video stream, powered by ParkPow.
ParkPow-Plate-Recognizer---Stream
Read a license plate from a live video stream, powered by ParkPow.
ebook2audiobook---Legacy
This is a legacy version of ebook2audiobook.
stable-diffusion
A big thank you to Holaf for this compiled version of Stable Diffusion which allows you to easily benefit from the interface of your choice and fully enjoy the power of this artificial intelligence.
DOODS
DOODS (Dedicated Open Object Detection Service) is a REST service that detects objects in images or video streams.
ParkPow-Plate-Recognizer---Snapshot---GPU
Read a license plate from a vehicle picture, powered by ParkPow.
Frigate-Plate-Recognizer
Identify license plates via Plate Recognizer or CodeProject.AI and add them as sublabels to Frigate.
LocalDeepResearch
Local Deep Research (LDR) is an AI-powered research assistant that performs systematic research by breaking down complex questions, searching multiple sources in parallel, verifying information across sources, and creating comprehensive reports with proper citations.
stable-diffusion
GPU-ready Dockerfile to run the Stability.AI stable-diffusion model with a simple web interface
birdnet-go
BirdNET-Go is an AI solution for continuous avian monitoring and identification 24/7 realtime bird song analysis of soundcard capture, analysis output to log file, SQLite or MySQL Utilizes BirdNET AI model trained with more than 6500 bird species Local processing, Internet connectivity not required Easy to use Web user…
UglyFeed
Retrieve, aggregate, filter, evaluate, rewrite and serve RSS feeds using Large Language Models for fun, research and learning purposes
OpenAIWebUI
OpenAI API-compatible WebUI. Requires valid API keys for the providers enabled (see list in the selection). Supports OLLAMA_HOST for self-hosted models. Model capabilities depend on the model, but a default for each will be used. The list of recognized models for each provider is available in https://github.com/Infotre…
Txtify
An open-source web application that transcribes and translates audio from YouTube videos or uploaded media files.
youtube-transcript-to-article
YouTube Transcript to Article YouTube Transcript to Article is a Docker-based Python project that provides an API for converting YouTube transcripts into professional articles using OpenAI's ChatGPT.
YuE-GP
YuE AI Music Generation for the GPU Poor (by deepmeepbeep) Our model's name is YuE (乐).
Whisper-CPP-Server
Whisper-CPP-Server is a high-performance speech recognition service written in C++, designed to provide developers and enterprises with a reliable and efficient speech-to-text inference engine.
Whisper-API-Server
A drop-in replacement for the OpenAI's Whisper API using the same API but running locally.
ParkPow-Shipping-Container-Recognizer---Stream
Read a shipping container number from a live stream, powered by ParkPow.
ParkPow-People-Tracker
PeopleTracker is a detection software that processes live camera or pre-recorded video feeds rapidly and effectively.
ParkPow-Shipping-Container-Recognizer---Snapshot
Read a shipping container number from a picture, powered by ParkPow.
Docling-Serve
What is Docling? Docling is an open-source toolkit (from IBM Research) that converts documents (PDF, DOCX, images, HTML, etc.) into structured Markdown or JSON.
WhisperLive---GPU
A real-time transcription application that uses the OpenAI Whisper model to convert speech input into text output.

