Tagged "showcase"
- Velr: Embedded Property-Graph Database for Local LLM Applications
- Self-Hostable AI Agents and Internal Software Framework Released
- Running a Private AI Brain on Windows PC as Alternative to Cloud Services
- Claude Usage Monitor: Track API Usage with macOS Menu Bar App
- Powerful AI Search Engine Built on Single GeForce RTX 5090
- Developer Builds Fully Local Multi-Agent System Using vLLM and Parallel Inference
- ik_llama.cpp Fork Delivers 26x Faster Prompt Processing on Qwen 3.5 27B
- Careless Whisper – Personal Local Speech to Text
- Brezn – Decentralized Local Communication
- AI Playground for Developers Built in Vite and Python
- Running an AI Agent on a 448KB RAM Microcontroller
- MacinAI Local brings functional LLM inference to classic Macintosh hardware
- Atuin v18.13 – Better Search, a PTY Proxy, and AI for Your Shell
- SwarmHawk – Open-Source CLI for Vulnerability Scanning with AI Synthesis
- Ultra-Compact 28M Parameter Models Show Promise for Specialized Domain Tasks
- Qwen 3.5 Emerges as Top Performer for Local Deployment with Extensive Quantization Options
- Llamafile 0.10 Released with GPU Support and Rebuilt Core
- Claude Code Permissions Hook – Delegate Permission Approval to LLM
- Meet Sarvam Edge: India's AI Model That Runs on Phones and Laptops With No Internet
- Kilo Is the VS Code Extension That Actually Works With Every Local LLM I Throw At It
- Unsloth Studio: Open-Source Web UI for Training and Running LLMs Locally
- Skills Manager – manage AI agent skills across Claude, Cursor, Copilot
- LucidShark – Local-first, open-source quality and security gate
- Custom GPU Multiplexer Achieves 0.3ms Model Switching on Legacy Hardware
- Auto-retry Claude Code on subscription rate limits (zero deps, tmux-based)
- Browser-Based Transcription Tools
- Show HN: Process Mining for AI Agent Systems
- OpenJarvis: Local-First AI Agents That Run Entirely On-Device
- Mistral Small 4 119B Released with NVFP4 Quantisation Support
- Local Qwen Models Master Browser Automation Through Iterative Replanning
- KAIST Develops World's First Hyper-Personalized On-Device AI Chip
- OpenClaw Isn't the Only Raspberry Pi AI Tool—Here Are 4 Others You Can Try This Week
- Qwen 3.5 122B Demonstrates Exceptional Reasoning for Local Deployment
- OmniCoder-9B: Efficient Coding Model for 8GB GPUs
- Show HN: Merrilin.ai – Code Blocks in Your Books, Finally
- LoKI – Local AI Assistant for Linux and WSL
- This External GPU Enclosure Tries to Break Cloud Dependence for Local AI Inference
- Dictare – Open-source Voice Layer for AI Coding Agents (100% Local)
- Show HN: Generate, Clean, and Prepare LLM Training Data, All-in-One
- Custom AI Smart Speaker
- VS Code Agent Kanban – Task Management for AI-Assisted Development
- Nota AI to Showcase End-to-End On-Device AI Optimization at Embedded World 2026
- Nemotron 9B Powers Large-Scale Local Inference: Patent Classification and Real-Time Applications
- Gyro-Claw – Secure Execution Runtime for AI Agents
- Engram – Open-Source Persistent Memory for AI Agents
- commitgen-cc – Generate Conventional Commit Messages Locally with Ollama
- VoiceShelf: Fully Offline Android Audiobook Reader Using Kokoro TTS
- IBM Granite 4.0 1B Speech Model Released for Multilingual Speech Recognition
- Qwen3.5 122B Achieves 25 tok/s on 72GB VRAM Setup
- Researchers Develop Persistent Memory System for Local LLMs—No RAG Required
- Show HN: Anonymize LLM traffic to dodge API fingerprinting and rate-limiting
- Agent System – 7 specialized AI agents that plan, build, verify, and ship code
- VaultAI – 42 AI Models on a Portable SSD, Works Offline for $399
- TemplateFlow – Build AI Workflows, Not Prompts
- SanityBoard Adds 27 New Model Evaluations Including Qwen 3.5 Plus, GLM 5, and Gemini 3.1 Pro
- Qwen3 Coder Next 8FP Demonstrates Exceptional Long-Context Performance on 128GB System
- I Stopped Paying for ChatGPT and Built a Private AI Setup That Anyone Can Run
- Using Local LLMs With Self-Hosted Tools to Manage Documents in Paperless-ngx
- Kitten TTS V0.8 Released: New State-of-the-Art Super-Tiny TTS Model Under 25 MB
- Show HN: Forked – A Local Time-Travel Debugger for OpenClaw Agents
- Self-Hosted Local LLMs for Document Management with Paperless-ngx
- GPT-OSS 20B Now Runs 100% Locally in Browser via WebGPU
- GNOME's AI Assistant Newelle Adds llama.cpp Support and Command Execution
- Ring-1T-2.5 Released with SOTA Deep Thinking Performance
- Godot MCP Gives AI Assistants Full Access to Game Engine Editor
- Developer Creates Custom Local AI Headshot Generator After Commercial Solutions Fail