Tagged "showcase"

Supply Chain DLP: Stop Leaked .env Files, Credentials, SSH Keys, and API Tokens 2 June 2026
Meet Memory OS: A 6-Layer Open-Source Memory Stack Built on Hermes Agent 2 June 2026
MDMA – Turn LLM Responses into Interactive UI via MCP 2 June 2026
JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks 2 June 2026
Proveyouragent: Cryptographic Identity for AI Agents (Ed25519 and DPoP) 1 June 2026
Show HN: seed – Self-Modifying Webpage with On-Device LLM 31 May 2026
Show HN: Egress WAF to Limit AI Agents and NPM Malware Based on mitmproxy 31 May 2026
Slow Journal App with AI Integration 30 May 2026
Rsync 3.4.3 Features Hundreds of Claude Commits 30 May 2026
Rewriting CRIU in Zig using LLM 30 May 2026
MediaTek Dimensity 7500 Brings On-Device AI and Enhanced Power Efficiency to Mid-Range Phones 30 May 2026
Show HN: AI-org – Org-mode Powered by AI 30 May 2026
The Windows Device Manager, on Linux 29 May 2026
Tiny microphone on my balcony to listen for any birds passing by 29 May 2026
Privacy-Focused Raspberry Pi Zero 2W DIY Security Camera with On-Device AI and End-to-End Encryption 28 May 2026
Money Printer Pro – Open-source AI Content Generator 28 May 2026
OpenBMB Runs Local Agents with MiniCPM5-1B – Efficient LLM for Edge Deployment 27 May 2026
Anker Soundcore Liberty 5 Pro Earbuds Feature Dedicated On-Device AI Chip with Touch Screen 26 May 2026
Maker Demonstrates Portable AI with Suitcase-Integrated Jetson Orin Setup 25 May 2026
Show HN: I Built a Debugging Challenge for the AI Coding Age 25 May 2026
AgentSlice – Make AI Coding Agents Ask Before They Edit 25 May 2026
MCP Servers Transform Local LLM Stack, Replacing $249 Paid Tools 24 May 2026
Redditor Successfully Runs 1 Trillion Parameter LLM Using Cheap Intel Optane DIMMs 24 May 2026
llama.cpp Checkpoint Fix Accelerates Local Coding Agents 22 May 2026
Show HN: Interactive and Stylized AI Chat Chrome Extension 22 May 2026
Hardware LLM Taalas Reaches >14,000 TPS on Llama 3.1 8B 21 May 2026
Adobe Photoshop Update Brings On-Device AI Processing 21 May 2026
Occupy Wall Street Co-Founder Builds Offline-Running AI Organizing Mentor 20 May 2026
Google and Synaptics Partner on Coralboard for Immersive Edge AI Experiences 20 May 2026
Open Source Local Audio Stem Separation Tool Released 19 May 2026
LLM Wiki App Chunker: Transform Documents Into Navigable Knowledge Trees 19 May 2026
eXo MCP Server Enables Secure AI Agent Access to Workplace Tools 19 May 2026
Running Large Language Models on Single-Board Computer Clusters: Creative Edge Deployment 18 May 2026
Ansede-static: Offline SAST Tool Demonstrates Value of Local AI Tools 18 May 2026
Local LLMs Enable Intelligent Smart Camera Control Without Cloud Dependency 18 May 2026
MegaTrain: Full Precision Training of 100B+ Parameter LLMs on a Single GPU 17 May 2026
Local LLM Takes Control of Video Doorbell—The Future of Smart Cameras 17 May 2026
Maker Builds Offline Jetson-Powered Chatbot Suitcase 17 May 2026
Offline Voice-to-Text and AI Keyboard App for Local Processing 16 May 2026
Local LLM Integration Enables Replacement of Paid Subscription Services 16 May 2026
Apple's M5 MacBook Air Advances On-Device AI with Redesigned Hardware 16 May 2026
AI/ML Benchmark Tool for Local LLM Inference and XGBoost Training 16 May 2026
Show HN: Find the best local LLM for your hardware, ranked by benchmarks 15 May 2026
Local LLM Persistent Context Prevents Repetitive Mistakes 14 May 2026
Avocado Studio: Open-Source AI Content Editor for Next.js Sites 14 May 2026
Legacy System Analysis with AI Reveals Modern Architecture Under the Hood 14 May 2026
Tsjilp – AI as a Silent Communication Assistant 13 May 2026
Running a Local LLM on a 12-Year-Old Raspberry Pi 13 May 2026
Mainline Linux 6.12 on Annapurna Labs Alpine V2 (Ubiquiti UNVR, UDM-Pro) 13 May 2026
Lucebox Brings Faster Local AI Inference to AMD Strix Halo 13 May 2026
Before Upload – Check Files Locally Before Sending to AI Tools 13 May 2026
Privatemode.ai – AI Provider with Confidential Computing 12 May 2026
MDL: Endless Visual Novel Engine Powered by AI 11 May 2026
Lython: Experimental Python Compiler Toolchain Based on LLVM 11 May 2026
DFlash Speculative Decoding Delivers 8.5x Speed Improvement for LLM Inference 11 May 2026
Cotypist – AI Autocomplete for Mac 11 May 2026
I Built My Second Brain for Meetings. No Monthly Subscription 11 May 2026
Mlx-serve: Run LLMs Natively on Your Mac 10 May 2026
DistillFast: AI Cost Optimization Tool for Model Efficiency 10 May 2026
How I Used a Local LLM to Organize the Store on My NAS 9 May 2026
Dikaletus: Open-Source Meeting Recording and Transcription Using Mistral AI 9 May 2026
Perplexity Brings On-Device AI Workflow to Macs with 'Personal Computer' Feature 8 May 2026
Show HN: A Local-First Agentic Knowledge Manager 8 May 2026
Running Espressif's OpenClaw-Inspired AI Agent on ESP32 with Self-Hosted LLM Works in Practice 8 May 2026
Show HN: Runs AI Coding Agents Inside Isolated Docker Containers 8 May 2026
Airplane AI – Local NDA Safe AI Powered by Gemma 8 May 2026
0ctx – Local-First Project Memory for AI Workflows 8 May 2026
Show HN: Desktop Agent Center – Local AI Automation via Hotkeys 7 May 2026
Zed Editor Integrates AI Features with Local Deployment Focus 6 May 2026
I Replaced ChatGPT and Claude With This Powerful Local LLM and Saved Over $20 a Month While Gaining Full Control 5 May 2026
A 49-Line Physics Classifier That Beats kNN on 76% of Benchmarks 5 May 2026
Show HN: Memex, Claude Memory via Local RAG with MCP and Offline Embeddings 5 May 2026
Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop 5 May 2026
Show HN: Claude Relay – Local Claude Code Sessions Message Each Other 5 May 2026
Ruflo: Multi-Agent AI Orchestration for Claude Code 4 May 2026
NordVPN Adds On-Device AI Voice Detector to Chrome Extension to Identify Synthetic Audio 4 May 2026
Gemma 4 Just Replaced My Whole Local LLM Stack 4 May 2026
Eval Skills for AI Agents 4 May 2026
Daintree: A Delegation Environment for Orchestrating AI Coding Agents 4 May 2026
Building a Jira Alternative with Claude in 8 Days 4 May 2026
Control AI Risk with Pre-Built Frameworks and Ready-to-Run Evaluations 4 May 2026
I Put a Local LLM on My Phone and Stopped Needing Cloud AI for Most Tasks 3 May 2026
Show HN: Kit – Editor, Browser, Terminal, Mail with AI Agents Sharing Context 3 May 2026
Home Assistant's Local LLM Support Outperforms Gemini for Home, and Google Knows It 3 May 2026
Show HN: Enoch – Control Plane for Autonomous AI Research 3 May 2026
Show HN: Filling PDF Forms with AI Using Client-Side Tool Calling 2 May 2026
Building a Raspberry Pi-Based Local LLM Server for Remote Access 1 May 2026
New Open-Source Tool Automatically Matches Local LLMs to Your PC Hardware 1 May 2026
Home Assistant's Local LLM Support Outperforms Gemini for Home Automation 1 May 2026
Running Capable Local LLMs Without Expensive GPU Hardware 30 April 2026
Building a Remote-Accessible Local LLM Server on Raspberry Pi 30 April 2026
Show HN: Arkloop – Open-Source, Local-First Agent Client 30 April 2026
Stop Guessing: Open-Source Tool Predicts Which Local LLMs Run on Your PC 28 April 2026
Show HN: Minimal Linux Sandboxes to Manage AI-Generated Code with Ease 28 April 2026
Hipfire: A Rust-Native AMD Inference Engine That Outperforms llama.cpp 28 April 2026
Unsloth's Custom Kernels Make LLM Fine-Tuning Viable on Consumer GPUs 27 April 2026
Pocket LLM v1.5.0 Brings Multimodal AI to Android with No Cloud Required 27 April 2026
The New Linux Kernel AI Bot Uncovering Bugs Is A Local LLM On Framework Desktop + AMD Ryzen AI Max 27 April 2026
Singapore's Foreign Minister Builds an AI "Second Brain" Using NanoClaw 26 April 2026
Pluggable's TBT5-AI: First Thunderbolt Dock Explicitly Targeting Local LLM Workstations 26 April 2026
Show HN: Phonetic Formatter – Offline English Text to IPA on iPhone and iPad 26 April 2026
SiGit Code: Local-First Coding Agent 25 April 2026
Rust Open-Source Headless Browser for AI Agents and Web Scraping 25 April 2026
Run a Local LLM Server on Raspberry Pi with Remote Access Capabilities 25 April 2026
Show HN: A Karpathy-Style LLM Wiki Your Agents Maintain 25 April 2026
GPU Passthrough to LXCs in Proxmox Outperforms VMs and Simplifies Local AI Infrastructure 25 April 2026
I Built a Local AI Stack With 5 Docker Containers, and Now I'll Never Pay for ChatGPT Again 24 April 2026
Building Real-World On-Device AI with LiteRT and NPU 24 April 2026
AI Agent Designs a RISC-V CPU Core from Scratch 24 April 2026
Show HN: We built an OCR server that can process 270 dense images/s on a 5090 23 April 2026
Cortex Auth – Rust secrets vault for AI agents (exec-based injection) 23 April 2026
Tesseron: New API Framework for AI Agents with Developer-Defined Configuration 22 April 2026
Sarvam Edge: India's Offline AI Model Runs on Phones and Laptops Without Internet 22 April 2026
Developer Turns Phone Into Local LLM Server with Vision, Voice, and Tool Calling Capabilities 22 April 2026
Cursor-Autoresearch: AI Research Automation Port for Local Workflows 22 April 2026
ZeusHammer: Built an AI Agent That Thinks Locally 20 April 2026
Complete Local Coding Assistant Stack Running Inside Your Editor 20 April 2026
Waterloo's Live AI-Goose Tracker: Real-Time Edge Vision 19 April 2026
PCMind: Local AI Analysis of Docs, Audio, Video and Images 19 April 2026
Memjar: Uncompromising Local-First Second Brain 19 April 2026
Llama.cpp Robot Wars 19 April 2026
Kilo is the VS Code Extension That Actually Works with Every Local LLM 19 April 2026
Show HN: I Can't Write Python. It Works Anyway – Local LLM Automation 18 April 2026
115 TOPS in 0.67L: CHUWI AuBox X Packs On-Device AI Power Into a Palm-Sized Mini PC 18 April 2026
Build a More Secure, Always-On Local AI Agent with OpenClaw and NVIDIA NemoClaw 18 April 2026
BibCrit – LLM Grounded in ETCBC Corpus Data for Biblical Textual Criticism 18 April 2026
Kilo Is the VS Code Extension That Actually Works With Every Local LLM I Throw at It 17 April 2026
After Two Months of Open WebUI Updates, I'd Pick It Over ChatGPT's Interface for Local LLMs 17 April 2026
Show HN: An MCP server that lets AI compose music on a hardware synth 17 April 2026
Community Computer: Collaborative Autoresearch on a Peer-to-Peer Network 17 April 2026
ChatMCP – Connect your AI browser chats to your coding agents 17 April 2026
Building a Voice AI Wearable in a Casio F91W with Whisper and BLE 16 April 2026
Open WebUI Emerges as Superior Interface for Local LLMs After Two Months of Active Development 16 April 2026
N8n, Dify, and Ollama Emerge as Leading Self-Hosted AI Automation Stack 16 April 2026
Book Translator: Two-Pass Local Translation with Self-Reflection via Ollama 16 April 2026
Bonsai 1.7B in the Browser: A 290MB 1-bit LLM on WebGPU 16 April 2026
Xiaomi 12 Pro Converted Into 24/7 Headless AI Server With Ollama and Gemma4 15 April 2026
Slop-scan – Detect AI Code Slop Patterns in Your Repo 15 April 2026
SigMap – Shrink AI Coding Context 97% with Auto-Scaling Token Budget 15 April 2026
Self-Hosted LLMs Transform Personal Knowledge Management Systems 15 April 2026
Noi Enables Running ChatGPT and Claude Side-by-Side on Your Desktop 15 April 2026
Running Gemma 4 on an iPhone 13 Pro 15 April 2026
GBrain – System to Make Your AI Agent Better Reflect You 15 April 2026
DotLLM – Building an LLM Inference Engine in C# 15 April 2026
DGX Spark Setup Guide: Running vLLM and PyTorch for Local LLM Inference Backend 15 April 2026
DFlash Doubles Token Generation Speed of Qwen3.5 27B on Mac M5 Max 15 April 2026
Ubiquiti UniFi G6 Turret 4K Camera Features On-Device AI Processing at $199 Price Point 14 April 2026
Talking to a Local LLM in the Firefox Sidebar 14 April 2026
Minisforum N5 MAX AI NAS Delivers 126 TOPS with 200TB Storage for Local LLM Workloads 14 April 2026
MiniMax M2.7 Achieves SOTA Performance Under 64GB on Mac with TQ Quantization 14 April 2026
Local LLM Connected to Home Assistant via MCP Now Enables Autonomous Smart Home Management 14 April 2026
Developer Shares Golden Stack for Local Coding Assistant Integration Directly Inside Code Editors 14 April 2026
Build a Sovereign Local AI Stack: Ollama and Open WebUI and Pgvector 2026 13 April 2026
Show HN: SkillCompass – Open-Source Quality Evaluator for Your AI Skills 13 April 2026
Self-Hosted LLM Took Personal Knowledge Management System to the Next Level 13 April 2026
Defender – Local Prompt Injection Detection for AI Agents 13 April 2026
Unsloth Completes Comprehensive MiniMax M2.7 GGUF Quantization Suite 12 April 2026
Universal Knowledge Store and Grounding Layer for AI Reasoning Engines 12 April 2026
Self-Hosted LLM Elevates Personal Knowledge Management Systems to New Levels 12 April 2026
MiniMax M2.7 Advances Scalable Agentic Workflows on NVIDIA Platforms for Complex AI Applications 12 April 2026
Google Gemma 4 Delivers Exceptional Speed and Accuracy for Local Inference 12 April 2026
DFlash Speculative Decoding Achieves 3.3x Speedup on Apple Silicon 12 April 2026
I Gave My AI Shell Access and Felt Uneasy – So I Sandboxed It 12 April 2026
Self-Hosted LLMs Transform Personal Knowledge Management Systems 11 April 2026
Parakeet Streaming ASR on Apple Silicon via CoreML 11 April 2026
AIYO Wisper: Local Voice-to-Text for macOS Using WhisperKit 11 April 2026
Self-Installing Skill Manager for AI Agents 11 April 2026
Tether Launches QVAC SDK for Cross-Platform Local AI Development 10 April 2026
5 Open-Source Projects Running Transformers on CPUs to GPUs in Pure Java 10 April 2026
AI Scans 400k Reddit Posts to Flag Overlooked GLP-1 Side Effects 10 April 2026
VoxCPM2: New Open-Source TTS Model with Voice Cloning and Design 9 April 2026
Speculative Decoding Made My Local LLM Actually Usable 9 April 2026
Running a 1.7B-Parameter LLM on an Apple Watch 9 April 2026
I Replaced My Local LLM With a Model Half Its Size and Got Better Results — and It Wasn't About the Parameters 9 April 2026
Gemini-CLI, Llama.cpp, and Qwen3.5 Running on NVIDIA Jetson TK1 9 April 2026
Google AI Edge Gallery Showcases Offline Inference with Gemma 4 8 April 2026
Google's Gemma 4 Brings Powerful On-Device AI to Android and iOS 8 April 2026
Show HN: Willitrun – Check if Any ML Model Runs on Any Device (Benchmark-Backed) 7 April 2026
StyleSeed – Design Rules That Make AI Coding Tools Produce Professional UI 7 April 2026
Quansloth Using Google's Turboquant Breaks the VRAM Wall for Local LLMs 7 April 2026
Octopoda: Open Source Memory Layer for Fully Offline AI Agents 7 April 2026
MemPalace, the Highest-Scoring AI Memory System Ever Benchmarked 7 April 2026
Gemma 4 26B Achieves Impressive Local Performance With Proper Configuration 7 April 2026
CricketBrain: Neuromorphic Signal Processor in Rust (0.175us/step, 944 bytes) 7 April 2026
METATRON: Open-Source AI Penetration Testing with Local LLMs 6 April 2026
Show HN: Lightweight LLM Tracing Tool with CLI 6 April 2026
HunyuanOCR 1B: High-Quality OCR Now Viable on Budget Consumer Hardware 6 April 2026
Real-time Multimodal AI on Apple Silicon: Gemma E2B Demo Shows Practical Edge Deployment 6 April 2026
Gemma 4 31B Achieves Exceptional Performance on Local Hardware 6 April 2026
Show HN: Turn Photos Into Wordle Puzzles with AI That Runs 100% in Your Browser 6 April 2026
Vektor – Local-First Associative Memory for AI Agents 5 April 2026
Unpaved: Audit Toolkit for AI Developer Tool Bias in Global South Contexts 5 April 2026
Satsgate: Monetize AI Agents and APIs with Lightning L402 Protocol 5 April 2026
Qwen 3.5 397B Reduced to 35% Parameters With Usable Quality on 96GB GPU 5 April 2026
GMKtec NucBox K17 Launches with 97 TOPS AI Performance for Local Inference 5 April 2026
Gemma 4 26B MoE Emerges as Optimal All-Around Local Model for Consumer Hardware 5 April 2026
Nex Life Logger: Local Activity Tracker with AI Agent Integration 4 April 2026
Mixed Precision Quantization on MLX with TurboQuant Implementation 4 April 2026
Kokoro TTS Achieves 20× Realtime Speed on CPU-Only On-Device Inference 4 April 2026
Free AI Video Clipper Using Scene and Speech-Based Segmentation 4 April 2026
Autonet: Decentralized AI Training with Constitutional Governance 4 April 2026
SkillCompass – Diagnose and Improve AI Agent Skills Across 6 Dimensions 3 April 2026
OpenUMA – Apple-Style Unified Memory for x86 AI Inference 3 April 2026
Gemma 4 Shows Strong Reasoning Performance with Thinking Tokens 3 April 2026
Gemma 4 2B Successfully Runs on Raspberry Pi 5 3 April 2026
Apfel – The Free AI Already on Your Mac 3 April 2026
TurboQuant Enables Qwen 3.5-27B on 16GB Consumer GPUs 2 April 2026
SmolLM2-360M Running on Samsung Galaxy Watch 4 with 74% Memory Reduction 2 April 2026
Show HN: Memsearch – Persistent, Cross-Agent, Cross-Session Memory for AI Agents 2 April 2026
TinyGPU Adds Mac Support for External Nvidia GPU Acceleration 2 April 2026
git11 Is an AI Workspace for GitHub Engineering Teams 2 April 2026
Show HN: Extra-Platforms, Python Library to Detect OS, Arch, Shell, CI, AI 2 April 2026
Satcove – Query 5 AI Models Simultaneously and Get Structured Verdicts 1 April 2026
Qwen 3.5-27B Demonstrates Superior Performance vs Gemini 3.1 Pro and GPT-5.3 1 April 2026
Claw64 – Full Agentic Loop in <4KB on Commodore 64 1 April 2026
Orca – Executable skills and capabilities for AI agent workflows 31 March 2026
I built an O(1) physics engine to stop LLM hallucinations in construction 31 March 2026
Miasma: A Tool to Protect Data from AI Web Scrapers 29 March 2026
Lat.md: Agent Lattice – A Knowledge Graph for Your Codebase in Markdown 29 March 2026
IBM Granite 4.0 3B Vision: Compact Enterprise-Grade Document AI 29 March 2026
DaVinci-MagiHuman: Open-Source AI Model for Realistic Video Generation 29 March 2026
Qwen3 512k Context via TurboQuant on Mac mini 28 March 2026
M5 Max Delivers 1.7x Faster Inference Than M3 Max on Qwen 3.5 Models 28 March 2026
Reverse-Engineering the Apollo 11 Code with AI 28 March 2026
This Wearable Runs an On-Device AI With 2-Week Battery Life 27 March 2026
This Self-Hosted Tool Makes My Local LLMs Feel Exactly Like ChatGPT, but Nothing Leaves My Network 27 March 2026
RotorQuant: 10-19x Faster Quantisation Alternative Using Clifford Algebra 27 March 2026
Coding Implementation to Run Qwen3.5 Reasoning Models Distilled With Claude-Style Thinking Using GGUF and 4-Bit Quantization 27 March 2026
mlx-Code: Run Claude Code Locally with MLX-LM 27 March 2026
Homelab Consolidation: Replacing 3 Models with Single 122B MoE Model on AMD Ryzen AI MAX+ 27 March 2026
See What Your AI Agents Are Doing: Multi-Agent Observability Tool 27 March 2026
RF-DETR Nano and YOLO26 Enable On-Device Object Detection on Smartphones 26 March 2026
NVIDIA Releases GPT-OSS-Puzzle-88B, a Deployment-Optimized Model 26 March 2026
MCP-Manticore: Let Your AI Assistant Write Manticore Queries for You 26 March 2026
Show HN: Beforeyouship – Pre-Build Tool to Estimate LLM Cost 26 March 2026
Liquid AI's LFM2-24B Achieves 50 Tokens/Second in Web Browser via WebGPU 26 March 2026
Operating Systems. One USB. ZFS on Root. AI-Powered. Free 26 March 2026
Running an Open-Weight LLM Locally on an Apple Watch 25 March 2026
Show HN: Open Agent Spec – Treat AI Agents Like Typed Functions, Not Prompt Chains 25 March 2026
OmniCoder v2 Released: Improved Code Generation for Local Deployment 25 March 2026
Private Brain LLM Setup on Windows PC Eliminates Need for Paid Cloud Services 25 March 2026
Researcher Successfully Runs Local LLMs on Legacy "Dead" GPU With Surprising Results 25 March 2026
Council: A Structured Deliberation Protocol Across Diverse AI Models 25 March 2026
Ultra-Large 400B-Class LLM Runs on iPhone in Test 25 March 2026
Velr: Embedded Property-Graph Database for Local LLM Applications 23 March 2026
Self-Hostable AI Agents and Internal Software Framework Released 23 March 2026
Running a Private AI Brain on Windows PC as Alternative to Cloud Services 23 March 2026
Claude Usage Monitor: Track API Usage with macOS Menu Bar App 23 March 2026
Powerful AI Search Engine Built on Single GeForce RTX 5090 23 March 2026
Developer Builds Fully Local Multi-Agent System Using vLLM and Parallel Inference 22 March 2026
ik_llama.cpp Fork Delivers 26x Faster Prompt Processing on Qwen 3.5 27B 22 March 2026
Careless Whisper – Personal Local Speech to Text 22 March 2026
Brezn – Decentralized Local Communication 22 March 2026
AI Playground for Developers Built in Vite and Python 22 March 2026
Running an AI Agent on a 448KB RAM Microcontroller 21 March 2026
MacinAI Local brings functional LLM inference to classic Macintosh hardware 21 March 2026
Atuin v18.13 – Better Search, a PTY Proxy, and AI for Your Shell 21 March 2026
SwarmHawk – Open-Source CLI for Vulnerability Scanning with AI Synthesis 20 March 2026
Ultra-Compact 28M Parameter Models Show Promise for Specialized Domain Tasks 20 March 2026
Qwen 3.5 Emerges as Top Performer for Local Deployment with Extensive Quantization Options 20 March 2026
Llamafile 0.10 Released with GPU Support and Rebuilt Core 20 March 2026
Claude Code Permissions Hook – Delegate Permission Approval to LLM 20 March 2026
Meet Sarvam Edge: India's AI Model That Runs on Phones and Laptops With No Internet 19 March 2026
Kilo Is the VS Code Extension That Actually Works With Every Local LLM I Throw At It 19 March 2026
Unsloth Studio: Open-Source Web UI for Training and Running LLMs Locally 18 March 2026
Skills Manager – manage AI agent skills across Claude, Cursor, Copilot 18 March 2026
LucidShark – Local-first, open-source quality and security gate 18 March 2026
Custom GPU Multiplexer Achieves 0.3ms Model Switching on Legacy Hardware 18 March 2026
Auto-retry Claude Code on subscription rate limits (zero deps, tmux-based) 18 March 2026
Browser-Based Transcription Tools 18 March 2026
Show HN: Process Mining for AI Agent Systems 18 March 2026
OpenJarvis: Local-First AI Agents That Run Entirely On-Device 17 March 2026
Mistral Small 4 119B Released with NVFP4 Quantisation Support 17 March 2026
Local Qwen Models Master Browser Automation Through Iterative Replanning 17 March 2026
KAIST Develops World's First Hyper-Personalized On-Device AI Chip 17 March 2026
OpenClaw Isn't the Only Raspberry Pi AI Tool—Here Are 4 Others You Can Try This Week 16 March 2026
Qwen 3.5 122B Demonstrates Exceptional Reasoning for Local Deployment 16 March 2026
OmniCoder-9B: Efficient Coding Model for 8GB GPUs 16 March 2026
Show HN: Merrilin.ai – Code Blocks in Your Books, Finally 16 March 2026
LoKI – Local AI Assistant for Linux and WSL 16 March 2026
This External GPU Enclosure Tries to Break Cloud Dependence for Local AI Inference 16 March 2026
Dictare – Open-source Voice Layer for AI Coding Agents (100% Local) 16 March 2026
Show HN: Generate, Clean, and Prepare LLM Training Data, All-in-One 16 March 2026
Custom AI Smart Speaker 16 March 2026
VS Code Agent Kanban – Task Management for AI-Assisted Development 9 March 2026
Nota AI to Showcase End-to-End On-Device AI Optimization at Embedded World 2026 9 March 2026
Nemotron 9B Powers Large-Scale Local Inference: Patent Classification and Real-Time Applications 9 March 2026
Gyro-Claw – Secure Execution Runtime for AI Agents 9 March 2026
Engram – Open-Source Persistent Memory for AI Agents 9 March 2026
commitgen-cc – Generate Conventional Commit Messages Locally with Ollama 9 March 2026
VoiceShelf: Fully Offline Android Audiobook Reader Using Kokoro TTS 9 March 2026
IBM Granite 4.0 1B Speech Model Released for Multilingual Speech Recognition 7 March 2026
Qwen3.5 122B Achieves 25 tok/s on 72GB VRAM Setup 26 February 2026
Researchers Develop Persistent Memory System for Local LLMs—No RAG Required 26 February 2026
Show HN: Anonymize LLM traffic to dodge API fingerprinting and rate-limiting 26 February 2026
Agent System – 7 specialized AI agents that plan, build, verify, and ship code 26 February 2026
VaultAI – 42 AI Models on a Portable SSD, Works Offline for $399 20 February 2026
TemplateFlow – Build AI Workflows, Not Prompts 20 February 2026
SanityBoard Adds 27 New Model Evaluations Including Qwen 3.5 Plus, GLM 5, and Gemini 3.1 Pro 20 February 2026
Qwen3 Coder Next FP8 Demonstrates Exceptional Long-Context Performance on 128GB System 20 February 2026
I Stopped Paying for ChatGPT and Built a Private AI Setup That Anyone Can Run 20 February 2026
Using Local LLMs With Self-Hosted Tools to Manage Documents in Paperless-ngx 20 February 2026
Kitten TTS V0.8 Released: New State-of-the-Art Super-Tiny TTS Model Under 25 MB 20 February 2026
Show HN: Forked – A Local Time-Travel Debugger for OpenClaw Agents 20 February 2026
Self-Hosted Local LLMs for Document Management with Paperless-ngx 19 February 2026
GPT-OSS 20B Now Runs 100% Locally in Browser via WebGPU 14 February 2026
GNOME's AI Assistant Newelle Adds llama.cpp Support and Command Execution 14 February 2026
Ring-1T-2.5 Released with SOTA Deep Thinking Performance 13 February 2026
Godot MCP Gives AI Assistants Full Access to Game Engine Editor 11 February 2026
Developer Creates Custom Local AI Headshot Generator After Commercial Solutions Fail 11 February 2026