Tagged "self-hosted"

Self-Hostable AI Agents and Internal Software Framework Released 23 March 2026
Running a Private AI Brain on Windows PC as Alternative to Cloud Services 23 March 2026
MiniMax M2.7 Model to Be Released as Open Weights 23 March 2026
How to Build a Self-Hosted AI Server with LM Studio: Step-by-Step Guide 23 March 2026
Powerful AI Search Engine Built on Single GeForce RTX 5090 23 March 2026
Ditching Paid AI Services: Building Self-Hosted LLM Solutions as ChatGPT, Claude, and Gemini Alternatives 22 March 2026
Setting Up a Private AI Brain on Windows: Complete Guide to Local LLM Deployment 22 March 2026
Why You Should Use Both ChatGPT and Local LLMs: A Practical Hybrid Approach 22 March 2026
Automating Read-It-Later Workflows with Local LLMs for Overnight Summarization 22 March 2026
Self-Hosted AI Code Review with Local LLMs: Secure Automation Guide 21 March 2026
Pydantic-Deep: Production Deep Agents for Pydantic AI 21 March 2026
Local AI Coding Assistant: Free Cursor Alternative with VS Code, Ollama & Continue 21 March 2026
DeepSeek R1 RTX 4090 vs Apple M3 Max: Benchmark & Performance Guide 21 March 2026
Your Site Content Is Powering AI. Your Bank Account Has No Idea 21 March 2026
Build a $1,500 AI Server with DeepSeek-R1 on RTX 4090 21 March 2026
Why Self-Hosted LLMs Make Financial and Privacy Sense Over Paid Services 20 March 2026
Claude Code Permissions Hook – Delegate Permission Approval to LLM 20 March 2026
Kilo Is the VS Code Extension That Actually Works With Every Local LLM I Throw At It 19 March 2026
Tether's QVAC Introduces Cross-Platform Bitnet LoRA Framework for On-Device AI Training 19 March 2026
I Switched to a Local LLM for These 5 Tasks and the Cloud Version Hasn't Been Worth It Since 18 March 2026
You're Using Your Local LLM Wrong If You're Prompting It Like a Cloud LLM 18 March 2026
Mistral Releases Small 4 Open-Source Model Under Apache 2.0 17 March 2026
Show HN: Merrilin.ai – Code Blocks in Your Books, Finally 16 March 2026
LoKI – Local AI Assistant for Linux and WSL 16 March 2026
Qwen3.5-397B Achieves 282 tok/s on 4x RTX PRO 6000 Blackwell Through Custom CUTLASS Kernel 15 March 2026
OpenClaw vs Eigent vs Claude Cowork: Comparing Open-Source AI Collaboration Platforms 15 March 2026
Nvidia's Nemotron 3 Super: Understanding the Significance for Local LLM Deployment 15 March 2026
Two Local Models Prove Competitive Enough to Replace ChatGPT, Gemini, and Copilot 15 March 2026
Show HN: Intake API – An Inbox for AI Coding Agents 14 March 2026
Show HN: Bots of WallStreet – Multi-Agent Debate and Prediction Framework 14 March 2026
AgentArmor: Open-Source 8-Layer Security Framework for AI Agents 14 March 2026
Runpod Report: Qwen Has Overtaken Meta's Llama As The Most-Deployed Self-Hosted LLM 13 March 2026
Linux 7.0 AMDGPU Fixing Idle Power Issue For RDNA4 GPUs After Compute Workloads 13 March 2026
Intel Updates LLM-Scaler-vLLM With Support For More Qwen3/3.5 Models 13 March 2026
Qwodel – An Open-Source Unified Pipeline for LLM Quantization 12 March 2026
Nvidia Releases Nemotron 3 Super: 120B MoE Model for Local Deployment 12 March 2026
MeepaChat – Slack for AI Agents (iOS, macOS, Web / Cloud, Self-Hosted) 12 March 2026
Apple M5 Max 128GB Benchmark Results for Local LLM Inference 12 March 2026
Show HN: Detect When an LLM Silently Changes Behavior for the Same Prompt 12 March 2026
Ex-Manus Backend Lead Shares: Moving Beyond Function Calling in Agent Design 12 March 2026
LMF – LLM Markup Format 11 March 2026
A Kubernetes Operator That Orchestrates AI Coding Agents 11 March 2026
Show HN: Aver – a Language Designed for AI to Write and Humans to Review 11 March 2026
Show HN: AIWatermarkDetector: Detect AI Watermarks in Text or Code 11 March 2026
Researchers Gave AI Agents Real Tools. One Deleted Its Own Mail Server 11 March 2026
Mnemos: Persistent Memory System for Local AI Agents 10 March 2026
FreeBSD 14.4 Released: Implications for Local LLM Deployment 10 March 2026
Community Survey: AI Content Automation Stacks in 2026 10 March 2026
Sarvam Open-Sources 30B and 105B Reasoning Models 9 March 2026
How to Run Your Own Local LLM — 2026 Edition 9 March 2026
Engram – Open-Source Persistent Memory for AI Agents 9 March 2026
Reverse engineering a DOS game with no source code using Codex 5.4 8 March 2026
Show HN: Proxly – Self-hosted tunneling on your own domain in 60 seconds 8 March 2026
OpenSpec: Spec-driven development (SDD) for AI coding assistants 8 March 2026
Benchmark: Local Open-Source LLMs Competitive in Real-Time Trading Applications 8 March 2026
AI Agent Reliability Tracker 8 March 2026
Show HN: SimplAI – Build and Deploy AI Agents and Workflows Without Boilerplate 7 March 2026
Self-Hosted Paperless-ngx With Optional Local AI Integration 7 March 2026
Show HN: RedDragon – LLM-Assisted IR Analysis of Code Across Languages 7 March 2026
Open WebUI Adds Native Terminal Tool Calling with Qwen3.5 35B Support 7 March 2026
Turning Your Linux Terminal into a Local AI Assistant 7 March 2026
Alibaba Releases Qwen 3.5 AI Model with On-Device AI Support 7 March 2026
The Emerging Role of SRAM-Centric Chips in AI Inference 6 March 2026
Real-World Qwen 3.5 9B Agent Performance on M1 Pro Validates Edge Deployment 6 March 2026
llama.cpp Merges Agentic Loop and MCP Client Support 6 March 2026
ConsciOS v1.0: A Viable Systems Architecture for Human and AI Alignment 6 March 2026
SynthesisOS – A Local-First, Agentic Desktop Layer Built in Rust 4 March 2026
Qwen 3.5-35B-A3B Achieves 37.8% on SWE-bench Verified Hard 4 March 2026
Quantifying Cost Savings with Local LLMs for Development 4 March 2026
Incrmd: Incremental AI Coding by Editing PROJECT.md 4 March 2026
ÆTHERYA Core – Deterministic Policy Engine for Governing LLM Actions 4 March 2026
RAG vs. Skill vs. MCP vs. RLM: Comparing LLM Enhancement Patterns 2 March 2026
HP ZBook Ultra 14 G1a Workstation Reclaims Local AI Workflows for Professionals 2 March 2026
Alibaba's Open-Source CoPaw AI Agent Now Compatible with MCP and ClawHub Skills 2 March 2026
RAG-Enterprise – 100% Local RAG System for Enterprise Documents 1 March 2026
Huawei's SuperPoD Portfolio Creates New Option for Global Computing at MWC Barcelona 2026 1 March 2026
4 Free Tools to Run Powerful AI on Your PC Without a Subscription 1 March 2026
DeepSeek V4 Multimodal Model Coming Next Week With Image and Video Generation 1 March 2026
AI-Native Store Research 1 March 2026
The Complete Developer's Guide to Running LLMs Locally: From Ollama to Production 26 February 2026
Show HN: Anonymize LLM traffic to dodge API fingerprinting and rate-limiting 26 February 2026
Show HN: A Human-Curated, CLI-Driven Context Layer for AI Agents 25 February 2026
Elastic Introduces Best-in-Class Embedding Models for High Performance Semantic Search 24 February 2026
Show HN: Dypai – Build Backends from Your IDE Using AI and MCP 24 February 2026
Enterprise Infrastructure Guide: Running Local LLMs for 70-150 Developers 24 February 2026
Show HN: Agora – AI API Pricing Oracle with X402 Micropayments 24 February 2026
Making Wolfram Technology Available as Foundation Tool for LLM Systems 23 February 2026
Open-Source Framework Achieves Gemini 3 Deep Think Level Performance Through Local Model Scaffolding 23 February 2026
Massu: Governance Layer for AI Coding Assistants with 51 MCP Tools 23 February 2026
GLM-5 Becomes Top Open-Weights Model on Extended NYT Connections Benchmark 23 February 2026
FORTHought: Self-Hosted AI Stack for Physics Labs Built on OpenWebUI 23 February 2026
Show HN: The Only CLI Your AI Agent Will Need 23 February 2026
Breaking the Speed Limit: Strategies for 17k Tokens/Sec Local Inference 23 February 2026
Ollama 0.17 Released With Improved OpenClaw Onboarding 22 February 2026
Show HN: Horizon – My AI-Powered Personal News Aggregator and Summarizer 22 February 2026
Search and Analyze Documents from the DOJ Epstein Files Release with Local LLM 21 February 2026
Claude Code Open – AI Coding Platform with Web IDE and Agents 21 February 2026
I Stopped Paying for ChatGPT and Built a Private AI Setup That Anyone Can Run 20 February 2026
The Path to Ubiquitous AI (17k tokens/sec) 20 February 2026
Ollama Production Deployment: Docker-Compose Setup Guide 20 February 2026
NVIDIA Releases Dynamo v0.9.0: Infrastructure Overhaul With FlashIndexer and Multi-Modal Support 20 February 2026
Mirai Secures $10M to Optimize On-Device AI Amid Cloud Cost Surge 20 February 2026
Using Local LLMs With Self-Hosted Tools to Manage Documents in Paperless-ngx 20 February 2026
Why AI Models Fail at Iterative Reasoning and What Could Fix It 20 February 2026
Self-Hosted Local LLMs for Document Management with Paperless-ngx 19 February 2026
Local-First RAG: Vector Search in SQLite with Hamming Distance 19 February 2026
Why My Country's AI Scene Is Built on Sand 18 February 2026
Alibaba's Qwen3.5-397B Achieves #3 Position in Open Weights Model Rankings 18 February 2026
Real-World Coding Benchmark Tests LLMs on 65 Production Codebase Tasks 18 February 2026
Ask HN: How Do You Debug Multi-Step AI Workflows When the Output Is Wrong? 18 February 2026
Self-Hosted AI: A Complete Roadmap for Beginners 17 February 2026
Open-Source Models Now Comprise 4 of Top 5 Most-Used Endpoints on OpenRouter 17 February 2026
Show HN: Inkog – Pre-flight check for AI agents (governance, loops, injection) 17 February 2026
Ask HN: What is the best bang for buck budget AI coding? 17 February 2026
I broke into my own AI system in 10 minutes. I built it 17 February 2026
Security Alert: Open Claw Designed for Self-Hosting, Stop Sharing Credentials 16 February 2026
Running Your Own AI Assistant for €19/Month: Complete Self-Hosting Guide 12 February 2026
GLM-5 Released: 744B Parameter MoE Model Targeting Complex Tasks 12 February 2026
I Tried a Claude Code Rival That's Local, Open Source, and Completely Free 12 February 2026
DeepSeek Launches Model Update with 1M Context Window 11 February 2026