Tagged "self-hosted"
-
Self-Hostable AI Agents and Internal Software Framework Released
-
Running a Private AI Brain on Windows PC as Alternative to Cloud Services
-
MiniMax M2.7 Model to Be Released as Open Weights
-
How to Build a Self-Hosted AI Server with LM Studio: Step-by-Step Guide
-
Powerful AI Search Engine Built on Single GeForce RTX 5090
-
Ditching Paid AI Services: Building Self-Hosted LLM Solutions as ChatGPT, Claude, and Gemini Alternatives
-
Setting Up a Private AI Brain on Windows: Complete Guide to Local LLM Deployment
-
Why You Should Use Both ChatGPT and Local LLMs: A Practical Hybrid Approach
-
Automating Read-It-Later Workflows with Local LLMs for Overnight Summarization
-
Self-Hosted AI Code Review with Local LLMs: Secure Automation Guide
-
Pydantic-Deep: Production Deep Agents for Pydantic AI
-
Local AI Coding Assistant: Free Cursor Alternative with VS Code, Ollama & Continue
-
DeepSeek R1 RTX 4090 vs Apple M3 Max: Benchmark & Performance Guide
-
Your Site Content Is Powering AI. Your Bank Account Has No Idea
-
Build a $1,500 AI Server with DeepSeek-R1 on RTX 4090
-
Why Self-Hosted LLMs Make Financial and Privacy Sense Over Paid Services
-
Claude Code Permissions Hook – Delegate Permission Approval to LLM
-
Kilo Is the VS Code Extension That Actually Works With Every Local LLM I Throw At It
-
Tether's QVAC Introduces Cross-Platform Bitnet LoRA Framework for On-Device AI Training
-
I Switched to a Local LLM for These 5 Tasks and the Cloud Version Hasn't Been Worth It Since
-
You're Using Your Local LLM Wrong If You're Prompting It Like a Cloud LLM
-
Mistral Releases Small 4 Open-Source Model Under Apache 2.0
-
Show HN: Merrilin.ai – Code Blocks in Your Books, Finally
-
LoKI – Local AI Assistant for Linux and WSL
-
Qwen3.5-397B Achieves 282 tok/s on 4x RTX PRO 6000 Blackwell Through Custom CUTLASS Kernel
-
OpenClaw vs Eigent vs Claude Cowork: Comparing Open-Source AI Collaboration Platforms
-
Nvidia's Nemotron 3 Super: Understanding the Significance for Local LLM Deployment
-
Two Local Models Prove Competitive Enough to Replace ChatGPT, Gemini, and Copilot
-
Show HN: Intake API – An Inbox for AI Coding Agents
-
Show HN: Bots of WallStreet – Multi-Agent Debate and Prediction Framework
-
AgentArmor: Open-Source 8-Layer Security Framework for AI Agents
-
Runpod Report: Qwen Has Overtaken Meta's Llama As The Most-Deployed Self-Hosted LLM
-
Linux 7.0 AMDGPU Fixing Idle Power Issue For RDNA4 GPUs After Compute Workloads
-
Intel Updates LLM-Scaler-vLLM With Support For More Qwen3/3.5 Models
-
Qwodel – An Open-Source Unified Pipeline for LLM Quantization
-
Nvidia Releases Nemotron 3 Super: 120B MoE Model for Local Deployment
-
MeepaChat – Slack for AI Agents (iOS, macOS, Web / Cloud, Self-Hosted)
-
Apple M5 Max 128GB Benchmark Results for Local LLM Inference
-
Show HN: Detect When an LLM Silently Changes Behavior for the Same Prompt
-
Ex-Manus Backend Lead Shares: Moving Beyond Function Calling in Agent Design
-
LMF – LLM Markup Format
-
A Kubernetes Operator That Orchestrates AI Coding Agents
-
Show HN: Aver – a Language Designed for AI to Write and Humans to Review
-
Show HN: AIWatermarkDetector: Detect AI Watermarks in Text or Code
-
Researchers Gave AI Agents Real Tools. One Deleted Its Own Mail Server
-
Mnemos: Persistent Memory System for Local AI Agents
-
FreeBSD 14.4 Released: Implications for Local LLM Deployment
-
Community Survey: AI Content Automation Stacks in 2026
-
Sarvam Open-Sources 30B and 105B Reasoning Models
-
How to Run Your Own Local LLM — 2026 Edition
-
Engram – Open-Source Persistent Memory for AI Agents
-
Reverse engineering a DOS game with no source code using Codex 5.4
-
Show HN: Proxly – Self-hosted tunneling on your own domain in 60 seconds
-
OpenSpec: Spec-driven development (SDD) for AI coding assistants
-
Benchmark: Local Open-Source LLMs Competitive in Real-Time Trading Applications
-
AI Agent Reliability Tracker
-
Show HN: SimplAI – Build and Deploy AI Agents and Workflows Without Boilerplate
-
Self-Hosted Paperless-ngx With Optional Local AI Integration
-
Show HN: RedDragon – LLM-Assisted IR Analysis of Code Across Languages
-
Open WebUI Adds Native Terminal Tool Calling with Qwen3.5 35B Support
-
Turning Your Linux Terminal into a Local AI Assistant
-
Alibaba Releases Qwen 3.5 AI Model with On-Device AI Support
-
The Emerging Role of SRAM-Centric Chips in AI Inference
-
Real-World Qwen 3.5 9B Agent Performance on M1 Pro Validates Edge Deployment
-
llama.cpp Merges Agentic Loop and MCP Client Support
-
ConsciOS v1.0: A Viable Systems Architecture for Human and AI Alignment
-
SynthesisOS – A Local-First, Agentic Desktop Layer Built in Rust
-
Qwen 3.5-35B-A3B Achieves 37.8% on SWE-bench Verified Hard
-
Quantifying Cost Savings with Local LLMs for Development
-
Incrmd: Incremental AI Coding by Editing PROJECT.md
-
ÆTHERYA Core – Deterministic Policy Engine for Governing LLM Actions
-
RAG vs. Skill vs. MCP vs. RLM: Comparing LLM Enhancement Patterns
-
HP ZBook Ultra 14 G1a Workstation Reclaims Local AI Workflows for Professionals
-
Alibaba's Open-Source CoPaw AI Agent Now Compatible with MCP and ClawHub Skills
-
RAG-Enterprise – 100% Local RAG System for Enterprise Documents
-
Huawei's SuperPoD Portfolio Creates New Option for Global Computing at MWC Barcelona 2026
-
4 Free Tools to Run Powerful AI on Your PC Without a Subscription
-
DeepSeek V4 Multimodal Model Coming Next Week With Image and Video Generation
-
AI-Native Store Research
-
The Complete Developer's Guide to Running LLMs Locally: From Ollama to Production
-
Show HN: Anonymize LLM traffic to dodge API fingerprinting and rate-limiting
-
Show HN: A Human-Curated, CLI-Driven Context Layer for AI Agents
-
Elastic Introduces Best-in-Class Embedding Models for High Performance Semantic Search
-
Show HN: Dypai – Build Backends from Your IDE Using AI and MCP
-
Enterprise Infrastructure Guide: Running Local LLMs for 70-150 Developers
-
Show HN: Agora – AI API Pricing Oracle with X402 Micropayments
-
Making Wolfram Technology Available as Foundation Tool for LLM Systems
-
Open-Source Framework Achieves Gemini 3 Deep Think Level Performance Through Local Model Scaffolding
-
Massu: Governance Layer for AI Coding Assistants with 51 MCP Tools
-
GLM-5 Becomes Top Open-Weights Model on Extended NYT Connections Benchmark
-
FORTHought: Self-Hosted AI Stack for Physics Labs Built on OpenWebUI
-
Show HN: The Only CLI Your AI Agent Will Need
-
Breaking the Speed Limit: Strategies for 17k Tokens/Sec Local Inference
-
Ollama 0.17 Released With Improved OpenClaw Onboarding
-
Show HN: Horizon – My AI-Powered Personal News Aggregator and Summarizer
-
Search and Analyze Documents from the DOJ Epstein Files Release with Local LLM
-
Claude Code Open – AI Coding Platform with Web IDE and Agents
-
I Stopped Paying for ChatGPT and Built a Private AI Setup That Anyone Can Run
-
The Path to Ubiquitous AI (17k tokens/sec)
-
Ollama Production Deployment: Docker-Compose Setup Guide
-
NVIDIA Releases Dynamo v0.9.0: Infrastructure Overhaul With FlashIndexer and Multi-Modal Support
-
Mirai Secures $10M to Optimize On-Device AI Amid Cloud Cost Surge
-
Using Local LLMs With Self-Hosted Tools to Manage Documents in Paperless-ngx
-
Why AI Models Fail at Iterative Reasoning and What Could Fix It
-
Self-Hosted Local LLMs for Document Management with Paperless-ngx
-
Local-First RAG: Vector Search in SQLite with Hamming Distance
-
Why My Country's AI Scene Is Built on Sand
-
Alibaba's Qwen3.5-397B Achieves #3 Position in Open Weights Model Rankings
-
Real-World Coding Benchmark Tests LLMs on 65 Production Codebase Tasks
-
Ask HN: How Do You Debug Multi-Step AI Workflows When the Output Is Wrong?
-
Self-Hosted AI: A Complete Roadmap for Beginners
-
Open-Source Models Now Comprise 4 of Top 5 Most-Used Endpoints on OpenRouter
-
Show HN: Inkog – Pre-flight check for AI agents (governance, loops, injection)
-
Ask HN: What is the best bang for buck budget AI coding?
-
I broke into my own AI system in 10 minutes. I built it
-
Security Alert: OpenClaw Designed for Self-Hosting, Stop Sharing Credentials
-
Running Your Own AI Assistant for €19/Month: Complete Self-Hosting Guide
-
GLM-5 Released: 744B Parameter MoE Model Targeting Complex Tasks
-
I Tried a Claude Code Rival That's Local, Open Source, and Completely Free
-
DeepSeek Launches Model Update with 1M Context Window