Tagged "self-hosted"
-
N8n, Dify, and Ollama Might Be the Best Self-Hosted AI Automation Stack Right Now
-
After Two Months of Open WebUI Updates, I'd Pick It Over ChatGPT's Interface for Local LLMs
-
Grokfeed: Terminal Feed Reader for HN, Reddit, and Lobste.rs Using Claude Code
-
Why the Same LLM Gives Different Answers in Different Environments
-
What Type of AI Usage? Deployment Patterns and Implementation Considerations
-
Show HN: Minimal Linux Sandboxes to Manage AI-Generated Code with Ease
-
Hipfire: A Rust-Native AMD Inference Engine That Outperforms llama.cpp
-
Singapore's Foreign Minister Builds an AI "Second Brain" Using NanoClaw
-
Thinking Outside the Box: New Attack Surfaces in Sandboxed AI Agents
-
NVIDIA Adds Day-0 DeepSeek V4 Blackwell Support
-
75% of US Health Systems Are Using AI. Only 18% of That Deployment Is Governed
-
Critical Security Flaw: Hackers Can Exploit Ollama Model Uploads to Leak Sensitive Server Data
-
Build Your Own Local AI Stack with 5 Docker Containers and Eliminate ChatGPT Subscriptions
-
Hackers Exploit Ollama Model Uploads to Leak Server Data
-
Netherlands Reaches Deal to Cut Reliance on U.S. Cloud Tech
-
Mathesar 0.10.0
-
I Built a Local AI Stack With 5 Docker Containers, and Now I'll Never Pay for ChatGPT Again
-
How to Make Sense of AI
-
Local LLM for Private Companies
-
Intel LLM-Scaler vLLM 0.14.0 Released With Official Arc Pro B70 Support
-
Malicious GGUF Models Could Trigger Remote Code Execution on SGLang Servers
-
Running DeepSeek R1 Locally: Your Complete Setup Guide
-
AI Quota Inflation Is No Token Effort. It's Baked In
-
I Built a Local AI Stack with 5 Docker Containers, and Now I'll Never Pay for ChatGPT Again
-
Show HN: I Can't Write Python. It Works Anyway – Local LLM Automation
-
Exposed LLM Infrastructure: How Attackers Find and Exploit Misconfigured AI Deployments
-
After Two Months of Open WebUI Updates, I'd Pick It Over ChatGPT's Interface for Local LLMs
-
Researcher Discovers 221 Bugs in vLLM Stemming From Single Root Cause
-
Project Glasswing and the ASF: Open-Source's Chance to Win the AI Era
-
Open WebUI Emerges as Superior Interface for Local LLMs After Two Months of Active Development
-
N8n, Dify, and Ollama Emerge as Leading Self-Hosted AI Automation Stack
-
Self-Hosted LLMs Transform Personal Knowledge Management Systems
-
Building Practical Local Coding Assistants: A Working Stack for Editor Integration
-
GPU Passthrough to LXCs in Proxmox Simplifies Local Inference Infrastructure
-
DGX Spark Setup Guide: Running vLLM and PyTorch for Local LLM Inference Backend
-
Talking to a Local LLM in the Firefox Sidebar
-
OpenNebula 7.2 "Dark Horse" Released with Enhanced Infrastructure Support
-
Developer Shares Golden Stack for Local Coding Assistant Integration Directly Inside Code Editors
-
Abliterated Local LLM Models Show Distinct Behavioral Characteristics Compared to Standard Variants
-
Build a Sovereign Local AI Stack: Ollama and Open WebUI and Pgvector 2026
-
Self-Hosted LLM Took Personal Knowledge Management System to the Next Level
-
On-Device AI Inference Emerges as New Security Blind Spot for CISOs
-
MiniMax M2.7 Open-Sources Globally as Industry's First Self-Improving Model
-
Running Same Prompts Through Claude and Local LLM Revealed Unexpected Results
-
Self-Hosted LLM Elevates Personal Knowledge Management Systems to New Levels
-
MiniMax M2.7 Advances Scalable Agentic Workflows on NVIDIA Platforms for Complex AI Applications
-
I Gave My AI Shell Access and Felt Uneasy – So I Sandboxed It
-
Self-Hosted LLMs Transform Personal Knowledge Management Systems
-
GLM 5.1 Dominates Agentic Benchmarks, Outperforming Most Models at 1/3 Opus Cost
-
Aisbf (AI Should Be Free) Proxy 0.99.18 Released
-
Self-Installing Skill Manager for AI Agents
-
Local Small LLMs Match Enterprise Model Performance on Vulnerability Detection
-
LLM Wiki v2: Extended Knowledge Base for LLM Practitioners
-
AI Scans 400k Reddit Posts to Flag Overlooked GLP-1 Side Effects
-
VoxCPM2: New Open-Source TTS Model with Voice Cloning and Design
-
Mano-P: Open-Source On-Device GUI Agent, #1 on OSWorld Benchmark
-
Ask HN: Local-First Meetings Recorder and Transcriber
-
Gemma 4 Support Stabilized in Llama.cpp
-
GitHub Copilot CLI Adds Support for BYOK and Local Model Deployment
-
Show HN: Willitrun – Check if Any ML Model Runs on Any Device (Benchmark-Backed)
-
StyleSeed – Design Rules That Make AI Coding Tools Produce Professional UI
-
Quansloth Using Google's Turboquant Breaks the VRAM Wall for Local LLMs
-
MemPalace, the Highest-Scoring AI Memory System Ever Benchmarked
-
Gemma 4 Achieves Top Multilingual Performance Across European Languages
-
Satsgate: Monetize AI Agents and APIs with Lightning L402 Protocol
-
Run AutoGEN with Ollama and LiteLLM in Simple Steps
-
YC-Bench: GLM-5 Matches Claude Opus 4.6 at 11× Lower Cost
-
GPUs vs. TPUs: Decoding the Powerhouses of AI
-
5 Useful Docker Containers for Agentic Developers
-
Google Gemma 4 Released with GGUF Quantizations
-
Show HN: Memsearch – Persistent, Cross-Agent, Cross-Session Memory for AI Agents
-
git11 Is an AI Workspace for GitHub Engineering Teams
-
Show HN: Extra-Platforms, Python Library to Detect OS, Arch, Shell, CI, AI
-
Chinese Chipmakers Claim Nearly Half of Local Market as Nvidia's Lead Shrinks
-
Qwen 3.5-27B Demonstrates Superior Performance vs Gemini 3.1 Pro and GPT-5.3
-
If Your AI Agent Ran NPM Install During the Axios Attack, You're Compromised
-
Orca – Executable skills and capabilities for AI agent workflows
-
Ollama Launches Pi: The Minimal Coding Agent That Powers OpenClaw Is Now Yours to Customize
-
Local AI didn't replace my subscriptions, but it did take over these 6 tasks
-
I built an O(1) physics engine to stop LLM hallucinations in construction
-
Samsung Launches Galaxy Book6 Series in India with NVIDIA RTX 5070 Graphics and On-Device AI
-
DeepSeek V3 Complete Guide: Deploy and Optimize Local AI in 2026
-
DeepSeek-R1 Chain-of-Thought Debugging: A Developer's Guide
-
Scion: Running Concurrent LLM Agents with Isolated Identities and Workspaces
-
RAG Deployment Lessons from Regulated Industries
-
Miasma: A Tool to Protect Data from AI Web Scrapers
-
Converting a Home Server Into a Production AI Appliance
-
IBM Granite 4.0 3B Vision: Compact Enterprise-Grade Document AI
-
DaVinci-MagiHuman: Open-Source AI Model for Realistic Video Generation
-
HP Launches Copilot+ PCs in India with On-Device AI Capabilities for Local Inference
-
GPU Passthrough to LXCs in Proxmox Simplifies Local LLM Deployment
-
Why Your AI Agents Will Turn Against You
-
This Self-Hosted Tool Makes My Local LLMs Feel Exactly Like ChatGPT, but Nothing Leaves My Network
-
Mistral AI Releases Voxtral: Open-Source TTS Model Beating ElevenLabs on Local Hardware
-
Homelab Consolidation: Replacing 3 Models with Single 122B MoE Model on AMD Ryzen AI MAX+
-
Hold on to Your Hardware: Implications for Local LLM Deployment
-
See What Your AI Agents Are Doing: Multi-Agent Observability Tool
-
Why Responsible AI Is the Bedrock of AI-Powered Applications
-
NVIDIA Releases GPT-OSS-Puzzle-88B, a Deployment-Optimized Model
-
Show HN: Beforeyouship – Pre-Build Tool to Estimate LLM Cost
-
Real-World Benchmark: DeepSeek-V3 Matches Claude Sonnet on Routine Coding Tasks
-
Show HN: Open Agent Spec – Treat AI Agents Like Typed Functions, Not Prompt Chains
-
AI Slop or Quality Storytelling? – Dune Themed MCP Gateway Tutorial
-
Private Brain LLM Setup on Windows PC Eliminates Need for Paid Cloud Services
-
Critical: LiteLLM Supply Chain Attack Detected, Bifrost Alternative Released
-
Council: A Structured Deliberation Protocol Across Diverse AI Models
-
Self-Hostable AI Agents and Internal Software Framework Released
-
Running a Private AI Brain on Windows PC as Alternative to Cloud Services
-
MiniMax M2.7 Model to Be Released as Open Weights
-
How to Build a Self-Hosted AI Server with LM Studio: Step-by-Step Guide
-
Powerful AI Search Engine Built on Single GeForce RTX 5090
-
Ditching Paid AI Services: Building Self-Hosted LLM Solutions as ChatGPT, Claude, and Gemini Alternatives
-
Setting Up a Private AI Brain on Windows: Complete Guide to Local LLM Deployment
-
Why You Should Use Both ChatGPT and Local LLMs: A Practical Hybrid Approach
-
Automating Read-It-Later Workflows with Local LLMs for Overnight Summarization
-
Self-Hosted AI Code Review with Local LLMs: Secure Automation Guide
-
Pydantic-Deep: Production Deep Agents for Pydantic AI
-
Local AI Coding Assistant: Free Cursor Alternative with VS Code, Ollama & Continue
-
DeepSeek R1 RTX 4090 vs Apple M3 Max: Benchmark & Performance Guide
-
Your Site Content Is Powering AI. Your Bank Account Has No Idea
-
Build a $1,500 AI Server with DeepSeek-R1 on RTX 4090
-
Why Self-Hosted LLMs Make Financial and Privacy Sense Over Paid Services
-
Claude Code Permissions Hook – Delegate Permission Approval to LLM
-
Kilo Is the VS Code Extension That Actually Works With Every Local LLM I Throw At It
-
Tether's QVAC Introduces Cross-Platform Bitnet LoRA Framework for On-Device AI Training
-
I Switched to a Local LLM for These 5 Tasks and the Cloud Version Hasn't Been Worth It Since
-
You're Using Your Local LLM Wrong If You're Prompting It Like a Cloud LLM
-
Mistral Releases Small 4 Open-Source Model Under Apache 2.0
-
Show HN: Merrilin.ai – Code Blocks in Your Books, Finally
-
LoKI – Local AI Assistant for Linux and WSL
-
Qwen3.5-397B Achieves 282 tok/s on 4x RTX PRO 6000 Blackwell Through Custom CUTLASS Kernel
-
OpenClaw vs Eigent vs Claude Cowork: Comparing Open-Source AI Collaboration Platforms
-
Nvidia's Nemotron 3 Super: Understanding the Significance for Local LLM Deployment
-
Two Local Models Prove Competitive Enough to Replace ChatGPT, Gemini, and Copilot
-
Show HN: Intake API – An Inbox for AI Coding Agents
-
Show HN: Bots of WallStreet – Multi-Agent Debate and Prediction Framework
-
AgentArmor: Open-Source 8-Layer Security Framework for AI Agents
-
Runpod Report: Qwen Has Overtaken Meta's Llama As The Most-Deployed Self-Hosted LLM
-
Linux 7.0 AMDGPU Fixing Idle Power Issue For RDNA4 GPUs After Compute Workloads
-
Intel Updates LLM-Scaler-vLLM With Support For More Qwen3/3.5 Models
-
Qwodel – An Open-Source Unified Pipeline for LLM Quantization
-
Nvidia Releases Nemotron 3 Super: 120B MoE Model for Local Deployment
-
MeepaChat – Slack for AI Agents (iOS, macOS, Web / Cloud, Self-Hosted)
-
Apple M5 Max 128GB Benchmark Results for Local LLM Inference
-
Show HN: Detect When an LLM Silently Changes Behavior for the Same Prompt
-
Ex-Manus Backend Lead Shares: Moving Beyond Function Calling in Agent Design
-
LMF – LLM Markup Format
-
A Kubernetes Operator That Orchestrates AI Coding Agents
-
Show HN: Aver – a Language Designed for AI to Write and Humans to Review
-
Show HN: AIWatermarkDetector: Detect AI Watermarks in Text or Code
-
Researchers Gave AI Agents Real Tools. One Deleted Its Own Mail Server
-
Mnemos: Persistent Memory System for Local AI Agents
-
FreeBSD 14.4 Released: Implications for Local LLM Deployment
-
Community Survey: AI Content Automation Stacks in 2026
-
Sarvam Open-Sources 30B and 105B Reasoning Models
-
How to Run Your Own Local LLM — 2026 Edition
-
Engram – Open-Source Persistent Memory for AI Agents
-
Reverse engineering a DOS game with no source code using Codex 5.4
-
Show HN: Proxly – Self-hosted tunneling on your own domain in 60 seconds
-
OpenSpec: Spec-driven development (SDD) for AI coding assistants
-
Benchmark: Local Open-Source LLMs Competitive in Real-Time Trading Applications
-
AI Agent Reliability Tracker
-
Show HN: SimplAI – Build and Deploy AI Agents and Workflows Without Boilerplate
-
Self-Hosted Paperless-ngx With Optional Local AI Integration
-
Show HN: RedDragon – LLM-Assisted IR Analysis of Code Across Languages
-
Open WebUI Adds Native Terminal Tool Calling with Qwen3.5 35B Support
-
Turning Your Linux Terminal into a Local AI Assistant
-
Alibaba Releases Qwen 3.5 AI Model with On-Device AI Support
-
The Emerging Role of SRAM-Centric Chips in AI Inference
-
Real-World Qwen 3.5 9B Agent Performance on M1 Pro Validates Edge Deployment
-
llama.cpp Merges Agentic Loop and MCP Client Support
-
ConsciOS v1.0: A Viable Systems Architecture for Human and AI Alignment
-
SynthesisOS – A Local-First, Agentic Desktop Layer Built in Rust
-
Qwen 3.5-35B-A3B Achieves 37.8% on SWE-bench Verified Hard
-
Quantifying Cost Savings with Local LLMs for Development
-
Incrmd: Incremental AI Coding by Editing PROJECT.md
-
ÆTHERYA Core – Deterministic Policy Engine for Governing LLM Actions
-
RAG vs. Skill vs. MCP vs. RLM: Comparing LLM Enhancement Patterns
-
HP ZBook Ultra 14 G1a Workstation Reclaims Local AI Workflows for Professionals
-
Alibaba's Open-Source CoPaw AI Agent Now Compatible with MCP and ClawHub Skills
-
RAG-Enterprise – 100% Local RAG System for Enterprise Documents
-
Huawei's SuperPoD Portfolio Creates New Option for Global Computing at MWC Barcelona 2026
-
4 Free Tools to Run Powerful AI on Your PC Without a Subscription
-
DeepSeek V4 Multimodal Model Coming Next Week With Image and Video Generation
-
AI-Native Store Research
-
The Complete Developer's Guide to Running LLMs Locally: From Ollama to Production
-
Show HN: Anonymize LLM traffic to dodge API fingerprinting and rate-limiting
-
Show HN: A Human-Curated, CLI-Driven Context Layer for AI Agents
-
Elastic Introduces Best-in-Class Embedding Models for High Performance Semantic Search
-
Show HN: Dypai – Build Backends from Your IDE Using AI and MCP
-
Enterprise Infrastructure Guide: Running Local LLMs for 70-150 Developers
-
Show HN: Agora – AI API Pricing Oracle with X402 Micropayments
-
Making Wolfram Technology Available as Foundation Tool for LLM Systems
-
Open-Source Framework Achieves Gemini 3 Deep Think Level Performance Through Local Model Scaffolding
-
Massu: Governance Layer for AI Coding Assistants with 51 MCP Tools
-
GLM-5 Becomes Top Open-Weights Model on Extended NYT Connections Benchmark
-
FORTHought: Self-Hosted AI Stack for Physics Labs Built on OpenWebUI
-
Show HN: The Only CLI Your AI Agent Will Need
-
Breaking the Speed Limit: Strategies for 17k Tokens/Sec Local Inference
-
Ollama 0.17 Released With Improved OpenClaw Onboarding
-
Show HN: Horizon – My AI-Powered Personal News Aggregator and Summarizer
-
Search and Analyze Documents from the DOJ Epstein Files Release with Local LLM
-
Claude Code Open – AI Coding Platform with Web IDE and Agents
-
I Stopped Paying for ChatGPT and Built a Private AI Setup That Anyone Can Run
-
The Path to Ubiquitous AI (17k tokens/sec)
-
Ollama Production Deployment: Docker-Compose Setup Guide
-
NVIDIA Releases Dynamo v0.9.0: Infrastructure Overhaul With FlashIndexer and Multi-Modal Support
-
Mirai Secures $10M to Optimize On-Device AI Amid Cloud Cost Surge
-
Using Local LLMs With Self-Hosted Tools to Manage Documents in Paperless-ngx
-
Why AI Models Fail at Iterative Reasoning and What Could Fix It
-
Self-Hosted Local LLMs for Document Management with Paperless-ngx
-
Local-First RAG: Vector Search in SQLite with Hamming Distance
-
Why My Country's AI Scene Is Built on Sand
-
Alibaba's Qwen3.5-397B Achieves #3 Position in Open Weights Model Rankings
-
Real-World Coding Benchmark Tests LLMs on 65 Production Codebase Tasks
-
Ask HN: How Do You Debug Multi-Step AI Workflows When the Output Is Wrong?
-
Self-Hosted AI: A Complete Roadmap for Beginners
-
Open-Source Models Now Comprise 4 of Top 5 Most-Used Endpoints on OpenRouter
-
Show HN: Inkog – Pre-flight check for AI agents (governance, loops, injection)
-
Ask HN: What is the best bang for buck budget AI coding?
-
I broke into my own AI system in 10 minutes. I built it
-
Security Alert: Open Claw Designed for Self-Hosting, Stop Sharing Credentials
-
Running Your Own AI Assistant for €19/Month: Complete Self-Hosting Guide
-
GLM-5 Released: 744B Parameter MoE Model Targeting Complex Tasks
-
I Tried a Claude Code Rival That's Local, Open Source, and Completely Free
-
DeepSeek Launches Model Update with 1M Context Window