Tagged "resource-optimization"
- Qwen 3.5 Models: Optimal Settings and Reduced Overthinking Configuration
- LMCache Dramatically Accelerates LLM Inference on Oracle Data Science Platform
- Custom GPU Multiplexer Achieves 0.3ms Model Switching on Legacy Hardware
- Kimi Introduces Attention Residuals: 1.25x Compute Performance at <2% Overhead
- FreeBSD 14.4 Released: Implications for Local LLM Deployment
- Fine-Tuned Qwen SLMs (0.6–8B) Demonstrate Competitive Performance Against Frontier LLMs on Specialized Tasks
- Snapdragon Wear Elite Unveiled at MWC 2026, Advancing Wearable AI Inference
- SynthesisOS – A Local-First, Agentic Desktop Layer Built in Rust
- RunAnywhere Launches Production-Grade On-Device AI Platform for Enterprise Scale
- Qwen 3.5-27B Q4 Quantization Comparison and Analysis
- The ML.energy Leaderboard
- DeepSeek Paper – DualPath: Breaking the Bandwidth Bottleneck in LLM Inference
- Show HN: A Ground-Up TLS 1.3 Client Written in C
- O-TITANS: Orthogonal LoRA Framework for Gemma 3 with Google TITANS Memory Architecture
- At India AI Impact Summit, Intel Showcases Its AI PCs and Cost-Efficient Frugal AI
- 24 Simultaneous Claude Code Agents on Local Hardware
- TemplateFlow – Build AI Workflows, Not Prompts
- Mirai Secures $10M to Optimize On-Device AI Amid Cloud Cost Surge
- Local-First RAG: Vector Search in SQLite with Hamming Distance
- Sarvam AI Launches Edge Model to Challenge Major AI Players with Local-First Approach
- OpenClaw Refactored in Go, Runs on $10 Hardware
- Meet Sarvam Edge: India's AI Model That Runs on Phones and Laptops With No Internet
- Switching From Ollama and LM Studio to llama.cpp: A Performance Comparison
- MiniMax Releases M2.5 Model with SOTA Coding and Agent Capabilities
- Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts
- Energy-Based Models Compared Against Frontier AI for Sudoku Solving