Tagged "model-performance"

Gemma 4: A New Budget-Focused Model in Posit AI 25 May 2026
Adobe Photoshop Update Brings On-Device AI Processing 21 May 2026
HP's On-Device AI Needs More If It Is Going to Compete With Copilot 17 May 2026
Open-Source Local LLM Emerges as Viable Cloud AI Competitor 15 May 2026
Local LLM Persistent Context Prevents Repetitive Mistakes 14 May 2026
I Replaced My Local LLM With a Model Half Its Size and Got Better Results 24 April 2026
Gemma 4 Just Replaced My Whole Local LLM Stack 21 April 2026
Fine-Tuned Qwen3.5-0.8B for OCR Outperforms Previous 2B Release 14 April 2026
Abliterated Local LLM Models Show Distinct Behavioral Characteristics Compared to Standard Variants 14 April 2026
MiniMax-M2.7 Delivers Exceptional Performance on Consumer Hardware 13 April 2026
GLM 5.1 Dominates Agentic Benchmarks, Outperforming Most Models at 1/3 Opus Cost 11 April 2026
Show HN: Willitrun – Check if Any ML Model Runs on Any Device (Benchmark-Backed) 7 April 2026
Gemma 4 Achieves Top Multilingual Performance Across European Languages 7 April 2026
Gemma 4 31B Achieves Exceptional Performance on Local Hardware 6 April 2026
Gemma 4 31B Achieves Third Place on FoodTruck Bench, Beating Larger Models 5 April 2026
Gemma 4 31B Outperforms GLM 5.1 in Real-World Testing 4 April 2026
Gemma 4 26B A4B Outperforms Qwen 3.5 35B on Apple Silicon 3 April 2026
Qwen 3.5-27B Demonstrates Superior Performance vs Gemini 3.1 Pro and GPT-5.3 1 April 2026
Ollama Adopts Apple's MLX Framework for Faster Local AI on Mac 1 April 2026
PrismML Announces 1-Bit Bonsai: First Commercially Viable 1-Bit LLMs 1 April 2026
Real-World Benchmark: DeepSeek-V3 Matches Claude Sonnet on Routine Coding Tasks 26 March 2026
Velr: Embedded Property-Graph Database for Local LLM Applications 23 March 2026
Alibaba Commits to Continuous Open-Sourcing of Qwen and Wan Models 23 March 2026
Setting Up a Private AI Brain on Windows: Complete Guide to Local LLM Deployment 22 March 2026
Nvidia Nemotron Cascade 2 30B Emerges as Powerful Alternative to Qwen Models 22 March 2026
Qwen 3.5 Emerges as Top Performer for Local Deployment with Extensive Quantization Options 20 March 2026
Open-Source LLMs Rapidly Displacing Proprietary SOTA Models 16 March 2026
Nvidia's Nemotron 3 Super: Understanding the Significance for Local LLM Deployment 15 March 2026
Qwen 3.5 Family Benchmark Comparison Shows Strong Performance Across Smaller Models 9 March 2026
FretBench – Testing 14 LLMs on Reading Guitar Tabs Reveals Performance Gaps 9 March 2026
Critical: Qwen 3.5 Requires BF16 KV Cache, Not FP16 for Accurate Inference 2 March 2026
Change Intent Records: The Missing Artifact in AI-Assisted Development 2 March 2026
Qwen3.5-35B RTX 5080 Experiments Confirm KV q8_0 as Free Lunch, Q4_K_M Remains Optimal 28 February 2026
Qwen 3.5-27B Demonstrates Exceptional Performance with Thoughtful Prompt Engineering 28 February 2026
Qwen 3.5 Underperforms on Hard Coding Tasks—APEX Benchmark Analysis 26 February 2026
Qwen3.5 122B Achieves 25 tok/s on 72GB VRAM Setup 26 February 2026
Advanced Quantization Techniques Show Surprising Performance Gains Over Standard Methods 25 February 2026
GLM-5 Becomes Top Open-Weights Model on Extended NYT Connections Benchmark 23 February 2026
Strix Halo Performance Benchmarks: Minimax M2.5, Step 3.5 Flash, Qwen3 Coder 21 February 2026
MiniMax-M2.5 230B MoE Model Released with GGUF Support for Local Deployment 14 February 2026
GLM-5 Released: 744B Parameter MoE Model Targeting Complex Tasks 12 February 2026