Tagged "gemma"

Google Launches Tiny Board for Running Gemma 3 Locally 29 May 2026
Gemma 4: A New Budget-Focused Model in Posit AI 25 May 2026
BT Explainer: Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop 13 May 2026
Gemma 4 Replaces Entire Local LLM Stack for Many Practitioners 12 May 2026
Perplexity Brings On-Device AI Workflow to Macs with 'Personal Computer' Feature 8 May 2026
Google Releases Gemma 4 Multi-Token Prediction Drafters To Accelerate AI Inference 8 May 2026
Airplane AI – Local NDA Safe AI Powered by Gemma 8 May 2026
Google Accelerates Gemma 4 Inference Speed 3x With Multi-Token Prediction Drafters 6 May 2026
Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop 5 May 2026
Gemma 4 Just Replaced My Whole Local LLM Stack 4 May 2026
Google's Gemma 4 Brings Powerful AI Capabilities to Phones and Laptops 30 April 2026
Google's Gemma 4: Powerful AI Models Optimized for Your Phone and Laptop 28 April 2026
Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop 27 April 2026
Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop 26 April 2026
Google's Gemma 4 Brings Powerful On-Device AI to Phones and Laptops 25 April 2026
Google's Gemma 4 Finally Makes Local LLM Deployment Compelling for Practitioners 22 April 2026
16 Ways to Make a Small Language Model Think Bigger 21 April 2026
Gemma 4 Just Replaced My Whole Local LLM Stack 21 April 2026
Gemma 4 Just Replaced My Whole Local LLM Stack 19 April 2026
Google's Gemma 4: The Most Practical Local LLM Despite Not Being The Smartest 16 April 2026
Google's Gemma 4 Brings Game-Changing Performance to Local Laptop Inference 15 April 2026
Running Gemma 4 on an iPhone 13 Pro 15 April 2026
Speculative Decoding Achieves 29% Speed Boost for Gemma-4 31B 13 April 2026
Audio Processing Support Lands in llama.cpp with Gemma-4 13 April 2026
Google's Gemma 4 Brings Free Agentic AI to Your Phone With Zero Data Leaving the Device 12 April 2026
Google Gemma 4 Delivers Exceptional Speed and Accuracy for Local Inference 12 April 2026
Critical Unsloth Gemma-4 Chat Template Updates for Tool Calling 11 April 2026
Gemma 4 31B vs Qwen 3.5 27B: Comprehensive Long Context Benchmark 11 April 2026
Gemma 4 Template Improvements Enhance Tool Use and Dialog Compliance 10 April 2026
Community Reverse Engineers Gemma 4 Multi-Token Prediction Capability 10 April 2026
Gemma 4 Support Stabilized in Llama.cpp 9 April 2026
Gemma 4 GGUF Models Updated with Critical Quantization Fixes 9 April 2026
Google AI Edge Gallery Showcases Offline Inference with Gemma 4 8 April 2026
Google's Gemma 4 Brings Powerful On-Device AI to Android and iOS 8 April 2026
TurboQuant-Optimized llama.cpp Fork Delivers GFX906 GPU Acceleration 7 April 2026
Google Launches Offline AI Dictation App for iOS with Gemma 7 April 2026
Gemma 4 Achieves Top Multilingual Performance Across European Languages 7 April 2026
Gemma 4 26B Achieves Impressive Local Performance With Proper Configuration 7 April 2026
AMD Announces Day 0 Support for Google Gemma 4 Across Processors and GPUs 7 April 2026
Context Window Optimization: Extending Gemma 4 Context Length Through Efficient Projection Quantization 6 April 2026
Google AI Edge Gallery Tops App Store Charts with On-Device Gemma 4 6 April 2026
Real-time Multimodal AI on Apple Silicon: Gemma E2B Demo Shows Practical Edge Deployment 6 April 2026
Gemma 4 31B Achieves Exceptional Performance on Local Hardware 6 April 2026
Gemma 4 31B Achieves Third Place on FoodTruck Bench, Beating Larger Models 5 April 2026
Gemma 4 26B MoE Emerges as Optimal All-Around Local Model for Consumer Hardware 5 April 2026
Apple Research Shows Self-Distillation Significantly Improves Local Code Generation 5 April 2026
NVIDIA and Google Optimize Gemma 4 AI Models for Local RTX Deployment 4 April 2026
Google Launches Gemma 4 For Advanced On-Device AI 4 April 2026
Gemma 4 31B Outperforms GLM 5.1 in Real-World Testing 4 April 2026
Gemma 4 KV Cache Memory Issues Fixed in llama.cpp 4 April 2026
AMD Rolls Out Gemma 4 Model Support Across Full Range of GPUs & CPUs 4 April 2026
April 2026 TLDR Setup for Ollama and Gemma 4 26B on a Mac mini 3 April 2026
NVIDIA Accelerates Gemma 4 for Local Agentic AI on RTX GPUs 3 April 2026
VRAM Optimization Technique Cuts Gemma 4 Memory Usage by 3x 3 April 2026
Google Gemma 4 Released with GGUF Quantizations 3 April 2026
Gemma 4 Shows Strong Reasoning Performance with Thinking Tokens 3 April 2026
Gemma 4 26B A4B Outperforms Qwen 3.5 35B on Apple Silicon 3 April 2026
Google Launches Gemma 4 Open Models for Local On-Device AI 3 April 2026
Gemma 4 Makes Local AI Agents Practical 3 April 2026
Gemma 4 2B Successfully Runs on Raspberry Pi 5 3 April 2026
Gemma 4 on Arm: Optimized On-Device AI for Mobile and Edge Deployment 3 April 2026
AMD Provides Day 0 Support for Gemma 4 on Ryzen AI Processors and GPUs 3 April 2026
O-TITANS: Orthogonal LoRA Framework for Gemma 3 with Google TITANS Memory Architecture 22 February 2026