Tagged "gemma"
- Google's Gemma 4: Powerful AI Models Optimized for Your Phone and Laptop
- Google's Gemma 4 Could Put Powerful AI on Your Phone and Laptop
- Google's Gemma 4 Brings Powerful On-Device AI to Phones and Laptops
- Google's Gemma 4 Finally Makes Local LLM Deployment Compelling for Practitioners
- 16 Ways to Make a Small Language Model Think Bigger
- Gemma 4 Just Replaced My Whole Local LLM Stack
- Google's Gemma 4: The Most Practical Local LLM Despite Not Being The Smartest
- Google's Gemma 4 Brings Game-Changing Performance to Local Laptop Inference
- Running Gemma 4 on an iPhone 13 Pro
- Speculative Decoding Achieves 29% Speed Boost for Gemma-4 31B
- Audio Processing Support Lands in llama.cpp with Gemma-4
- Google's Gemma 4 Brings Free Agentic AI to Your Phone With Zero Data Leaving the Device
- Google Gemma 4 Delivers Exceptional Speed and Accuracy for Local Inference
- Critical Unsloth Gemma-4 Chat Template Updates for Tool Calling
- Gemma 4 31B vs Qwen 3.5 27B: Comprehensive Long Context Benchmark
- Gemma 4 Template Improvements Enhance Tool Use and Dialog Compliance
- Community Reverse Engineers Gemma 4 Multi-Token Prediction Capability
- Gemma 4 Support Stabilized in Llama.cpp
- Gemma 4 GGUF Models Updated with Critical Quantization Fixes
- Google AI Edge Gallery Showcases Offline Inference with Gemma 4
- Google's Gemma 4 Brings Powerful On-Device AI to Android and iOS
- TurboQuant-Optimized llama.cpp Fork Delivers GFX906 GPU Acceleration
- Google Launches Offline AI Dictation App for iOS with Gemma
- Gemma 4 Achieves Top Multilingual Performance Across European Languages
- Gemma 4 26B Achieves Impressive Local Performance With Proper Configuration
- AMD Announces Day 0 Support for Google Gemma 4 Across Processors and GPUs
- Context Window Optimization: Extending Gemma 4 Context Length Through Efficient Projection Quantization
- Google AI Edge Gallery Tops App Store Charts with On-Device Gemma 4
- Real-time Multimodal AI on Apple Silicon: Gemma E2B Demo Shows Practical Edge Deployment
- Gemma 4 31B Achieves Exceptional Performance on Local Hardware
- Gemma 4 31B Achieves Third Place on FoodTruck Bench, Beating Larger Models
- Gemma 4 26B MoE Emerges as Optimal All-Around Local Model for Consumer Hardware
- Apple Research Shows Self-Distillation Significantly Improves Local Code Generation
- NVIDIA and Google Optimize Gemma 4 AI Models for Local RTX Deployment
- Google Launches Gemma 4 For Advanced On-Device AI
- Gemma 4 31B Outperforms GLM 5.1 in Real-World Testing
- Gemma 4 KV Cache Memory Issues Fixed in llama.cpp
- AMD Rolls Out Gemma 4 Model Support Across Full Range of GPUs & CPUs
- April 2026 TLDR Setup for Ollama and Gemma 4 26B on a Mac mini
- NVIDIA Accelerates Gemma 4 for Local Agentic AI on RTX GPUs
- VRAM Optimization Technique Cuts Gemma 4 Memory Usage by 3x
- Google Gemma 4 Released with GGUF Quantizations
- Gemma 4 Shows Strong Reasoning Performance with Thinking Tokens
- Gemma 4 26B A4B Outperforms Qwen 3.5 35B on Apple Silicon
- Google Launches Gemma 4 Open Models for Local On-Device AI
- Gemma 4 Makes Local AI Agents Practical
- Gemma 4 2B Successfully Runs on Raspberry Pi 5
- Gemma 4 on Arm: Optimized On-Device AI for Mobile and Edge Deployment
- AMD Provides Day 0 Support for Gemma 4 on Ryzen AI Processors and GPUs
- O-TITANS: Orthogonal LoRA Framework for Gemma 3 with Google TITANS Memory Architecture