Tagged "hardware-optimization"
- South Korea to Launch $687 Million Project to Develop On-Device AI Semiconductors
- Local GPT-OSS 20B Model Demonstrates Practical Agentic Capabilities
- O-TITANS: Orthogonal LoRA Framework for Gemma 3 with Google TITANS Memory Architecture
- Taalas Etches AI Models onto Transistors to Rocket Boost Inference
- Mihup and Qualcomm Collaborate to Advance Secure On-Device Voice AI for BFSI
- GPT4All Replaces Ollama On Mac After Quick Trial
- Same INT8 Model Shows 93% to 71% Accuracy Variance Across Snapdragon Chipsets
- Qwen3-Next 80B MoE Achieves 39 Tokens/Second on RTX 5070/5060 Ti Dual-GPU Setup
- Sourdine: Open-Source macOS App for 100% Local AI Transcription
- Alibaba Unveils Major AI Model Upgrade Ahead of DeepSeek Release
- Simile AI Raises $100M Series A for Local AI Infrastructure
- Scaling llama.cpp On Neoverse N2: Solving Cross-NUMA Performance Issues
- NAS System Achieves 18 tok/s with 80B LLM Using Only Integrated Graphics
- Arm SME2 Technology Expands CPU Capabilities for On-Device AI