Tagged "resource-efficiency"
- GPU Passthrough to LXCs in Proxmox Outperforms VMs and Simplifies Local AI Infrastructure
- 16 Ways to Make a Small Language Model Think Bigger
- Gemma 4 Makes Local AI Agents Practical
- TinyGPU Adds Mac Support for External Nvidia GPU Acceleration
- GPU Passthrough to LXCs in Proxmox Simplifies Local LLM Deployment
- Mistral AI Releases Voxtral: Open-Source TTS Model Beating ElevenLabs on Local Hardware
- NVIDIA Releases GPT-OSS-Puzzle-88B, a Deployment-Optimized Model
- OmniCoder-9B: Efficient Coding Model for 8GB GPUs
- Cutile.jl Brings Nvidia CUDA Tile-Based Programming to Julia
- Alibaba's Qwen 3.5 Small Model Runs Directly on iPhone 17
- Wave Field LLM Achieves O(n log n) Scaling: 825M Model Trained to 1B Parameters in 13 Hours
- GPT-OSS 120B Uncensored Model Released in Native MXFP4 Precision
- Running Mistral-7B on Intel NPU Achieves 12.6 Tokens/Second
- Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts