Tagged "resource-efficiency"
- OmniCoder-9B: Efficient Coding Model for 8GB GPUs
- Cutile.jl Brings Nvidia CUDA Tile-Based Programming to Julia
- Alibaba's Qwen 3.5 Small Model Runs Directly on iPhone 17
- Wave Field LLM Achieves O(n log n) Scaling: 825M Model Trained to 1B Parameters in 13 Hours
- GPT-OSS 120B Uncensored Model Released in Native MXFP4 Precision
- Running Mistral-7B on Intel NPU Achieves 12.6 Tokens/Second
- Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts