Tagged "rag"
- Velr: Embedded Property-Graph Database for Local LLM Applications
- LM Studio Releases Reworked Plugins with Fully Local Web Research
- Powerful AI Search Engine Built on Single GeForce RTX 5090
- Llama 8B Matches 70B Performance on Multi-Hop QA Using Structured Prompting
- LMCache Dramatically Accelerates LLM Inference on Oracle Data Science Platform
- MiniMax-M2.7: New Compact Model Announced for Local Deployment
- Mamba 3: State Space Model Architecture Optimized for Inference
- Gloss: Open-Source, Local-First RAG Alternative to NotebookLM Built in Rust
- AI Agent Reliability Tracker
- Framework Choice Critical: llama.cpp and vLLM Outperform Ollama for Qwen 3.5 Testing
- RAG vs. Skill vs. MCP vs. RLM: Comparing LLM Enhancement Patterns
- RAG-Enterprise – 100% Local RAG System for Enterprise Documents
- Building a Privacy-Preserving RAG System in the Browser
- Researchers Develop Persistent Memory System for Local LLMs—No RAG Required
- Elastic Introduces Best-in-Class Embedding Models for High Performance Semantic Search
- Search and Analyze Documents from the DOJ Epstein Files Release with Local LLM
- NVIDIA Releases Dynamo v0.9.0: Infrastructure Overhaul With FlashIndexer and Multi-Modal Support
- Local-First RAG: Vector Search in SQLite with Hamming Distance
- InitRunner: YAML-Based AI Agent Framework with RAG and Memory
- GPU-Accelerated DataFrame Library for Local Inference Workloads
- Microsoft MarkItDown: Document Preprocessing Tool for LLMs
- Building a RAG Pipeline on 2M+ Pages: EpsteinFiles-RAG Project