
Argo Knowledge RAG

The brain behind ArgoBox's AI assistant

Hybrid BM25 + vector search across 166K+ chunks — 100% local, zero API cost

166K+ Chunks Indexed
4 Collections
100% Local
$0 Per Query

What It Powers

Public AI Assistant

The “Ask Argonaut” chat on every page uses this RAG system to answer questions about ArgoBox with accurate, sourced responses.

Try it live →

Private Knowledge Engine

Admin-only tiers search across 10,000+ documents from Obsidian vaults, legal PDFs, and raw session transcripts — all running locally.

Claude Code Integration

Powers Claude Code's AI context system, letting it find relevant documentation and past decisions across the entire project history.


How It Works

Ingest

Reads MD, PDF, DOCX, RTF, EML, XLSX — extracts text and frontmatter

Multi-format
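
The ingest step's format routing can be sketched as a simple extension map; the helper name and routing labels here are illustrative assumptions, not the project's actual code:

```typescript
// Illustrative extension-to-parser routing; the labels and skip
// behavior are assumptions, not the project's actual ingest code.
const parsers: Record<string, string> = {
  ".md": "markdown",     // read directly; YAML frontmatter split off
  ".pdf": "subprocess",  // binary formats go to subprocess parsers
  ".docx": "subprocess",
  ".rtf": "subprocess",
  ".eml": "mail",        // headers become metadata, body becomes text
  ".xlsx": "subprocess",
};

function pickParser(filename: string): string {
  const ext = filename.slice(filename.lastIndexOf(".")).toLowerCase();
  return parsers[ext] ?? "skip"; // unknown formats are skipped, not fatal
}
```

Routing unknown extensions to "skip" rather than throwing is one way to keep a large ingest run from dying on a single odd file.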

Chunk

Paragraph-aware splitting (400 words, 80 overlap) preserves semantic units

Context-aware
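
The 400-word / 80-overlap split can be sketched roughly like this (a simplified take on paragraph-aware chunking; the real splitter may handle details such as oversized paragraphs differently):

```typescript
// Paragraph-aware chunker: packs whole paragraphs up to a word budget,
// then starts the next chunk with the last `overlap` words for context.
// Function and parameter names are illustrative, not the project's API.
function chunk(text: string, maxWords = 400, overlap = 80): string[] {
  const paras = text.split(/\n\s*\n/).map(p => p.trim()).filter(Boolean);
  const chunks: string[] = [];
  let current: string[] = []; // words in the chunk being built

  for (const para of paras) {
    const words = para.split(/\s+/);
    if (current.length + words.length > maxWords && current.length > 0) {
      chunks.push(current.join(" "));
      current = current.slice(-overlap); // carried-over context window
    }
    current.push(...words); // a single huge paragraph stays one chunk here
  }
  if (current.length > 0) chunks.push(current.join(" "));
  return chunks;
}
```

Packing whole paragraphs (instead of cutting at a fixed offset) is what keeps semantic units intact; the overlap means a sentence near a boundary is searchable from both neighboring chunks.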

Embed

GPU-accelerated via Ollama (qwen3-embedding or nomic-embed-text)

Local GPU
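
Locally, embedding a chunk is one HTTP call to Ollama. A minimal sketch of the request, assuming Ollama's default port and its /api/embeddings route (verify against your Ollama version; newer builds also expose /api/embed):

```typescript
// Builds the request for Ollama's local embeddings endpoint.
// URL and field names follow Ollama's /api/embeddings API as the
// author understands it; confirm against your installed version.
const OLLAMA_URL = "http://localhost:11434/api/embeddings";

function embedRequest(model: string, text: string) {
  return {
    url: OLLAMA_URL,
    init: {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({ model, prompt: text }),
    },
  };
}

// Usage (requires a running Ollama instance):
// const { url, init } = embedRequest("qwen3-embedding:0.6b", "Docker bridge networking");
// const { embedding } = await (await fetch(url, init)).json();
```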

Search

Hybrid FTS5 BM25 + vector cosine similarity with configurable weights

Hybrid

Four-Tier Architecture

Each tier adds scope and security. Data flows down — never up.

Public (874 chunks)
Blog posts, docs, project pages. Served from CDN via Cloudflare Pages. Powers the public AI assistant.
Embedding: OpenRouter text-embedding-3-small (1536d)

Knowledge (33K chunks)
Sanitized content safe for external AI. Runs through 148 regex patterns that strip real IPs, hostnames, and usernames.
Embedding: qwen3-embedding:0.6b (1024d)

Vaults (132K chunks)
Full Obsidian vaults with raw content. Local access only — never leaves the machine.
Embedding: nomic-embed-text (768d)

Private (166K chunks)
Everything including legal documents, personal notes, and raw transcripts. Maximum security.
Embedding: nomic-embed-text (768d)
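
The tier layout above can be captured as plain data; the field names here are hypothetical, while the chunk counts, models, and dimensions come from this page:

```typescript
// Hypothetical encoding of the four-tier layout described above.
// Field names are invented for illustration; the numbers are the
// page's own figures.
type Tier = {
  name: string;
  chunks: number;
  embedModel: string;
  dims: number;
  localOnly: boolean; // data flows down the tiers, never up
};

const tiers: Tier[] = [
  { name: "public",    chunks: 874,     embedModel: "text-embedding-3-small", dims: 1536, localOnly: false },
  { name: "knowledge", chunks: 33_000,  embedModel: "qwen3-embedding:0.6b",   dims: 1024, localOnly: true },
  { name: "vaults",    chunks: 132_000, embedModel: "nomic-embed-text",       dims: 768,  localOnly: true },
  { name: "private",   chunks: 166_000, embedModel: "nomic-embed-text",       dims: 768,  localOnly: true },
];
```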

Key Features

100% Local

All embedding runs on a local GPU. Data never leaves the machine. Zero per-query cost for local tiers.

Hybrid Search

FTS5 BM25 for keyword precision + cosine similarity for semantic understanding. Configurable 0.3/0.7 weighting.
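
A sketch of how the 0.3/0.7 blend might be computed, assuming raw keyword scores are min-max normalized onto the same scale as cosine similarity (function and parameter names are illustrative):

```typescript
// Illustrative fusion of the two signals. Assumes higher raw BM25 =
// more relevant; SQLite's bm25() actually returns lower-is-better
// (negative) values, so negate those before passing them in.
function hybridScores(
  bm25: Map<string, number>,   // chunkId -> raw keyword score
  cosine: Map<string, number>, // chunkId -> cosine similarity
  wKeyword = 0.3,
  wVector = 0.7,
): Map<string, number> {
  const vals = [...bm25.values()];
  const lo = Math.min(...vals), hi = Math.max(...vals);
  // Min-max normalize BM25 so both signals share a 0..1 scale.
  const norm = (s: number) => (hi === lo ? 1 : (s - lo) / (hi - lo));

  const fused = new Map<string, number>();
  for (const id of new Set([...bm25.keys(), ...cosine.keys()])) {
    const kw = bm25.has(id) ? norm(bm25.get(id)!) : 0;
    fused.set(id, wKeyword * kw + wVector * (cosine.get(id) ?? 0));
  }
  return fused;
}
```

Taking the union of both result sets means a chunk found by only one signal still gets ranked, just with the missing signal scored as zero.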

Multi-Format Parser

Ingests MD, PDF, DOCX, RTF, EML, MSG, XLSX via subprocess parsers. Handles corrupted files gracefully.

Content-Hash Dedup

SHA-256 hashing prevents duplicate ingestion. Re-run safely — only new or changed files are processed.
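
In miniature, the dedup check looks like this; the `seen` set stands in for the hash column the real pipeline would keep in SQLite:

```typescript
import { createHash } from "node:crypto";

// Content-hash dedup sketch: content is ingested only if its SHA-256
// digest has not been recorded before, so re-running ingestion on an
// unchanged corpus is a no-op.
const seen = new Set<string>();

function shouldIngest(content: string): boolean {
  const digest = createHash("sha256").update(content).digest("hex");
  if (seen.has(digest)) return false; // unchanged content: skip
  seen.add(digest);
  return true;
}
```

Hashing content rather than comparing paths or mtimes means a renamed-but-identical file is still skipped, while a one-character edit triggers re-ingestion.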

Identity Sanitization

148-pattern regex system strips real names, IPs, and credentials for safe external sharing.
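
A few rules in the spirit of that sanitizer, applied as an ordered find-and-replace pass (these three patterns are invented examples, not the actual 148-rule set):

```typescript
// Hypothetical sanitization rules; the real system's 148 patterns
// are not public. Order matters: IPs are redacted before the email
// rule runs, for example.
const rules: Array<[RegExp, string]> = [
  [/\b(?:\d{1,3}\.){3}\d{1,3}\b/g, "REDACTED_IP"],
  [/\b[\w.-]+@[\w.-]+\.\w{2,}\b/g, "REDACTED_EMAIL"],
  [/\bargo-[a-z0-9-]+\.internal\b/g, "REDACTED_HOST"], // invented hostname scheme
];

function sanitize(text: string): string {
  return rules.reduce((t, [pattern, repl]) => t.replace(pattern, repl), text);
}
```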

Streaming Vector Scan

Never loads all 166K chunks into memory. Streams through the database for constant memory usage at any scale.
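
The constant-memory idea can be sketched as a top-k scan over an iterator, scoring one row at a time; the real system would stream rows from SQLite rather than an in-memory array:

```typescript
// Streaming top-k sketch: memory stays O(k) regardless of corpus size
// because only the current best k candidates are retained.
function cosineSim(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

function topK(
  query: number[],
  rows: Iterable<{ id: string; vec: number[] }>, // e.g. a DB row iterator
  k = 5,
): Array<{ id: string; score: number }> {
  const best: Array<{ id: string; score: number }> = [];
  for (const row of rows) {          // one row in memory at a time
    best.push({ id: row.id, score: cosineSim(query, row.vec) });
    best.sort((x, y) => y.score - x.score);
    if (best.length > k) best.pop(); // drop the weakest candidate
  }
  return best;
}
```

With better-sqlite3, the `rows` argument could be a statement's `.iterate()` result, so chunks are pulled from disk one at a time instead of materializing all 166K.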

What It's Trained On

Blog Posts: Homelab, Docker, Linux, networking deep dives
Technical Docs: Architecture, modules, infrastructure guides
Public Docs: AI systems, playground guides, reference material
Journal Entries: Engineering diary, debugging sessions, decisions
Project Descriptions: TerraTracer, Tendril, Build Swarm, and more
Website Pages: Tour, about, resources, playground content

Performance

Public tier (874 chunks, CDN-served embeddings): <100ms
Knowledge tier (33K chunks, hybrid BM25 + vector): ~3 sec
Vaults tier (132K chunks, hybrid FTS5 + brute-force vector): ~23 sec
Private tier (166K chunks, full brute-force scan, no ANN index): ~48 sec
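
A back-of-envelope check on why the unindexed tiers take seconds rather than milliseconds: a brute-force scan does one cosine comparison per chunk, touching every vector dimension:

```typescript
// One cosine comparison per chunk; each touches every dimension once.
function multiplyAdds(chunks: number, dims: number): number {
  return chunks * dims;
}

// Private tier: 166K chunks x 768 dims ≈ 127.5M multiply-adds per
// query for the arithmetic alone, before any I/O or ranking; an ANN
// index would cut the number of comparisons rather than their cost.
```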

Quick Start

Terminal
$ git clone https://github.com/lazarusoftheshadows/argo-knowledge-rag.git
$ cd argo-knowledge-rag && npm install
$ npm run build   # Compile TypeScript
# Ingest your markdown files:
$ node dist/cli/index.js ingest ~/my-vault/ --db rag.db --collection my-notes
$ node dist/cli/index.js embed --db rag.db --model qwen3-embedding:0.6b
$ node dist/cli/index.js search "Docker bridge networking" --db rag.db

Built With

TypeScript · SQLite + FTS5 · better-sqlite3 · Ollama · qwen3-embedding · nomic-embed-text · OpenRouter · Node.js

Related