RAG Pipeline Demo

Retrieval-Augmented Generation with local embeddings

Click to load the embedding model in your browser. No API key required.

How it works

  • Model runs entirely in your browser via WebAssembly, with no server involved
  • Uses all-MiniLM-L6-v2 (32 MB) for semantic embeddings
  • Cosine similarity ranks documents by semantic relevance
  • No API keys or server-side processing required
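
The ranking step in the list above can be sketched as follows. This is a minimal illustration, not the demo's actual code: the function names `cosineSimilarity` and `rankDocuments`, the document ids, and the 3-dimensional toy vectors are all assumptions made for the example. In the real pipeline the vectors would presumably be the embeddings produced by all-MiniLM-L6-v2.

```typescript
// Cosine similarity between two equal-length vectors:
// dot(a, b) / (|a| * |b|). Returns 0 if either vector has zero length.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error("vectors must have the same length");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  const denom = Math.sqrt(normA) * Math.sqrt(normB);
  return denom === 0 ? 0 : dot / denom;
}

// Hypothetical document shape: an id plus a precomputed embedding.
interface EmbeddedDoc {
  id: string;
  embedding: number[];
}

interface ScoredDoc {
  id: string;
  score: number;
}

// Score every document against the query and sort by descending similarity.
function rankDocuments(query: number[], docs: EmbeddedDoc[]): ScoredDoc[] {
  return docs
    .map((d) => ({ id: d.id, score: cosineSimilarity(query, d.embedding) }))
    .sort((x, y) => y.score - x.score);
}

// Toy vectors standing in for real model embeddings (illustrative only).
const docs: EmbeddedDoc[] = [
  { id: "taxes", embedding: [0.0, 0.2, 0.9] },
  { id: "cats", embedding: [0.9, 0.1, 0.0] },
  { id: "dogs", embedding: [0.7, 0.3, 0.1] },
];

const ranked = rankDocuments([1.0, 0.0, 0.0], docs);
console.log(ranked.map((r) => r.id)); // ["cats", "dogs", "taxes"]
```

Because cosine similarity depends only on the angle between vectors, documents are ranked by direction in embedding space rather than by vector magnitude. The top-ranked documents would then be passed to the generation step as context.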