Open to career opportunities and collaborations. Learn more .

Khoj

A self-hostable 'second brain' platform that turns web pages and documents into a searchable knowledge base and supports custom agents and automations.

Detailed Introduction

Khoj is a self-hostable “second brain” platform that converts web pages, notes, and documents into a semantic knowledge base and enables building searchable agents and automation over private data. The project integrates semantic retrieval and Retrieval-Augmented Generation (RAG) pipelines, supports multiple LLM backends (e.g., GPT, Gemini, Llama), local/offline models, and plugin-style integrations, and includes a dashboard, CLI, and templates for rapid setup and observability.

Main Features

  • Private deployments: run in local or private networks to satisfy privacy and compliance requirements.
  • Semantic indexing & retrieval: convert heterogeneous documents into vector indexes for high-quality retrieval and multi-hop queries.
  • Multi-backend & offline model support: flexible choice between cloud LLMs and local models.
  • Automation & scheduling: build custom agents, automate tasks, and gather observations to improve agent strategies.

Use Cases

  • Enterprise knowledge bases: provide searchable knowledge for support, R&D, or legal teams within controlled environments.
  • Research & prototyping: serve as a platform for RAG and retrieval method experiments and benchmarks.
  • Personal productivity: turn notes or Obsidian vaults into a Q&A-ready knowledge base.
  • Offline & edge scenarios: perform retrieval and inference when external APIs are unavailable or undesired.

Technical Features

  • Modular architecture: decoupled retrieval, indexing, fusion, and generation modules for easy substitution.
  • Multi-language SDKs and templates: Python/TypeScript templates and example projects for quick integration.
  • Extensible storage backends: support local disk and external object storage for artifacts.
  • Open-source licensing: repository is AGPL-3.0 licensed; check licensing terms for commercial usage.
Khoj
Resource Info
📚 RAG 🤝 Assistant 🛠️ Dev Tools 🌱 Open Source