Vespa

vespa-engine · Since 2016-06-03

Overview

Vespa is an open-source, distributed search and serving engine designed for online AI and big-data workloads. It targets low-latency retrieval and inference, with support for vector search, custom scoring, and near-real-time indexing. Typical uses include semantic search, recommendation, and online model serving.

Key features

High-performance vector and text retrieval with hybrid queries.
Near-real-time indexing and low-latency query serving.
Scalable distributed architecture for production workloads.
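
As a rough illustration of the hybrid-query feature, the sketch below builds a request body for Vespa's HTTP query API that combines a text match (userQuery()) with an approximate nearest-neighbor clause. The document field name "embedding", the query tensor name "q", the rank profile name "hybrid", and the endpoint http://localhost:8080 are assumptions made for this example and are not taken from this page; they would have to match a real application schema and deployment.

```python
import json
import urllib.request


def build_hybrid_query(text, vector, hits=10, target_hits=100):
    """Build a JSON body for Vespa's /search/ endpoint (text + vector)."""
    # `embedding` (document tensor field) and `q` (query tensor) are
    # hypothetical names; both must exist in the deployed schema.
    yql = (
        "select * from sources * where userQuery() or "
        f"({{targetHits:{target_hits}}}nearestNeighbor(embedding, q))"
    )
    return {
        "yql": yql,
        "query": text,                   # consumed by userQuery()
        "input.query(q)": list(vector),  # query-side vector
        "ranking": "hybrid",             # hypothetical rank profile name
        "hits": hits,
    }


def run_query(body, endpoint="http://localhost:8080"):
    """POST the query. Needs a running Vespa instance; the endpoint is assumed."""
    req = urllib.request.Request(
        endpoint + "/search/",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


# Building the body needs no server; run_query(body) would send it.
body = build_hybrid_query("vector databases", [0.1, 0.2, 0.3], hits=5)
print(json.dumps(body, indent=2))
```

Keeping the body construction separate from the network call makes the query easy to inspect and test before pointing it at a live cluster.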

Use cases

Retrieval layer for RAG systems and semantic search.
Recommendation and personalized online services.
Low-latency online inference and model serving.
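
For the RAG use case, one plausible pattern is to turn the top hits of a query response into a context string for a language model prompt. The sketch below assumes Vespa's default JSON result layout (hits under root.children, document fields under fields); the field name "text" and the sample response are made up for illustration.

```python
def hits_to_context(response, text_field="text", max_hits=3):
    """Join the top hits of a Vespa query response into one context string."""
    hits = response.get("root", {}).get("children", [])
    passages = []
    for hit in hits[:max_hits]:
        # Each hit carries its document fields under "fields";
        # `text_field` is a hypothetical field name.
        passage = hit.get("fields", {}).get(text_field)
        if passage:
            passages.append(f"[{len(passages) + 1}] {passage}")
    return "\n".join(passages)


# Made-up response in the default result layout, for illustration only.
sample = {
    "root": {
        "children": [
            {"id": "id:docs:doc::1", "relevance": 0.92,
             "fields": {"text": "Vespa supports hybrid queries."}},
            {"id": "id:docs:doc::2", "relevance": 0.80,
             "fields": {"text": "Documents are searchable shortly after feeding."}},
        ]
    }
}
context = hits_to_context(sample)
print(context)
```

Numbering the passages lets a downstream prompt ask the model to cite which retrieved passage supports each statement.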

License

Apache-2.0, a permissive license suitable for enterprise use and open-source contributions.
