Vespa

Name: Vespa
Price: 0.1 USD
Availability: InStock

AI-powered search engine and vector database

paid· from $0.10/mo· for enterprise· verified 29 days ago

Vespa runs vector, lexical, and structured search in a single engine, with machine-learned ranking applied at query time across billions of items. Ecommerce platform and search teams use it to power product discovery, personalized recommendations, and semantic retrieval at catalog scales where Elasticsearch or standalone vector databases hit latency ceilings. The buyer is typically a CTO or search engineering lead replacing a stitched-together stack with one system handling retrieval, ranking, and serving.

> pick this if

Pick this if you're a search or platform engineering leader running catalogs in the tens of millions to billions of items and you want vector, lexical, structured filters, and ML ranking executed in one engine rather than stitched across Elasticsearch, a vector DB, and a re-ranker.

> look elsewhere if

Look elsewhere if you're a mid-market merchant under ~$50M GMV without a dedicated search engineering team — Algolia, Typesense, or Shopify-native search will ship faster with a fraction of the operational burden.

> Vespa is used by

Metal AI
Clarm
Perplexity
Spotify
Elicit
Yahoo
Onyx
Mimeta – Civsy
Qwant
Vinted
RavenPack Bigdata.com

> Vespa is built for

platform-agnostic

> what it does for ecommerce

Combines vector, lexical, and structured filters in one query pass
Runs ML ranking models inline during retrieval, not post-hoc
Co-locates embeddings with metadata to avoid cross-system joins
Scales to billions of documents with sub-100ms tail latency
Available as managed Vespa Cloud or self-hosted open source

> how you'd use it

Marketplace or large catalog retailer, $200M+ GMV, 8–20 person search/platform engineering team
Replacing a stack of Elasticsearch for lexical plus a separate vector DB (Pinecone/Weaviate) for semantic retrieval, where re-ranking happens in a downstream service and tail latency exceeds 300ms on 50M+ SKUs
→ Single-pass hybrid retrieval with inline learned ranking, sub-100ms p99 on billions of documents, and one system to operate instead of three
DTC brand group or retailer with long-tail catalog, $50M–$500M GMV, dedicated ML/search team of 4–10
Serving personalized product discovery where embeddings, business rules (margin, stock, boost flags), and user features all need to influence ranking at query time
→ ML ranking models (GBDT, neural) execute during retrieval with user and item features co-located, lifting conversion on search and PLPs without a separate re-ranking microservice
Commerce platform or headless SaaS vendor serving multi-tenant merchants, enterprise scale
Building semantic site search and recommendations as a product feature across thousands of merchant catalogs with strict latency SLAs
→ Vespa Cloud handles tenant isolation, real-time indexing, and mixed vector/lexical/structured queries; engineering avoids building a bespoke retrieval platform

> Vespa use cases

> Vespa key features

Real-time machine-learned model inference
Supports vector search and lexical search
High scalability and performance
Open-source cloud service
Co-location of vectors and metadata

> Vespa pricing

verified 29 days ago

Startup
Plan for testing and getting your business started with restricted features
- Restricted features
- Community support only, no SLA
- Runs on shared resources
- No SSO, no autoscaling
- No redundancy by default
- No CI/CD pipeline
- No testing before Vespa upgrades
- Dev zones only
- Approx. 15 mins/day downtime during upgrades
Low fixed unit cost
Basic
Suitable for applications that don't need 24/7 operational support
- Prices go down with volume
- Production support response time: Next business day
- Deployment support response time: Next business day
- Other support response time: Next 2 business days
Commercial
Suitable for production applications with 24/7 operational support
- Prices go down with volume
- Production support response time: 1 hour 24/7
- Deployment support response time: Next business day
- Other support response time: Next 2 business days
Enterprise
Suitable for enterprises with 24/7 deployment support
- Prices go down with volume
- Production support response time: 15 minutes 24/7
- Deployment support response time: 1 hour 24/7
- Other support response time: Next business day
- Single sign-on (SSO)
- Named support representative
- Tune-up program participation
- Dedicated Slack channel
- On-site visits
Minimum monthly spend of $20,000
Self Managed
Self Managed Vespa deployment including support
- Unlimited support cases
Contact Sales

Prices vary based on resources allocated and plans chosen. Each support level sets a unit price for machine resources.

> compliance & trust

GDPR

> how vespa compares

bidirectional editorial

> alternatives to Vespa in our index

by shared use-case