vector-databases.html

Pick your vector database — 12 options across the four categories that decide most picks

Your vector store is the decision that compounds across every RAG retrieval, every semantic-search query, and every embedding pipeline you ship. The directory filters by embedded / managed-SaaS / self-hosted / multi-model, plus engine, pricing, and the hybrid-search and edge-ready flags that decide most picks. Every entry gives you a one-line summary, a concrete best-for, an honest skip-this-if, and a paragraph of opinion.

10 SIDE-BY-SIDE COMPARISONS → TOP-5 DECISION HUB →

Filter the list

Category
Engine
Pricing
Sort

Showing 12 of 12

Milvus

Distributed vector DB built for billion-scale workloads. Heavy, capable, China-AI-aligned.

  • CatSelf-hosted
  • EngineGo / C++
  • Stars32.6k
  • PricingFreemium
Hybrid search Multi-tenant
Read the take →

Qdrant

Rust-fast open-source vector engine. Cleaner API than Weaviate, smaller footprint.

  • CatSelf-hosted
  • EngineRust
  • Stars28.4k
  • PricingFreemium
Hybrid search Multi-tenant
Read the take →

Chroma

Embedded vector database for AI apps. Runs in-process like SQLite, prototype-first.

  • CatEmbedded
  • EnginePython / Rust core
  • Stars22.4k
  • PricingOpen source
Read the take →

Weaviate

Open-source self-hostable vector database with hybrid search and module ecosystem.

  • CatSelf-hosted
  • EngineGo
  • Stars14.6k
  • PricingFreemium
Hybrid search Multi-tenant
Read the take →

pgvector

Vector search inside Postgres. The default for teams already on Postgres or Supabase.

  • CatEmbedded
  • EngineC / Postgres extension
  • Stars13.5k
  • PricingOpen source
Hybrid search Multi-tenant
Read the take →

LanceDB

Embedded multimodal vector + tabular database. Object-store-backed, Rust-fast.

  • CatEmbedded
  • EngineRust
  • Stars11.8k
  • PricingFreemium
Hybrid search Edge-ready Multi-tenant
Read the take →

Vespa

mature

Yahoo-built distributed search + vector engine. Hybrid retrieval at extreme scale.

  • CatSelf-hosted
  • EngineJava / C++
  • Stars6.4k
  • PricingFreemium
Hybrid search Multi-tenant
Read the take →

Marqo

End-to-end vector search engine with built-in embedders. Multimodal, model-aware.

  • CatMulti-model
  • EnginePython
  • Stars4.8k
  • PricingFreemium
Hybrid search
Read the take →

Pinecone

The original managed vector database. Polished SDK, predictable latency, expensive at scale.

  • CatManaged SaaS
  • EngineHosted
  • Starsclosed
  • PricingFreemium
Hybrid search Multi-tenant
Read the take →

Turbopuffer

Object-store-backed serverless vector DB. Pay-per-query, cheap at idle, fast at scale.

  • CatManaged SaaS
  • EngineRust
  • Starsclosed
  • PricingPaid
Hybrid search Edge-ready Multi-tenant
Read the take →

MongoDB Atlas Vector Search

Vector search inside MongoDB Atlas. Document data + embeddings in one query.

  • CatMulti-model
  • EngineMongoDB
  • Starsclosed
  • PricingFreemium
Hybrid search Multi-tenant
Read the take →

Astra DB Vector

Cassandra-based managed vector store from DataStax. Wide-column + vector hybrid.

  • CatMulti-model
  • EngineCassandra
  • Starsclosed
  • PricingFreemium
Hybrid search Multi-tenant
Read the take →

The vector-store choice is the easy half — your retrieval design is the hard one

Picking the vector database is the easy half. The hard half is your chunking strategy, your hybrid-retrieval recipe, your reranking pipeline, and the eval loop that tells you whether your RAG is getting better or worse week to week. The 30-min call is the right starting place — describe your corpus, your latency budget, your scale; I tell you what fits.