Vector Database

A specialised database that stores embeddings and efficiently finds nearest neighbours.

What is Vector Database?

A vector database is where you store and search embeddings. The key operation is "find the K most similar vectors to this query vector". Doing this efficiently across millions or billions of embeddings is computationally hard — vector databases use specialised indexes (HNSW, IVF) to make it fast.

Popular vector databases in 2026: **Pinecone** (managed, easy), **Weaviate** (open-source), **pgvector** (Postgres extension — popular when you already have Postgres), **Qdrant**, **Milvus**, **Chroma** (lightweight).

For most Indian startups, pgvector is the pragmatic choice — it adds vector search to your existing Postgres without operating a separate service. Pinecone is the easiest managed option. The other names matter mostly at very large scale.

Why this matters

Vector databases are core RAG infrastructure. The choice affects cost, latency, and operational complexity.

Real-world example (India)

A Chennai-based EdTech indexed 100,000 educational PDFs into pgvector, costing only the Postgres they were already running. Their semantic-search-over-textbooks feature drove a 40% increase in study-time per student.

Related terms

Want to master this?

Learn Vector Database in a structured cohort

3-month live program with mentors, real projects, and 50+ partner placement support.

View the program →

← All glossary termsLLM & Gen AI