Waymouth Tech
HomeServicesProductsBlogAboutContact
Book a call
Waymouth Tech

AI implementation consulting and indie software, built and shipped from Melbourne, Australia.

Melbourne, Victoria, Australia
hello@waymouthtech.com

Services

  • AI Implementation
  • AI Enablement
  • AI Education
  • IT Services

Company

  • About
  • Products
  • Blog
  • Contact

Popular reads

  • AI consulting in Melbourne
  • AI implementation roadmap
  • AI enablement for teams
  • Australian Privacy Act & AI

© 2026 Waymouth Tech. All rights reserved.

Based in Melbourne, Victoria, Australia

AI Tools, How-tos & Comparisons

Vector Databases Explained for Business in 2026

Vector databases explained for business — what they are, when you need one, how to pick between the major options, and what they actually cost.

By Yash Shelatkar·21 May 2026·6 min read
Server infrastructure representing a vector database storing embeddings

Vector databases are one of those infrastructure topics that have moved from "AI research curiosity" to "thing your CTO needs to understand" surprisingly quickly. In 2026, any business building custom AI that reasons over its own content is making a vector database decision — whether they realise it or not. This guide explains vector databases for business decision-makers, walks through the main options, and gives you a framework for choosing.

What a vector database actually does

A vector database stores embeddings. An embedding is a numerical representation of a piece of content — usually a list of 1,000 to 3,000 numbers — generated by an embedding model. Two pieces of content with similar meaning have similar embeddings.

That property is the whole point. It lets you search by meaning instead of by keyword. Ask "how do we onboard new tenants in NSW" and the database can return documents about "tenant setup process for New South Wales" even though no exact word matches.

A vector database is, in essence, a database optimised for one operation: "find me the N items closest to this query vector". It does that fast, at scale, and with the metadata filtering and operational tooling that production systems need.

When you actually need one

You do not need a vector database to use ChatGPT, Claude, Microsoft Copilot, or Notion AI. Those tools have their own retrieval layers built in. See our pillar on choosing AI tools for business — most businesses get a long way without ever touching a vector database directly.

You do need a vector database when you build a custom AI system over your own content. Typical triggers:

  • Building an internal RAG system.
  • Building a customer-facing AI search or assistant.
  • Building a code or document assistant grounded in your codebase.
  • Building any AI agent that needs long-term memory.

If none of those describe you, skip this article. If they do, the rest of this is for you.

The major options in 2026

The vector database market has consolidated somewhat. The serious options for business use are:

Pinecone

The most popular dedicated vector database. Fully managed, easy to start, strong performance at scale. Expensive once you grow — pricing is based on storage, indexing, and query volume. Best for teams that want a managed service and do not mind paying for it.

Weaviate

Open-source, with a managed cloud option. Strong hybrid search (vector + keyword) out of the box. Good Australia-region support. Slightly steeper learning curve than Pinecone but more capable.

Qdrant

Open-source, performant, written in Rust. Increasingly popular for self-hosting. Clean API, good documentation, supports advanced filtering. A favourite for technical teams.

pgvector (Postgres extension)

The "right answer" for many mid-market businesses. Adds vector support to Postgres, which most teams are already running. Performance has improved dramatically with HNSW indexes since 2023. Hard to beat for cost and operational simplicity if you already run Postgres.

Azure AI Search and AWS OpenSearch

The cloud-native options. If you are already deep in Azure or AWS, these are pragmatic choices — vector search is built into search products you may already be paying for. Less specialised but well-supported by your cloud provider.

Other notable mentions

  • Milvus / Zilliz — strong at very large scale, popular in research and high-volume use.
  • Chroma — popular for prototyping; less common in production.
  • MongoDB Atlas Vector Search — natural fit if you already run MongoDB.

How to choose

A simple framework. Walk through these questions in order.

1. Are you already running Postgres or a major cloud database?

If yes, start with pgvector or your cloud's native vector option. The operational simplicity is enormous. You do not need a separate database for embeddings unless you have a clear reason.

2. What is your scale?

  • Under 1 million vectors. Any option works. Optimise for operational simplicity, not benchmark performance.
  • 1 to 100 million vectors. Most options still work. pgvector with proper indexing, Qdrant, Weaviate, and Pinecone all handle this range comfortably.
  • Over 100 million vectors. You start to care about specialised performance. Pinecone, Milvus, and Qdrant pull ahead. pgvector is workable but needs more careful tuning.

3. Do you need hybrid search?

Hybrid search combines vector similarity with traditional keyword search. It consistently improves retrieval quality. Weaviate, Qdrant, Azure AI Search and OpenSearch have strong native hybrid support. Pinecone and pgvector require more orchestration.

4. What are your data residency requirements?

For Australian businesses with strict data residency:

  • Self-hosted (Qdrant, Weaviate, pgvector) on AU-region cloud infrastructure is the cleanest option.
  • Pinecone and managed services typically offer region selection — verify and confirm in writing.
  • Cloud-native (Azure AI Search, AWS OpenSearch) inherits your existing cloud region.

5. Build versus operate?

If you have a strong platform team that can operate stateful infrastructure, self-hosted Qdrant or Weaviate is excellent value. If you do not, a managed service is worth the premium. Operating a poorly tuned vector database in production is its own special kind of pain.

Cost reality

A rough sketch of monthly costs for a mid-sized business RAG with around 1 million vectors and moderate query volume:

  • pgvector on existing Postgres. Near-zero marginal cost; perhaps AUD 50–200 in incremental compute.
  • Self-hosted Qdrant or Weaviate. AUD 100–500 per month for infrastructure depending on availability requirements.
  • Pinecone Standard. AUD 500–2,500 per month depending on storage and query volume.
  • Weaviate Cloud / Qdrant Cloud. AUD 200–1,500 per month depending on tier.
  • Azure AI Search. Bundled with broader Azure spend; standalone tiers AUD 250–2,000+ per month.

These numbers move around. Run the actual cost calculator for your real workload before committing.

Common mistakes

A few mistakes we see often:

Over-engineering the choice

Most teams agonise over vector database selection then discover that retrieval quality is dominated by chunking strategy, embedding model choice, and re-ranking. The database matters less than you think. Pick a reasonable option and move on.

Ignoring metadata and filtering

Real retrieval almost always involves metadata filters — by date, source, user permissions, document type. Pick a database that handles your filtering needs well. Pure similarity search is the textbook example; metadata-filtered search is the actual product.

Forgetting permissions

Vector databases are mostly designed for content discovery, not access control. You need a separate layer that filters retrieved chunks by what the asking user can see. This is the single most common source of "the AI showed me something I should not have seen" incidents.

Locking in too early

Abstracting your retrieval layer so you can swap vector databases later costs little and buys real flexibility. Do not write Pinecone-specific code throughout your application.

Where this sits in the stack

The vector database is just one component of a wider retrieval stack — embeddings, chunking, retrieval, re-ranking, generation. The whole stack matters more than any one piece. For a fuller picture, our building internal RAG systems overview covers the end-to-end architecture, and the LLM API cost management guide covers the inference economics.

What to do next

If you already run Postgres and your scale is under 10 million vectors, install pgvector and start there. Otherwise, pick between Qdrant (self-host), Weaviate (hybrid search) or Pinecone (managed) based on your team's preferences. The choice is reversible — the workflow you build on top matters more.

Talk to a Melbourne AI consultant about choosing and operating a vector database for your business.
Book a discovery call →

FAQ

Frequently asked questions.

What is a vector database in plain English?

A vector database stores numerical representations (embeddings) of your content so you can search by meaning, not just keywords. It is what lets an AI find the right document among thousands when answering a question.

Do I need a vector database to use AI in my business?

No, not for general use of ChatGPT, Claude or Copilot. You need one when you build a custom AI system that searches over your own content — typically as part of a RAG implementation.

Is Postgres pgvector good enough for production?

For most mid-market use cases, yes. pgvector has matured significantly in 2024–2026 and handles millions of vectors well. Specialised vector databases pull ahead at very high scale or when you need exotic features.

How much does a vector database cost?

Managed vector databases typically run AUD 100–3,000 per month for mid-sized business use. Self-hosted options on existing infrastructure can be near-zero marginal cost if you are already running Postgres.

Can I switch vector databases later?

Yes, but expect rework. Embeddings can usually be reused if you stay on the same embedding model. Re-ingestion takes a day to weeks depending on volume. Avoid lock-in by abstracting your retrieval layer.

Waymouth Tech · Melbourne, Australia

Want this implemented in your business?

We’re a Melbourne-based AI implementation consultancy. We scope, build and ship production AI for Australian organisations — typically 8–14 weeks from kickoff to live, billed by scope so you know what you’ll pay before we start.

  • AI Implementation, Enablement & Education
  • IT services & integrations
  • Engineering team that ships real products
  • Australian Privacy Act & AU-region cloud
Book a free 30-min discovery callSee all services

Or email hello@waymouthtech.com — usually back within 24 hours.

Continue reading

More from the archive.

Abstract circuit pattern in gradient colours representing the AI tooling landscapePillar guide
AI Tools, How-tos & Comparisons

Choosing AI Tools for Business: A Decision Framework for 2026

A practical decision framework for choosing AI tools for business in 2026 — covering selection criteria, build vs buy, and a tooling shortlist.

21 May 2026·6 min read
A server rack representing the infrastructure layer of an internal RAG system
AI Tools, How-tos & Comparisons

Building Internal RAG Systems: A Practical Overview for 2026

An overview of building internal RAG systems for business — architecture, tooling, costs, and the decisions that make or break a production RAG deployment.

21 May 2026·6 min read
A Melbourne operations workspace with coffee, a notebook and Notion open on a laptop
AI Tools, How-tos & Comparisons

Notion AI for Operations Teams: What It Actually Does Well

A practical look at Notion AI for operations teams — what it does well, where it falls short, and how to roll it out without creating workspace chaos.

21 May 2026·5 min read