Building a multi-tenant agent runtime, MyChatBot Blog

Why multi-tenant

Every customer running their own dedicated infrastructure is wasteful. Most agents are idle most of the time. Multi-tenant means dramatically lower cost per customer, and the savings flow to pricing.

But multi-tenant comes with hard problems: data isolation, fair scheduling, cost attribution. Get any of them wrong and you have a customer fire.

Data isolation

We use schema-per-tenant in Postgres for all customer data. The query layer enforces tenant scoping at the connection level, there's no API path that can read another tenant's data, full stop.

Vector stores use namespaces per tenant. LLM context never crosses tenants. Agent memories are tenant-scoped at storage time.

Fair scheduling

The noisy neighbor problem: one customer running a huge campaign starves everyone else's inference budget. We use weighted fair queuing on inference, with weights tied to plan tier and recent usage.

Bursts are absorbed by spillover capacity that costs slightly more, billed to the burst customer, not the platform.

#architecture#engineering

Yaroslav Demir

Principal Engineer

Owns platform reliability. 10+ years building high-throughput systems. Will defend Go in any thread.

Building a multi-tenant agent runtime

Why multi-tenant

Data isolation

Fair scheduling

Try MyChatBot for free

More from Engineering

How we cut voice latency under 300ms

Lessons from running 50M+ messages a month

Voice Agent v2: 3× faster, 40% cheaper, in 14 languages

Save your agent to continue