ARIA — Ten checks between a question and its answer

The pipeline

No answer leaves without passing through all ten.

Most chatbots call a model and hope. ARIA treats every answer like a deploy: gated, validated, logged. These ten steps run on every single query — this is the actual execution order in production.

Before the model

01redact_piiEmails, phone numbers, and account identifiers are scrubbed from the query before anything else sees it.✓ PASS

02policy_precheckConfigurable rules evaluate the raw input. Rules can block, redact, or flag for approval before a token is spent.✓ PASS

03kb_searchVector search over the org's isolated knowledge base, with query expansion and a keyword fallback.✓ PASS

04rerankRetrieved chunks are re-scored with a composite confidence: term frequency, exact-phrase bonus, title match.✓ PASS

05route_modelA complexity classifier sends simple queries to a fast model and hard ones to a stronger one — speed and cost without sacrificing the difficult cases.✓ PASS

The model

06generateThe model produces a structured response: agent guidance, customer-facing reply, step-by-step hints, citations.✓ PASS

After the model

07schema_validateOutput is validated against a strict schema. Malformed responses are auto-repaired or rejected — never shipped raw.✓ PASS

08policy_postcheckThe generated answer is checked again: required citations present, no blocked content, no policy violations.✓ PASS

09approval_gateSensitive responses queue for human review before delivery. Everything else auto-approves and ships.⏸ GATED

10trace_logLatency, token cost, model choice, policy events, and the full run — written to the audit trail. Every answer is reconstructable.✓ PASS

Architecture

Eight containers. One machine. Real production.

No managed AI platform, no glue services. The entire system runs as a containerized stack on a single small cloud server — deliberately simple, fully owned, end to end.

edge routerEDGE

TLS termination + dual-domain routing. aria-bot.com serves this page; app.aria-bot.com proxies the platform.

api gatewayCORE

The agent gateway. Auth → tenant resolution → the 10-step pipeline → structured response. Per-org API keys and rate limits enforced before any data access.

admin consoleUI

Chat, KB upload, approvals, policies, analytics, audit log — one operator surface.

ingestion workersASYNC

Knowledge intake as a background pipeline: fetch → extract → chunk → embed → store. Plus gap scans and eval runs.

schedulerCRON

Recurring jobs — quota resets, log archiving, nightly KB health scans.

queue + cacheQUEUE

Job broker, result backend, and session cache.

unified data layerDATA

One database for everything: orgs, knowledge chunks with embeddings, agent runs, policy rules, approvals, evals, audit trail. Vector search and relational queries in one place — no second system to keep in sync.

llm layerLLM

Multiple models behind a provider-agnostic client — swappable via config, never hardcoded.

Versioned schema migrations · every architectural choice favors one thing: a system one person can fully understand and operate.

Capabilities

Enterprise features, actually implemented.

Not a roadmap — these are live in the demo right now.

LIVEMulti-tenant isolation

Every org gets its own knowledge base, policies, prompts, branding, and API keys. Bell's data can never leak into Shopify's answers — isolation is enforced at the auth layer, before any data access.

org_id scoping on every table · per-org rate limits

LIVECitations on every answer

Each response links back to the exact source chunks it drew from. If the KB can't support an answer, ARIA says so instead of inventing one.

require_citations enforceable as a policy rule

LIVECost-aware model routing

A complexity classifier scores every query and routes it — simple lookups go to a model 10–20× cheaper, hard reasoning goes to the strong one. Automatically, per query.

fast ↔ strong tiers · cost logged per run

LIVEPolicy engine + approval queue

Configurable rules run before and after the model: block, redact, require citations, or hold for human sign-off. Built for answers that carry real consequences.

pre + post phases · every match logged as a policy event

LIVEKnows what it doesn't know

A background analyzer scores each knowledge base for gaps, contradictions, and unanswerable questions — surfacing coverage problems before users hit them.

nightly scans · per-org gap reports

LIVEEvals, A/B tests, audit trail

Regression suites for answer quality, prompt variants tested against each other, and a full audit log of every run, decision, and dollar spent.

eval suites · variant assignment per run

ARIA — BELL ARIA — SHOPIFY ARIA — AWS ARIA — ROGERS ARIA — SEPHORA + your org, in ~30 min

Demo tenants run on pre-built knowledge base templates — switch between them live in the app.

Build log

From RAG prototype to production platform.

Built in deliberate phases — each one shipped before the next began.

Phase 1–2 · shipped

The core: retrieval that can defend its answers

RAG pipeline with vector search, reranking, and confidence scoring. The policy engine and ops console landed here — guardrails were never an afterthought.

Phase 3 + 3.5 · shipped

Multi-tenancy, end to end

File and URL ingestion through background workers, then full org-agnostic tenancy: per-org identity, industry prompt templates, isolated KBs, manual scrape triggers. Rebranded Shoppy Bot → ARIA.

Phase 4 · shipped

Intelligence layer

Complexity-based cost routing, escalation detection, per-session agent memory, thumbs-up/down feedback loop, and A/B prompt testing.

Phase 5 · shipped

Enterprise hardening

Audit logs, rate limiting with quota management, analytics dashboard, evals runner, and admin tooling for orgs, quotas, and variants.

Deployed · live now

Production on owned infrastructure

Containerized stack on a single cloud server, dual-domain routing, TLS everywhere — aria-bot.com and app.aria-bot.com. Total infra cost: about a lunch per month.

SSO, Chrome extension, deeper intelligence

Google/GitHub auth, a Chrome extension that brings ARIA into any tab, and the next round of pipeline intelligence.

See it run.

The demo is the real system — same pipeline, same database, same guardrails you just read about. Switch orgs, ask questions, watch the citations come back.

Open live demo ↗ Contact ↗

Follow the build

ARIA is in active development. Drop your email for early access and build updates — no spam, just milestones.

✓ You're on the list — talk soon.

● 14 people on the list