Pipeline Architecture Capabilities Build log Open live demo ↗
A production RAG platform · built solo · live now

Ten checks between a question and its answer.

ARIA turns an organization's knowledge into cited, policy-checked, structured answers — and logs the full trace of how every one was made. Multi-tenant. Auditable. Running in production on real infrastructure.

Open live demo ↗ Read the pipeline ↓
agent_run #a91f4c
EXECUTING
Query "How do I set up Shopify Payments?"org: SHOPIFY
Structured response · auto-approved

model total cost citations policy events

6 real scenarios — including a hard block, a KB gap, and a human-approval hold · ↺ Replay cycles the next query · ask your own in the live app ↗

10/10
checks per answer
8
containers, one box
5+
isolated tenant orgs
1
engineer, end to end
The pipeline

No answer leaves without passing through all ten.

Most chatbots call a model and hope. ARIA treats every answer like a deploy: gated, validated, logged. These ten steps run on every single query — this is the actual execution order in production.

Before the model
01redact_piiEmails, phone numbers, and account identifiers are scrubbed from the query before anything else sees it.✓ PASS
02policy_precheckConfigurable rules evaluate the raw input. Rules can block, redact, or flag for approval before a token is spent.✓ PASS
03kb_searchVector search over the org's isolated knowledge base, with query expansion and a keyword fallback.✓ PASS
04rerankRetrieved chunks are re-scored with a composite confidence: term frequency, exact-phrase bonus, title match.✓ PASS
05route_modelA complexity classifier sends simple queries to a fast model and hard ones to a stronger one — speed and cost without sacrificing the difficult cases.✓ PASS
The model
06generateThe model produces a structured response: agent guidance, customer-facing reply, step-by-step hints, citations.✓ PASS
After the model
07schema_validateOutput is validated against a strict schema. Malformed responses are auto-repaired or rejected — never shipped raw.✓ PASS
08policy_postcheckThe generated answer is checked again: required citations present, no blocked content, no policy violations.✓ PASS
09approval_gateSensitive responses queue for human review before delivery. Everything else auto-approves and ships.⏸ GATED
10trace_logLatency, token cost, model choice, policy events, and the full run — written to the audit trail. Every answer is reconstructable.✓ PASS
Architecture

Eight containers. One machine. Real production.

No managed AI platform, no glue services. The entire system runs as a containerized stack on a single small cloud server — deliberately simple, fully owned, end to end.

edge routerEDGE
TLS termination + dual-domain routing. aria-bot.com serves this page; app.aria-bot.com proxies the platform.
api gatewayCORE
The agent gateway. Auth → tenant resolution → the 10-step pipeline → structured response. Per-org API keys and rate limits enforced before any data access.
admin consoleUI
Chat, KB upload, approvals, policies, analytics, audit log — one operator surface.
ingestion workersASYNC
Knowledge intake as a background pipeline: fetch → extract → chunk → embed → store. Plus gap scans and eval runs.
schedulerCRON
Recurring jobs — quota resets, log archiving, nightly KB health scans.
queue + cacheQUEUE
Job broker, result backend, and session cache.
unified data layerDATA
One database for everything: orgs, knowledge chunks with embeddings, agent runs, policy rules, approvals, evals, audit trail. Vector search and relational queries in one place — no second system to keep in sync.
llm layerLLM
Multiple models behind a provider-agnostic client — swappable via config, never hardcoded.

Versioned schema migrations · every architectural choice favors one thing: a system one person can fully understand and operate.

Capabilities

Enterprise features, actually implemented.

Not a roadmap — these are live in the demo right now.

LIVEMulti-tenant isolation

Every org gets its own knowledge base, policies, prompts, branding, and API keys. Bell's data can never leak into Shopify's answers — isolation is enforced at the auth layer, before any data access.

org_id scoping on every table · per-org rate limits
LIVECitations on every answer

Each response links back to the exact source chunks it drew from. If the KB can't support an answer, ARIA says so instead of inventing one.

require_citations enforceable as a policy rule
LIVECost-aware model routing

A complexity classifier scores every query and routes it — simple lookups go to a model 10–20× cheaper, hard reasoning goes to the strong one. Automatically, per query.

fast ↔ strong tiers · cost logged per run
LIVEPolicy engine + approval queue

Configurable rules run before and after the model: block, redact, require citations, or hold for human sign-off. Built for answers that carry real consequences.

pre + post phases · every match logged as a policy event
LIVEKnows what it doesn't know

A background analyzer scores each knowledge base for gaps, contradictions, and unanswerable questions — surfacing coverage problems before users hit them.

nightly scans · per-org gap reports
LIVEEvals, A/B tests, audit trail

Regression suites for answer quality, prompt variants tested against each other, and a full audit log of every run, decision, and dollar spent.

eval suites · variant assignment per run
ARIA — BELL ARIA — SHOPIFY ARIA — AWS ARIA — ROGERS ARIA — SEPHORA + your org, in ~30 min

Demo tenants run on pre-built knowledge base templates — switch between them live in the app.

Build log

From RAG prototype to production platform.

Built in deliberate phases — each one shipped before the next began.

Phase 1–2 · shipped

The core: retrieval that can defend its answers

RAG pipeline with vector search, reranking, and confidence scoring. The policy engine and ops console landed here — guardrails were never an afterthought.

Phase 3 + 3.5 · shipped

Multi-tenancy, end to end

File and URL ingestion through background workers, then full org-agnostic tenancy: per-org identity, industry prompt templates, isolated KBs, manual scrape triggers. Rebranded Shoppy Bot → ARIA.

Phase 4 · shipped

Intelligence layer

Complexity-based cost routing, escalation detection, per-session agent memory, thumbs-up/down feedback loop, and A/B prompt testing.

Phase 5 · shipped

Enterprise hardening

Audit logs, rate limiting with quota management, analytics dashboard, evals runner, and admin tooling for orgs, quotas, and variants.

Deployed · live now

Production on owned infrastructure

Containerized stack on a single cloud server, dual-domain routing, TLS everywhere — aria-bot.com and app.aria-bot.com. Total infra cost: about a lunch per month.

Next

SSO, Chrome extension, deeper intelligence

Google/GitHub auth, a Chrome extension that brings ARIA into any tab, and the next round of pipeline intelligence.

See it run.

The demo is the real system — same pipeline, same database, same guardrails you just read about. Switch orgs, ask questions, watch the citations come back.

Follow the build
ARIA is in active development. Drop your email for early access and build updates — no spam, just milestones.
✓ You're on the list — talk soon.
14 people on the list