
Cluster Systems / ClusterPOS
Production OpenWebUI Retrieval-Augmented Generation chatbot on Amazon Bedrock for a European SaaS retail point-of-sale platform.
The customer
ClusterPOS is a European SaaS retail point-of-sale platform serving small-to-medium retail businesses. Their support workflow ran through a Zoho-hosted knowledge base — a static documentation tree with no conversational, grounded assistance — which meant linear support-headcount growth as the customer base expanded.
The challenge
ClusterPOS needed to deflect repeat support questions, accelerate time-to-resolution, and stay competitive against retail-POS rivals increasingly shipping AI-assisted in-product help. The team had no prior Generative AI capability in production and needed an AWS partner to take them from zero to a production chatbot within a sub-three-week build window.
The solution
VeUP designed and delivered a production OpenWebUI chatbot grounded on the customer's Zoho knowledge base, with Amazon Bedrock as the inference plane and Amazon Nova Pro as the production primary foundation model. The signed SOW preserved alternate-model-toggleable Bedrock integration so the customer could expand the model set post-launch without redeploying. Live POS data is reached through OpenWebUI tool/function-calling, letting end-users ask grounded questions over both static documentation and live operational data.
The production deployment runs in the customer's AWS Canada (Central) region (ca-central-1) across a Multi-AZ VPC, with Application Load Balancer + ACM TLS at ingress, ECS Fargate hosting the OpenWebUI and VeUP-forked Bedrock Access Gateway, and Amazon RDS for PostgreSQL with pgvector for application state plus the RAG index. Production cutover was confirmed 2025-11-25; a POS integration extension shipped to production 2026-05-13.

Production outcomes
| KPI | Result |
|---|---|
| RAG retrieval latency | Sub-800 ms across a 600-document staging corpus; end-to-end response within 1–2 seconds — well inside the SOW SLA (p50 ≤ 5s, p95 ≤ 10s). |
| Production AWS run-rate | Amazon ECS Fargate ~$23/day after right-sizing (~$700/month, ~$8,400/year) sourced from AWS Cost Explorer / CUR — plus variable Amazon Bedrock token spend. |
| Customer AWS adoption signal | Customer-attested doubling of total AWS cloud bill post go-live (Fozail Ahmad, CTO, Final Handover 2025-11-25). |
| Continued engagement | Three downstream customer-initiated VeUP SA requests post-handover (RDS MySQL 8 / DDoS / WAF; IAM security review; NLB→ALB + WAF). Downstream Ignite managed-services attach Closed-Won 2025-12-09. |
In the customer's words
“Based on everything we agreed to set out and achieve, I think we have done it. In fact, we have gone even above and beyond that with the semantic search layer which I'm really excited about.”
“About 200 dollars a month right now... the 200 dollars is all the base infrastructure. And that gateway, this, that everything that's running.”
Amazon Bedrock (Nova Pro primary, alternate-model-toggleable) · AWS Fargate (OpenWebUI + VeUP-forked Bedrock Access Gateway) · Amazon RDS for PostgreSQL with pgvector · Amazon EFS · Application Load Balancer · AWS Certificate Manager · AWS Secrets Manager · AWS IAM · AWS KMS · Amazon CloudWatch · AWS Cost Explorer. External integrations: Zoho KB ingestion · Google Workspace SSO (OIDC/SAML) · ClusterPOS POS API (tool/function-call).