FIG 4.0 · HOW WE WORK

We don’t
automate chaos.

Technology doesn’t fix broken processes — it scales the dysfunction. Every Digital Employee we ship is gated by a five-step framework. We fix what’s broken first, then add AI, then measure. And we kill the agents that fail their KPIs.

Automation amplifies a good process. It amplifies a bad process at the same rate.

The methodology below is the discipline behind every production agent we deploy. It is the reason our outcomes hold under audit — and the reason we refuse engagements that try to skip steps 1 and 2.

FrameworkThe Fix-First Framework
FIG 4.1 · FIVE STEPS · GATED IN ORDER

The discipline that separates deployments that work from ones that don’t.

01
DISCOVERY

Map the process honestly.

Every handoff, every exception, every queue. Rank opportunities by ROI, not by which team is loudest. The map is unglamorous and almost always reveals that the “AI problem” is a process problem.

DeliverableRanked opportunity list with baselined KPIs.
02
PROCESS REPAIR

Fix manually first.

Before a single line of AI code, eliminate redundant steps and simplify decisions by hand. Automation amplifies a good process; it amplifies a bad process at the same rate. This is the step everyone wants to skip and the one that determines whether the deployment lasts.

DeliverableSimplified manual workflow with measurable improvement.
03
VERTICAL SLICE

Add AI — only on a process that’s already well-functioning.

Pilot a vertical slice on real data, real users, real edge cases. Same stack that will run in production. Not a demo — a working agent inside the actual operational context.

Cadence4–6 weeks to a working agent on real data.
04
OUTCOME CONTRACT

Measure outcomes, not deployments.

“We deployed an agent” is not a metric. “Onboarding fell from 16 weeks to 1.9 weeks” is. Shared dashboard, not a quarterly slide. Contract clause: if the numbers don’t move against the baseline, we don’t collect.

ContractOutcome-tied, baselined against pre-agent metrics.
05
CONTINUOUS DECISION

Kill failures fast.

Implementations that don’t deliver get killed. No sunk-cost grinding. Loyalty to outcomes, not technology. The discipline to kill is what makes the rest of the framework credible — clients trust the process precisely because we’re prepared to walk away from a deployment we built.

CadenceQuarterly performance review · kill / scale decision.
FIG 4.2 · HOW WE BUILD AND KEEP AGENTS DEFENSIBLE

Four build constraints. Four guardrails. Applied on every engagement.

CONSTRAINT 01

Technology-agnostic.

Open-weight and frontier models combined per task. Incentives aligned to your outcomes, not to platform consumption volume.

CONSTRAINT 02

Efficiency-first.

Smallest model that does the work. The cost difference between tiers can be a factor of ten — and the demo model is rarely the production model.

CONSTRAINT 03

Sovereign-by-default.

POPIA is a deployment constraint, not a checkbox. Known jurisdiction, auditable identity, from day one.

CONSTRAINT 04

Orchestration-led.

Every agent on an orchestration layer with audit trails, guardrails, versioning. Not a Python script. Validated at Gate 1 (working MVP) and Gate 2 (production sign-off).

GUARDRAIL 01

Factual accuracy.

Grounded in your verified data using RAG. Validation loops cross-reference answers against source documents at runtime.

GUARDRAIL 02

Data privacy.

PII masked before reaching the model. Agents run in secure private-cloud instances. POPIA-aligned deployment is the default.

GUARDRAIL 03

Bias and consistency.

Responses tested across scenario sets before production. Human-in-the-loop governance enforced on sensitive processes.

GUARDRAIL 04

Control and auditability.

Input sanitisation blocks prompt injection. Every decision logged. Each agent operates under its own managed identity.

FIG 4.3 · WHAT WE DON’T DO

Three places where AI deployments quietly fail.

The technology works. The architecture does not. These are the patterns we’ve seen kill otherwise sensible deployments — and the discipline we apply to avoid each.

Anti-pattern 01

Resource Mismatch.

Pilots routed through flagship-class APIs. The demo looks impressive. Then unit economics fail at production scale because nobody costed the inference. We baseline cost-per-task during the MVP, not after deployment.

Anti-pattern 02

Conflated Context.

RAG and MCP confused. RAG is the library — what the agent reads. MCP is the bridge — what the agent can do. Treating one as the other produces agents that hallucinate retrieval or execute actions without context.

Anti-pattern 03

Premature Production.

MCP servers built fast rather than right. Schema hygiene, permission scoping, tool-description quality — those decide whether a server scales past the demo. We treat the MCP layer as production software from week one.

FIG 4.9 · APPLY THE FRAMEWORK

Start
with the map.

The Diagnostic & MVP is paid, four to twelve weeks, and produces a working agent on real data. Before the first line of agent code we baseline the process you want to fix.