Skip to content
Back to Magazine
ai-operating-models 4 min read

Context Architecture: From Loose Prompts to Knowledge Operating System

Does this apply to your company?

Free 30-min AI diagnostic →

Decision

Decide what governance, ownership or cadence is missing before scaling AI.

Room

Executive committee, AI portfolio review, transformation steering.

Risk

Mistaking activity, pilots and tooling for real operating capability.

Agent prompt: map decision rights, KPIs, risks and the next operational move

This is the pillar that defines the system. If you want operational criticism of prompt engineering, check out the derivative: Why prompt engineering doesn’t scale business.

Problem

Prompt engineering solves local symptoms, not architecture.

A team can get “acceptable” answers with increasingly longer prompts, but this approach collapses when more use cases appear, more people edit instructions, and more sources of truth compete with each other.

Thesis

Scaling AI requires designing context before prompts.

A good prompt doesn’t replace a bad knowledge base. It only masks it for a while.

Framework: Context Architecture

Layer 1: Knowledge Model

Defines which sources are canonical for each decision:

  • policies
  • procedures
  • operational data
  • risk criteria

Each source must have an owner and version.

Layer 2: Retrieval Contract

Not all content should be included in every response.

You must define:

  • what is retrieved by query type
  • what is left out
  • what confidence threshold triggers human escalation

Layer 3: Continuous Evaluation

Without evaluation, any system drifts.

You need recurring tests on:

  • factuality
  • consistency
  • utility for the executing role

Case (anon): a services company had three teams launching assistants on different internal documents. The result was correct answers in demos, but contradictory ones in daily operation. By defining canonical sources per decision, versioning context, and setting escalation thresholds, variability decreased without changing the model.

From Prompt Craft to Decision Infrastructure

A prompt can improve presentation. Context Architecture improves reliability. The operational difference is critical:

  • the prompt lives in the interface,
  • the context lives in the system,
  • the decision occurs in the business.

If you only optimize the interface, inconsistencies return as soon as the team, use case, or data source changes.

Minimum Contract per Use Case

Each productive flow should explicitly declare:

  1. decision objective: what result this answer enables,
  2. allowed sources: what corpus can be used and with what priority,
  3. confidence threshold: when to escalate to human,
  4. knowledge owner: who maintains each source.

Without a contract, the system improvises. And a system that improvises doesn’t scale with control.

Error Taxonomy to Truly Improve

Not all failures are “hallucinations”. In practice, they usually fall into four types:

  • missing source: the information wasn’t available,
  • contradictory source: there were different versions without hierarchy,
  • deficient retrieval: irrelevant context was retrieved,
  • ambiguous decision: it wasn’t clear what output was useful for that role.

Labeling errors this way allows correcting architecture, not just prompt wording.

A minimum bi-weekly routine should include:

  • sampling real queries by flow,
  • error analysis by taxonomy,
  • source versioning decisions,
  • adjustment of escalation thresholds.

If this cadence doesn’t exist, quality depends on individual heroism.

Context Maturity Signals

  • average human review time decreases,
  • contradictory answers between teams decrease,
  • reuse of answers in recurring operations increases,
  • number of “patch” prompts per case drops.

When these signals improve together, you no longer have isolated prompt engineering: you have decision architecture.

Posture: This isn’t a prompt project or a tool purchase; without real governance, it’s theater.

Breathing: In real organizations, the pain isn’t the model: it’s who can say no and turn off a use case.

Operational Protocol (3 steps)

  1. Inventory active sources per use case and eliminate conflicting duplicates.
  2. Define for each flow a minimum “context packet”: 3-5 blocks of validated information.
  3. Execute a bi-weekly evaluation cycle with 15 real queries and correction plan.

Signals that You’re on Track

  • fewer exceptions due to contradictory answers
  • shorter human review time
  • greater reuse of answers in recurring operations

Anti-Patterns

  • relying on conversational memory as a repository
  • mixing normative knowledge and opinion in the same block
  • changing prompts every week without version control

Related:

Closing

The prompt is an interface. The context is the system. If you want robust results, design the architecture that supports the answer.

If you need to land it in your current stack, we can do it in advisory or in a diagnostic.


Translated from the Spanish original with AI assistance and reviewed for accuracy. Read the original in Spanish.

Context Architecture RAG
Cite this article

Berthelius, V. (2025). “Context Architecture: From Loose Prompts to Knowledge Operating System”. BRTHLS Magazine. https://www.brthls.com/magazine/context-architecture-from-prompts-to-knowledge-os-en

Fractional CAIO · Free diagnostic

Is your company ready to operate with AI?

30 minutes. No pitch. An honest read on where you are and what to move first.

Book free diagnostic