Chapter 3

The 6 requirements your contextual data layer must deliver

The 6 operating requirements of a contextual data layer: semantic clarity, relationships, freshness, provenance, an AI-native service layer, and unified multimodel coverage.

01 Semantic clarity

Data carries meaning, not just structure. Canonical entities resolve terminology drift across CRM, billing, and support.

02 Relationships & entity graph

Connections between entities are first-class and native. Multi-hop reasoning without federated joins.

03 Freshness & temporal correctness

Context reflects what is true now — and can answer what was true at time T. Bi-temporal by design.

04 Provenance & trust

Lineage, RBAC, and audit logging embedded in the platform. Every AI output is traceable to source.

05 AI-native service layer

Retrieve, rank, cite, and ground are built-in. Not reimplemented in every consuming application.

06 Unified multimodel platform

Graph, vector, document, key-value, and search — one foundation, one query language, one governance model.

Requirement 01

Semantic clarity

Schemas capture structure. They don’t capture meaning.

A schema records how your data is shaped, not what it means. CRM, billing, and support each mark a customer “active,” yet each applies a different rule. Same word, three definitions, three consuming systems.

CRM: status = "active" means logged in within the last 30 days

Billing: status = "active" means invoice paid this cycle

Support: status = "active" means an open case

A contextual data layer resolves those definitions into a single shared meaning: one answer to “what does active mean,” applied consistently across every system that asks.

**Figure 3.1** — Same word, three meanings. One governed entity model.
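To make the idea of one governed definition concrete, here is a minimal Python sketch. It keeps each system's own reading of “active” and derives a single canonical answer from them. The field names and the canonical rule (paying and recently logged in) are assumptions chosen for illustration; they are not Arango's entity model or API.

```python
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass
class CustomerSignals:
    """Raw per-system facts about one customer (hypothetical field names)."""
    last_login: date               # from CRM
    invoice_paid_this_cycle: bool  # from billing
    open_cases: int                # from support


def source_views(s: CustomerSignals, today: date) -> dict[str, bool]:
    """What each system means when it reports status = "active"."""
    return {
        "crm": (today - s.last_login) <= timedelta(days=30),
        "billing": s.invoice_paid_this_cycle,
        "support": s.open_cases > 0,
    }


def canonical_active(s: CustomerSignals, today: date) -> bool:
    """One shared definition (an assumed rule, for illustration only):
    a customer is active if they are paying AND logged in recently."""
    views = source_views(s, today)
    return views["crm"] and views["billing"]
```

Every consumer calls `canonical_active` instead of reading a raw `status` column, so a change to the definition happens in one place rather than in three systems.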

What this gives you

One shared definition for every entity, across every system

Business meaning that stays current as the organization evolves

Consistent answers to the same question, every time

In Arango

AutoGraph automatically builds a knowledge graph that understands your business domain

AQL queries graph, vector, document, and key-value in a single statement — no translation between systems

Definitions and relationships update continuously — no manual re-modeling required

Failure mode

Every consuming system invents its own join key. The same customer is “cust_3781” in CRM, “acme-corp” in billing, and “ACME Corporation, Inc.” in support tickets.

AI agents operating on one slice cannot reason across the others without a brittle mapping table that drifts the moment a system changes, a new source appears, or someone enters data differently.

How Arango solves it

Arango resolves those fragmented identifiers into a single, stable entity, automatically. Whether a match is exact, approximate, or inferred from structural similarity, the result is one customer, one ID, one definition that every agent and application reads from. And when definitions evolve (because they always do), Arango tracks those changes over time. A query about what was true last quarter returns last quarter’s answer. A query about today returns today’s.
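The resolution step described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not Arango's implementation: the `EntityResolver` class, the suffix list, the canonical ID format, and the 0.85 threshold are all hypothetical. It covers only exact and approximate matching and leaves structural inference out.

```python
import re
from difflib import SequenceMatcher

# Legal suffixes dropped during normalization (illustrative, not exhaustive).
LEGAL_SUFFIXES = {"inc", "corp", "corporation", "llc", "ltd", "co"}


def normalize(name: str) -> str:
    """Lowercase, split on punctuation, and drop legal suffixes."""
    tokens = re.findall(r"[a-z0-9]+", name.lower())
    return " ".join(t for t in tokens if t not in LEGAL_SUFFIXES)


class EntityResolver:
    """Hypothetical resolver: maps raw identifiers to one canonical ID."""

    def __init__(self, threshold: float = 0.85):
        self.threshold = threshold          # assumed cutoff for fuzzy matches
        self.aliases: dict[str, str] = {}   # opaque system ID -> canonical ID
        self.names: dict[str, str] = {}     # normalized name -> canonical ID

    def register(self, canonical_id: str, name: str, identifiers=()):
        self.names[normalize(name)] = canonical_id
        for ident in identifiers:
            self.aliases[ident] = canonical_id

    def resolve(self, raw: str):
        # 1. Exact match on a known system identifier.
        if raw in self.aliases:
            return self.aliases[raw]
        # 2. Exact match after normalization.
        key = normalize(raw)
        if key in self.names:
            return self.names[key]
        # 3. Approximate match: best string similarity above the threshold.
        best_id, best_score = None, 0.0
        for known, canonical_id in self.names.items():
            score = SequenceMatcher(None, key, known).ratio()
            if score > best_score:
                best_id, best_score = canonical_id, score
        return best_id if best_score >= self.threshold else None
```

Registering the customer once, with `resolver.register("customer/acme", "ACME Corporation, Inc.", identifiers=["cust_3781"])`, lets `cust_3781`, `acme-corp`, and `ACME Corporation, Inc.` all resolve to the same ID. An unrelated name returns `None` instead of a forced match.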

Requirement 02

Relationships & entity graph

Enterprise context must be connected, not fragmented.

Vector similarity finds related documents. It will not tell you that Customer X is churning because the engineer who championed them left three months ago. That’s a relationship question — and relationships need to be first-class, native edges, not joins invented at query time.

**Figure 3.2** — Entities connected by explicit, native relationships instead of ad-hoc joins at query time.
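To show why this is a relationship question, the sketch below stores explicit edges and follows them two hops, from a customer to its champion to that person's departure. It is plain Python with invented node names and edge labels, intended only to illustrate multi-hop traversal over first-class edges; it is not AQL and does not use any Arango API.

```python
from collections import defaultdict

# (source, relationship, target) triples; all names are invented for illustration.
EDGES = [
    ("customer/x", "championed_by", "person/dana"),
    ("person/dana", "left_company", "event/departure"),
    ("customer/x", "has_ticket", "ticket/481"),
    ("customer/y", "championed_by", "person/lee"),
]


def build_index(edges):
    """Index outgoing edges by source node."""
    out = defaultdict(list)
    for src, rel, dst in edges:
        out[src].append((rel, dst))
    return out


def paths_from(index, start, max_hops):
    """Yield every path of 1..max_hops edges from start as (rel, node) steps."""
    stack = [(start, [])]
    while stack:
        node, path = stack.pop()
        if path:
            yield path
        if len(path) < max_hops:
            for rel, dst in index.get(node, []):
                # Skip nodes already on this path to avoid cycles.
                if dst != start and all(dst != n for _, n in path):
                    stack.append((dst, path + [(rel, dst)]))


def champion_departures(index, customer):
    """Two-hop pattern: customer -> champion -> departure event."""
    return [
        (path[0][1], path[1][1])
        for path in paths_from(index, customer, max_hops=2)
        if len(path) == 2
        and path[0][0] == "championed_by"
        and path[1][0] == "left_company"
    ]
```

With the sample edges, `champion_departures(build_index(EDGES), "customer/x")` surfaces the champion and the departure event, while `customer/y` returns an empty list because its champion has no departure edge. A similarity search over documents has no equivalent of that second hop.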