Semantic Drift Detection

Semantic drift is the fifth distribution in Nomotic's behavioral fingerprint. While the existing four distributions — action, target, temporal, and outcome — track the structural shape of agent behavior, semantic drift tracks the meaning-level mapping between an agent's instructions and its actions.

The Problem

An agent instructed to "research flights" that begins "booking flights" while maintaining identical action type and target distributions has drifted semantically. The structural fingerprint sees no change — the agent still performs reads and writes against the same APIs. But the operational meaning of its instructions has shifted: "research" no longer means passive information gathering; it now means transactional execution.

Structural drift detection cannot catch this. Semantic drift detection can.

Core Concepts

Semantic Anchors

A SemanticAnchor defines what an instruction term should mean operationally:

from nomotic.semantic import SemanticAnchor

anchor = SemanticAnchor(
    term="research",
    expected_action_distribution={"read": 0.9, "query": 0.1},
    expected_target_distribution={"search_api": 0.7, "reviews": 0.3},
    tolerance=0.15,
)

Anchors are set per-agent, typically from a BehavioralContract or archetype defaults. They encode the expected behavioral signature of each instruction term.

Semantic Action Map

A SemanticActionMap tracks the observed mapping between instruction terms and action patterns for a specific agent. As the agent operates, each action is tagged with the instruction term it was executed under, building per-term action and target distributions.

Semantic Drift Score

A SemanticDriftScore compares the observed distributions against the anchored expectations using Jensen-Shannon Divergence (the same metric used for structural drift). Per-term drift is computed as:

per_term_drift = 0.6 * action_jsd + 0.4 * target_jsd

Action mapping is weighted more heavily (0.6) than target mapping (0.4) because a shift in what the agent does under an instruction is more significant than a shift in where it does it.

The overall semantic drift is the observation-count-weighted average across all anchored terms.

Severity Thresholds

Semantic drift uses the same severity thresholds as structural drift:

Severity

Overall Score

none

< 0.05

low

0.05 - 0.15

moderate

0.15 - 0.35

high

0.35 - 0.60

critical

>= 0.60

Architecture

Semantic drift is tracked externally to the main BehavioralFingerprint because it requires instruction context that the fingerprint doesn't have access to. The architecture is:

SemanticObserver — sits alongside FingerprintObserver in the observation layer. Maintains per-agent SemanticActionMap instances and SemanticAnchor registrations.
BehavioralFingerprint.semantic_map_ref — a reference field linking the fingerprint to its semantic map.
DriftCalculator.compare() — accepts an optional semantic_drift_score parameter. When provided, semantic drift is included in the weighted overall score. When absent, it is excluded for backward compatibility.
DriftScore.semantic_drift — new field (defaults to 0.0) storing the semantic component.
DriftMonitor — accepts an optional semantic_observer and automatically fetches semantic drift before computing overall drift.

Usage

Setting Anchors

from nomotic import FingerprintObserver, PriorRegistry
from nomotic.semantic import SemanticAnchor

observer = FingerprintObserver(prior_registry=PriorRegistry.with_defaults())

# Define what "research" should mean for this agent
anchor = SemanticAnchor(
    term="research",
    expected_action_distribution={"read": 0.9, "query": 0.1},
    expected_target_distribution={"search_api": 0.7, "reviews": 0.3},
)

observer.set_semantic_anchors("agent-1", [anchor])

Providing Instruction Context

Instruction context flows through action parameters:

from nomotic.types import Action

action = Action(
    agent_id="agent-1",
    action_type="read",
    target="/search_api",
    parameters={"instruction_context": "research flights to Tokyo"},
)

The SemanticObserver extracts instruction context from (in order of priority):

The explicit instruction_context parameter passed to observe()
action.parameters["instruction_context"]
action.parameters["task_description"]
action.parameters["goal"]
Falls back to "__untagged__" if no context is available

Querying Semantic Drift

# Through FingerprintObserver
score = observer.get_semantic_drift("agent-1")
if score and score.severity != "none":
    print(f"Semantic drift detected: {score.detail}")
    print(f"Most drifted term: {score.most_drifted_term}")
    for term, drift in score.per_term.items():
        print(f"  {term}: {drift:.3f}")

Direct SemanticObserver Usage

from nomotic.semantic import SemanticObserver, SemanticAnchor

obs = SemanticObserver()
obs.set_anchors("agent-1", [
    SemanticAnchor(
        term="analyze",
        expected_action_distribution={"read": 0.8, "query": 0.2},
        expected_target_distribution={"data_warehouse": 1.0},
    ),
])

# After observations accumulate...
score = obs.get_semantic_drift("agent-1")

Archetype Integration

All 10 built-in archetype priors now include "semantic" in their drift_weights. Archetypes where semantic meaning is especially important have elevated weights:

Archetype

Semantic Weight

Rationale

financial-analyst

1.5

Financial terms must retain exact meaning

security-monitor

1.3

Security terminology is safety-critical

All others

1.0

Standard semantic drift sensitivity

The ArchetypePrior dataclass also gains a semantic_anchors field for defining default anchors per archetype.

Drift Taxonomy Placement

Semantic drift is a distribution (what is drifting), not a scope (who is drifting). In the Nomotic Drift Taxonomy:

Five Drift Distributions:

Action drift — change in what the agent does
Target drift — change in where the agent operates
Temporal drift — change in when the agent acts
Outcome drift — change in governance evaluation outcomes
Semantic drift — change in the meaning mapping between instructions and actions

Five Drift Scopes:

Agent drift — individual agent behavioral deviation
Human drift — human reviewer oversight degradation
Fleet drift — aggregate drift across agent populations
Correlated drift — multiple agents drifting in the same direction
Coordinated drift — agents drifting in complementary ways

Semantic drift can appear at any scope: an individual agent can drift semantically, an entire fleet can exhibit correlated semantic drift, or agents can show coordinated semantic drift where one agent's "research" drift complements another's "execute" drift.

Serialization

All semantic types support to_dict() / from_dict() roundtrip serialization:

# SemanticAnchor
anchor_data = anchor.to_dict()
restored_anchor = SemanticAnchor.from_dict(anchor_data)

# SemanticActionMap
map_data = semantic_map.to_dict()
restored_map = SemanticActionMap.from_dict(map_data)

# SemanticDriftScore
score_data = score.to_dict()

Thread Safety

All mutable state in SemanticActionMap and SemanticObserver is protected by threading.Lock, matching the thread-safety guarantees of the existing fingerprint system.

PreviousReasoning Artifacts Nextgetting-started

Last updated 9 days ago

Good evening

hashtagThe Problem

hashtagCore Concepts

hashtagSemantic Anchors

hashtagSemantic Action Map

hashtagSemantic Drift Score

hashtagSeverity Thresholds

hashtagArchitecture

hashtagUsage

hashtagSetting Anchors

hashtagProviding Instruction Context

hashtagQuerying Semantic Drift

hashtagDirect SemanticObserver Usage

hashtagArchetype Integration

hashtagDrift Taxonomy Placement

hashtagSerialization

hashtagThread Safety