Skip to main content

Research PreviewMRP-2026-03v1.027 May 2026Public Distribution36 pages1.2 MB

When precedents commit AI and policy pulls it back

Why giving an AI agent more governance context doesn't make it steadily more decisive

By Sam Carter

Cover — When precedents commit AI and policy pulls it back (MRP-2026-03)

TL;DR

  • We showed one AI agent the same 283 UK procurement decisions five times, adding more of our governance rules each round — from nothing, up to the full policy.
  • More context didn't make it steadily more decisive. It stayed on the fence until we showed it past decisions, then committed in a single jump — and eased back off once it saw the full policy.
  • What moved the agent most was precedent, not policy. The lesson is about how you give an agent context, not how capable the agent is.

Abstract

Regulated teams deploying AI agents cannot say what makes one commit to a decision rather than defer it, or whether more governance context reliably helps. We reused the frozen 283-record UK procurement corpus from MRP-2026-02 and ran the same agent over the same records five times, adding context one rung at a time: baseline, prose, named rules, precedent receipts, then full policy. Predictions and ladder content were locked before any evaluation call; every verdict was bound into an Ed25519-signed receipt anchored to Sigstore Rekor. The agent held at 97.5–100% REVIEW for three rungs, then committed 107 DENYs (37.8%) when precedent receipts appeared, and backed 46 off again under full policy. Commitment is non-monotonic — the precedent rung, not the policy rung, is where it breaks.

Download PDF

PDF · 36 pages · 1.2 MB

All research