The Intelligence Age

Failed AI project rescue: a board triage guide

How boards in regulated organisations decide whether to stop, contain or recover a failing AI project, with evidence, controls and reporting.

Hamada Mahdi18 June 20268 min readResearched and drafted with AI assistance, reviewed by Karl George MBE

Failed AI project rescue starts by treating the programme as a governance incident, not a vendor dispute. The board should pause irreversible spend, preserve evidence, name an accountable owner, test whether the use case is still worth saving, and approve only fixes that restore control.

This applies when a pilot has stalled, a production system has lost trust, or a supplier delivery has produced a tool nobody can sign off. It pairs with our AI project rescue service and the evidence review in why AI projects fail, but the first decision belongs to the board: is this work still worth saving?

Key takeaways

A rescue should begin with containment: freeze the evidence, stop uncontrolled changes, and appoint one accountable owner.
The board decision is not "can the model be improved?" It is "does the use case still justify recovery under explicit stop criteria?"
Evidence recovery comes before technical repair. Logs, prompts, data lineage, evaluations, decisions and vendor claims are the raw material of the diagnosis.
Most failures sit across governance, data, model, process and control layers, so a model-only fix usually misses the defect that caused the stall.
The board report should show options, evidence, residual risk, spend to date, regulatory touchpoints and a dated decision path.
If the project is structurally unsalvageable, a clean stop is governance working, not a defeat.

Failed AI project rescue triage in the first 72 hours

The first move is to stop the project changing shape while the diagnosis is being made. Do not let a supplier rerun the demonstration, rewrite prompts, swap data, change thresholds or remove logs before the evidence has been captured. A working model can still be an evidential mess; a clean evidence pack can explain why a model did not work.

The NCSC guide for CEOs responding to cyber incidents is useful by analogy. It treats a serious technical incident as a governance, continuity, communications, legal and board-reporting problem, not only as a technical problem. A failing AI programme in a regulated organisation deserves the same discipline: one Senior Responsible Owner, a cross-functional response group, a record of technical facts translated into board consequences, and a route for decisions under time pressure.

Triage question	Evidence to capture	Owner	Initial board action
Is any live service, customer, resident, employee or regulated process affected?	Use-case map, live-user count, incident log, complaints, manual overrides	Executive sponsor	Contain the use case, define escalation rules and stop any unmanaged rollout
Is personal, sensitive, confidential or regulated data involved?	DPIA, data-flow map, processor terms, access logs, retention position	DPO or data lead	Freeze data handling changes until lawful basis, minimisation and security are clear
Can the team explain what success meant at approval?	Original business case, success metric, baseline, board paper, supplier proposal	Programme owner	Continue only if a dated metric and stop condition can be reconstructed
Can the defect be localised to one component?	Evaluation results, prompt and model versions, integration logs, user workflow evidence	Technical lead	Approve a bounded recovery sprint only if the component has a hard edge
Would recovery breach a legal, ethical or sector boundary?	Legal review, regulator guidance, risk appetite, customer impact analysis	General counsel or risk lead	Stop, or redesign the use case before any rebuild

The table is deliberately blunt. A project that cannot identify its affected people, data, metric, component and boundary is not ready to be fixed. It is ready to be contained.

Decide whether to stop, continue or contain

RAND's 2024 research report on AI project failure root causes was based on 65 interviews with experienced AI practitioners and identified five recurring causes: the wrong problem, insufficient data, technology-led work, inadequate infrastructure and problems beyond the current capability of AI. For a board, the value of that list is not the taxonomy. It is the decision test.

Stop where the use case was wrong, prohibited, disproportionate or dependent on data the organisation does not have and cannot lawfully obtain. Do not relabel that as a delivery problem. The governance decision is to close the project, record what was learned, and prevent the same approval pattern recurring.

Continue under containment where the problem is still worth solving but the approval basis has failed. That means no expansion, no additional supplier commitment and no live-user increase until a fresh metric, owner, risk assessment and control set are approved.

Recover where the defect is bounded: a document pipeline that cannot cite its sources, a classifier that needs a confidence floor, a workflow that lacks human approval, or an integration that failed because no one mapped the operational process. The project may still be worth saving, but only as a smaller piece of work with new evidence requirements.

The board should ask for three options in writing: stop now, continue in containment, or recover one component. Each option should carry the cost to date, the next cost, the risk to users or customers, the evidence gap, and the decision date when the option expires.

Recover the evidence before changing the system

Evidence recovery is not administrative tidying. It decides whether the board is looking at a failed model, a failed process or a failed approval. The minimum pack is practical:

The original board paper, business case, supplier proposal and approval conditions.
The intended users, affected people and regulated processes.
The data-flow map, DPIA, access permissions, retention position and processor terms.
The prompts, model versions, evaluation datasets, test results, thresholds and known limitations.
The integration logs, exception logs, manual overrides and user feedback.
The decision trail: who accepted risk, who approved live use, who paused or escalated issues.

The ICO's guidance on AI and data protection covers accountability, transparency, lawfulness, accuracy, fairness, security, data minimisation and individual rights. It is also explicit that the guidance is under review following the Data (Use and Access) Act coming into law on 19 June 2025, so a rescue pack should date its data-protection assumptions and not treat an old DPIA as permanently settled.

The evidence pack should separate fact from interpretation. "The model produced unsupported summaries in 17 test cases" is a fact if the test cases exist. "The supplier failed" is a conclusion. Boards need both, but not mixed together.

Map defects to governance, data, model, process and control

A common mistake is to fold every issue into "model performance". That hides the repair path. The GOV.UK AI Playbook is written for government, but its principles translate cleanly into regulated board work: know AI's limitations, use it lawfully and responsibly, keep meaningful human control, manage the full life cycle, choose the right tool and put assurance in place. The NIST AI RMF Playbook gives a second lens through Govern, Map, Measure and Manage. The UK's AI regulation response adds the local regulatory language: safety and security, transparency and explainability, fairness, accountability and governance, contestability and redress.

Defect layer	What the rescue tests	Framework mapping	Board-level fix
Governance	Owner, risk appetite, success metric, stop criteria, board approval basis	UK accountability and governance; NIST Govern	Name the owner, approve the metric, set the stop date and record the risk appetite
Data	Lawful basis, minimisation, quality, lineage, permissions, retention	ICO data protection duties; NIST Map and Measure	Rebuild the data pack before changing the model
Model	Fitness for the task, accuracy, hallucination risk, bias, drift, confidence thresholds	AI Playbook testing and monitoring; NIST Measure	Define tests the model must pass and when a person must intervene
Process	Workflow fit, user adoption, escalation, handover, exception handling	RAND workflow and problem-definition causes; AI Playbook life-cycle discipline	Redesign the work around the user and the regulated decision point
Control	Audit trail, human approval, disclosure, incident response, user redress	UK transparency, contestability and redress; NIST Manage	Build the control into the system, not into the post-mortem

The board should resist a single green status for the whole programme. A project can be technically promising and legally under-evidenced. It can be lawful but operationally unusable. It can work in a demonstration and fail the human-review pattern. The report should show each layer separately.

Report the rescue as a board decision, not a technical update

The board paper should not read like a sprint note. It should read like a decision record. A chair should be able to see what happened, what is known, what is still unknown, what options exist, and what authority the board is being asked to grant.

Use this pack as the minimum board report:

Current position: what was approved, what was built, where it stalled, and whether any live process is affected.
Evidence recovered: which logs, data records, evaluations, approvals and user feedback have been secured.
Root-cause view: governance, data, model, process and control defects, with confidence in each finding.
Stop, contain or recover options: cost, risk, benefit, timebox and expiry date for each.
Regulatory touchpoints: data protection, security, equality, sector rules, contractual terms and any reporting duties.
Decision requested: the owner, metric, budget limit, stop criteria and next board reporting date.

Common mistakes are easy to spot. The first is letting the vendor run a fresh demo before the original evidence is frozen. The second is defining success as accuracy alone, when the real issue is legal basis, workflow fit or human accountability. The third is bringing the DPO, CISO, legal team or risk lead in as late reviewers, after the design has already failed. The fourth is keeping the pilot alive because no one wants the reputational discomfort of stopping it. The fifth is reporting activity as progress.

Activity is not recovery. A recovery has a dated decision, a named owner and an evidence threshold that can close the work.

Next step: route the work to the right intervention

If the project has already failed or stalled, start with AI Project Rescue. The offer is deliberately narrow: diagnose why it failed, scope one component, rebuild it against an agreed success metric, then decide. That structure is suitable when the board needs a fast stop, contain or recover decision.

If the symptom is wider than one project, route the work to the GovernIQ Diagnostic. That is the better path where the organisation has several pilots, unclear ownership, no common approval route, weak evidence records or uncertainty about how its current controls map to AI risk.

For the wider failure pattern, read why AI projects fail. For the control standard we expect in our own work, see how we hold ourselves to account and the broader services offer.

Last reviewed: 18 June 2026.

Sources: GOV.UK AI Playbook · UK government AI regulation response · ICO AI and data protection guidance · NIST AI RMF Playbook · NCSC CEO guide to cyber incidents · RAND AI project failure root causes

AI project rescueAI governanceboard reportingregulated organisationsAI controls

Related insights

Two ink navy delivery tracks converging through a violet control node on a near-white field

The Intelligence Age

AI governance consultancy vs build shop

When UK boards should choose advisory governance, software delivery or an integrated team for AI work with regulatory evidence.

ArticleHamada Mahdi18 June 20267 min read

Near-white abstract comparison paths in ink navy with one violet route, showing a board choosing between advisory options

The Intelligence Age

Big Four alternatives AI governance

A board buyer's guide to choosing Big Four or specialist AI governance advisers, with controls, framework mapping and evidence to demand.

ArticleHamada Mahdi18 June 20267 min read

Ink navy evaluation matrix and decision fork on a near-white field, with one violet selected path

The Intelligence Age

How to choose an AI governance partner

A board-level guide to selecting an AI governance partner, with a scorecard, evidence tests and framework mapping for UK organisations.

ArticleHamada Mahdi18 June 20267 min read

Where does your board's AI governance actually stand?

Ten questions across accountability, policy, risk, data and capability. You'll get a readiness score, where to focus first, and a recommended next step. It takes about two minutes.

Take the Board AI Scorecard Or book a governance review

Free · ~2 minutes · your score shown straight away.