Is your AI coding spend producing value — or just activity?

An independent, fixed-price verdict on whether your engineering team is getting real value from Cursor, Copilot, Claude Code or Devin. Read from your own git, PR and CI history — at team and system level. Never individual rankings. Never your application source.

Request an assessment Read a sample verdict

Keith McDonnell

Independent engineering advisor · ~18 yrs Rails · AI-native

I deliver every assessment myself. The verdict carries my name because the judgment is the product — there's no team to hand you off to and no tool to sell you afterward. That's the whole point of an independent read.

— signed, every report

portrait
(warm-mono)

Speed is easy to see. Direction is not.

Your dashboard can tell you commits are up and PRs are flowing. It cannot tell you whether that motion is moving the system forward or quietly accumulating rework, review burden and defects you'll pay for later. The vendor's productivity claims describe their tool's behaviour; they can't describe your system's outcomes. That gap is what an independent read closes.

I characterise the system and its outcomes from the evidence your own history already holds, then hand you a written verdict — findings stated as likelihoods and ranges, not a single decorative score.

No individual-developer rankings.: Strictly team and system level — an ethics line and a methodology line both.
No reading your application source to judge quality.: Declared config and manifests only. Your logic never leaves your environment.
No dashboard to babysit.: One engagement, one verdict — not another tool to maintain.
No implementation or remediation.: The independence that makes the verdict worth having depends on having nothing to sell you afterward.

Scope & qualify: Confirm fit, adoption history and the decision the verdict will inform. No free read.
Extract: You run the read-only extractor in your own environment; an inspectable artifact comes back to me. Source stays with you.
Characterise & correlate: Behavioural signals against cross-organisation reference bands, bracketing the AI-adoption inflection.
Verdict: A signed, hand-written report — likelihoods and ranges, named confounders, a short set of things you can act on.

Fixed price · ~2-week calendar · delivered by me, by name.

What an independent verdict actually reads like

A complete, redacted specimen report — the verdict block, the reference-band comparison, the named limits. The clearest way to see what £25k buys before you ask.

Read the specimen verdict →

Everything I'd want a serious buyer to read before we talk — answered in writing, up front. Organised by topic; the tag marks where each piece sits in the decision.

Pricing

ReframeWhat an independent verdict is worthsoon
CommitWhat it costs, and whysoon

Problems

ReframeSelf-regarding vs other-regarding signals
ReframeWhat AI tools actually do to cognitive loadsoon
ReframeYour DevX team has become a complicated-subsystem teamsoon
ReframeThroughput world vs cost worldsoon
ReframeThe vendor case-study machinesoon

Comparisons

Evaluatevs code-audit / technical-DD shopssoon
Evaluatevs internal DIY metricssoon
Evaluatevs trust-the-vendor / do nothingsoon
Evaluatevs SEI dashboards (DX / Jellyfish / Faros / LinearB)soon

Reviews & proof

ExpandAnonymised engagement patternssoon
EvaluateWhat an independent verdict actually reads likesoon

Best / worst fit

EvaluateIs it too early? The three-month rulesoon
EvaluateWho should not hire mesoon

OSS showcase reads and methodology notes →

If a decision about your AI tooling is coming up, an independent read is worth having before you make it.

Tell me the team size, when AI tooling was adopted, and the decision in front of you.

Request an assessment Read a sample verdict first