Is your AI coding spend producing value — or just activity?

An independent, fixed-price verdict on whether your engineering team is getting real value from Cursor, Copilot, Claude Code or Devin. Read from your own git, PR and CI history — at team and system level. Never individual rankings. Never your application source.

Keith McDonnell

Independent engineering advisor · ~18 yrs Rails · AI-native

I deliver every assessment myself. The verdict carries my name because the judgment is the product — there's no team to hand you off to and no tool to sell you afterward. That's the whole point of an independent read.

— signed, every report

portrait
(warm-mono)

Speed is easy to see. Direction is not.

Your dashboard can tell you commits are up and PRs are flowing. It cannot tell you whether that motion is moving the system forward or quietly accumulating rework, review burden and defects you'll pay for later. The vendor's productivity claims describe their tool's behaviour; they can't describe your system's outcomes. That gap is what an independent read closes.

I characterise the system and its outcomes from the evidence your own history already holds, then hand you a written verdict — findings stated as likelihoods and ranges, not a single decorative score.

No individual-developer rankings.
Strictly team and system level — an ethics line and a methodology line both.
No reading your application source to judge quality.
Declared config and manifests only. Your logic never leaves your environment.
No dashboard to babysit.
One engagement, one verdict — not another tool to maintain.
No implementation or remediation.
The independence that makes the verdict worth having depends on having nothing to sell you afterward.
Scope & qualify
Confirm fit, adoption history and the decision the verdict will inform. No free read.
Extract
You run the read-only extractor in your own environment; an inspectable artifact comes back to me. Source stays with you.
Characterise & correlate
Behavioural signals against cross-organisation reference bands, bracketing the AI-adoption inflection.
Verdict
A signed, hand-written report — likelihoods and ranges, named confounders, a short set of things you can act on.

Fixed price · ~2-week calendar · delivered by me, by name.

What an independent verdict actually reads like

A complete, redacted specimen report — the verdict block, the reference-band comparison, the named limits. The clearest way to see what £25k buys before you ask.

Read the specimen verdict →

Everything I'd want a serious buyer to read before we talk — answered in writing, up front. Organised by topic; the tag marks where each piece sits in the decision.

Pricing

  • ReframeWhat an independent verdict is worthsoon
  • CommitWhat it costs, and whysoon

Problems

  • ReframeSelf-regarding vs other-regarding signals
  • ReframeWhat AI tools actually do to cognitive loadsoon
  • ReframeYour DevX team has become a complicated-subsystem teamsoon
  • ReframeThroughput world vs cost worldsoon
  • ReframeThe vendor case-study machinesoon

Comparisons

  • Evaluatevs code-audit / technical-DD shopssoon
  • Evaluatevs internal DIY metricssoon
  • Evaluatevs trust-the-vendor / do nothingsoon
  • Evaluatevs SEI dashboards (DX / Jellyfish / Faros / LinearB)soon

Reviews & proof

  • ExpandAnonymised engagement patternssoon
  • EvaluateWhat an independent verdict actually reads likesoon

Best / worst fit

  • EvaluateIs it too early? The three-month rulesoon
  • EvaluateWho should not hire mesoon

If a decision about your AI tooling is coming up, an independent read is worth having before you make it.

Tell me the team size, when AI tooling was adopted, and the decision in front of you.