Agents Can't Verify Their Work
Without tests or CI feedback loops, your AI agent writes code it can never validate. Every change is a coin flip.
84% of developers use AI coding tools. Only 11% of organizations have agents in production. The problem isn't the AI — it's the codebase. Get your Nerva Score™ and find out exactly what to fix.
You bought Copilot licenses. You tried coding agents. But nothing sticks — because the infrastructure your agents need doesn't exist yet.
No tests, no CI feedback loops. Your AI agent writes code it can never validate. Every change is a coin flip.
No AGENTS.md, no API docs, no architecture diagrams. The agent guesses at your codebase conventions — and guesses wrong.
Only 29% of developers trust AI code accuracy — down from 40%. Without infrastructure-backed trust, adoption dies.
AI coding agents are only as effective as the infrastructure surrounding the repository. A codebase without tests, CI/CD, documentation, and observability makes AI agents useless.
We don't fix your AI. We get into your repos and build the infrastructure so agents can actually work.
81 criteria. 9 pillars. 5 maturity levels. A rigorous assessment that tells you exactly where your repos stand — and what to fix to make AI agents actually work.
L1: Basic version control. Some formatters and linters. Agent can read code but cannot verify its own changes.
L2: Basic CI/CD and testing in place. Agent can fix straightforward bugs with rapid CI feedback.
L3: Production-ready for agents. Agent can fix bugs, add tests, update docs, implement clearly specified features. This is where ROI starts.
L4: Comprehensive automation and metrics. Agent can execute multi-file refactors, optimize performance, harden security.
L5: Full autonomous capability. Agent can develop complete features, respond to incidents, triage work autonomously.
Every pillar represents a critical feedback loop your AI agents need to function. Miss one, and the whole system breaks down.
Formatters, linters, type checkers, pre-commit hooks, complexity analysis
CI/CD pipelines, release automation, deployment frequency, build caching
Unit tests, integration/E2E, coverage thresholds, flaky test detection
README, AGENTS.md, API schemas, architecture diagrams, freshness checks
Env templates, devcontainers, database migrations, local service setup
Structured logging, error tracking, distributed tracing, alerting, runbooks
Secrets management, branch protection, dependency scanning, DAST
Issue templates, labeling systems, PR templates, backlog health
Error-to-insight pipelines, product analytics instrumentation
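To make one of those feedback loops concrete, here is a minimal sketch of the structured logging the observability pillar describes, assuming a Python service and nothing beyond the standard library. The logger name and context fields are placeholders, not a prescription.

```python
import json
import logging


class JsonFormatter(logging.Formatter):
    """Render each log record as one JSON object so agents and
    error-tracking pipelines can parse logs without regexes."""

    def format(self, record: logging.LogRecord) -> str:
        payload = {
            "timestamp": self.formatTime(record),
            "level": record.levelname,
            "logger": record.name,
            "message": record.getMessage(),
        }
        # Attach structured context passed via `extra=...`, if present.
        for key in ("request_id", "user_id"):
            if hasattr(record, key):
                payload[key] = getattr(record, key)
        return json.dumps(payload)


handler = logging.StreamHandler()
handler.setFormatter(JsonFormatter())
logger = logging.getLogger("checkout")  # placeholder service name
logger.addHandler(handler)
logger.setLevel(logging.INFO)

# One event, one JSON line: easy to correlate with a failing test or an alert.
logger.info("payment captured", extra={"request_id": "req-123", "user_id": "u-42"})
```

One event per line, in a machine-readable shape, is what lets an agent (or an error-to-insight pipeline) trace a failure back to the change that caused it.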
We don't hand you a report and walk away. We embed in your team and do the work.
We scan your repositories against 81 criteria across all 9 pillars. You get your L1–L5 Nerva Score with every gap identified — for free.
We build a prioritized transformation roadmap: which pillars to fix first, which agents to deploy, and what L3 looks like for your stack and team.
Hands-on, in your repos: we set up CI/CD, write test infrastructure, configure agents, create docs, and harden security.
AI agents are live and working. We train your team, measure the score delta, and expand to more repos. You see the ROI in weeks, not quarters.
This isn't a slide-deck engagement. Our consultants are in your codebase day to day, making AI agents work.
Configure Claude Code, Copilot, Cursor, or OpenHands to work with your specific repos, conventions, and workflows.
Build fast feedback loops so agents can verify their own work. Agents that can't test are agents that can't ship.
Write the test suites, coverage gates, and flaky-test detection that let agents make changes with confidence.
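As a sketch of what that feedback loop looks like in practice, assuming a Python repo with pytest and pytest-cov: `apply_discount`, the test values, and the 80% threshold below are placeholders for your own code and policies.

```python
# A fast, deterministic check is the kind of feedback an agent can run in
# seconds to verify its own change. `apply_discount` stands in for your code;
# in a real repo it lives in the application package, not the test file.
import pytest


def apply_discount(price: float, percent: float) -> float:
    """Placeholder business logic: discount a price, never below zero."""
    return max(price * (1 - percent / 100), 0.0)


def test_discount_is_applied():
    assert apply_discount(price=100.0, percent=10) == pytest.approx(90.0)


def test_discount_never_goes_negative():
    assert apply_discount(price=10.0, percent=150) == 0.0


# A coverage gate turns "the tests pass" into "the tests pass and exercise the
# change". With pytest-cov installed, CI can fail below a threshold, e.g.:
#   pytest -q --cov=your_package --cov-fail-under=80
```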
Create AGENTS.md, architecture docs, and contributing guides so agents understand your codebase — not guess at it.
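A bare-bones AGENTS.md might look like the sketch below. Every command, path, and rule here is a placeholder; the point is to give the agent the same setup, verification, and boundary information you would give a new hire.

```markdown
# AGENTS.md

## Setup
- `make dev` starts the local stack (database, cache, API).

## Verify your changes
- Run `make test` before opening a PR; CI runs the same target.
- Lint and format with `make lint`; the pipeline rejects unformatted code.

## Conventions
- Services live in `services/<name>/`; shared code goes in `libs/`.
- Never edit generated files under `gen/`.

## Boundaries
- Do not touch `migrations/` without a linked issue.
```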
Secrets management, branch protection, structured logging, error tracking — the infrastructure agents need to operate safely.
Train your engineers to work alongside AI agents. New workflows, new review processes, new ways of thinking about code.
Adoption is near-universal. But adoption without infrastructure is just expensive experimentation.
The gap between "using Copilot" and "agents shipping features" is the infrastructure gap. That's what we close.
AI projects that never reach production. Not because the AI failed — because the codebase wasn't ready.
At L3, AI agents can fix bugs, add tests, and implement features autonomously. Most repos are L1. We get you to L3.
You're under pressure to prove AI productivity gains. Get a board-ready score that shows exactly where you stand and what the path to ROI looks like.
Your board wants an AI strategy. Give them a maturity framework they understand: L1 today, L3 in 12 weeks, with measurable criteria at every step.
Stop the ad hoc "some teams use Copilot" approach. Get a systematic assessment across all repos and a clear transformation playbook.
Start with a free Nerva Score™ assessment. Then we embed in your team and make the transition happen — agents writing code, running tests, and shipping features.