Every Agile Artifact Was Built to Derisk Humans Writing Code

Your AI tools are working. Your SDLC isn’t.


Your board asked about AI ROI six months ago.

You showed them the metrics: 89% AI coding assistant adoption. AI-generated code at 67% of commits. Developer satisfaction up. Productivity improved 11%.

They nodded. They approved more budget.

But privately, you know something’s off. Your competitor—the one that didn’t exist three years ago—is shipping features in days while your teams measure velocity in sprints. They have 47 engineers. You have 1,800. They’re shipping 6x more features.

You’ve told yourself it’s their greenfield architecture. Their lack of technical debt.

It’s not.

They’re not writing epics, features, and stories. You are.


Every Agile Artifact Was Built to Derisk Humans Writing Code

Look at what each artifact actually does:

User stories reduce cognitive load because humans can only hold 5-9 items in working memory.

Story points create estimation buffers because humans discover hidden complexity while coding.

Sprints batch feedback into fixed cadences because each human context switch costs 15-20 minutes of recovery time.

Acceptance criteria prevent interpretation errors because different humans read requirements differently.

Code review catches logic errors and bugs humans make when tired or rushed.

Separate QA phases find defects humans introduce under deadline pressure.

Every single artifact exists because of human cognitive architecture.

They worked brilliantly for 24 years. We industrialized human risk mitigation.

Then we built agents that don’t have those risks.


Agents Have Different Failure Modes

Human failure mode:

  • Developer reads: “Filter transactions by date”
  • Interprets “date” as calendar date only
  • Writes code for calendar dates
  • QA tests with timestamps
  • Bug discovered: timestamps don’t work
  • Root cause: Human interpretation error

Agile solution: Clearer acceptance criteria, code review, QA testing. This works perfectly.


Agent failure mode:

  • Agent reads: “Filter transactions by date”
  • Specification doesn’t define format (ISO 8601? Unix timestamp?)
  • Agent generates code using ISO 8601 (the default from its training data)
  • Validation fails
  • Root cause: Specification incompleteness

Agile solution: Better story decomposition? More acceptance criteria? Doesn’t help.

The agent didn’t misinterpret. The specification was incomplete.
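The contrast above can be sketched in a few lines of Python. This is a hypothetical illustration; the function and field names are mine, not from any real codebase:

```python
from datetime import datetime

# Sketch of the failure mode above. The spec said "filter by date" but never
# fixed a format, so assume the agent defaulted to ISO 8601 strings.
def filter_by_date(transactions, start, end):
    """Keep transactions whose 'date' falls within [start, end], ISO 8601 assumed."""
    lo, hi = datetime.fromisoformat(start), datetime.fromisoformat(end)
    return [t for t in transactions
            if lo <= datetime.fromisoformat(t["date"]) <= hi]

txns = [{"id": 1, "date": "2025-01-15"}, {"id": 2, "date": "2025-03-01"}]
print(filter_by_date(txns, "2025-01-01", "2025-02-01"))  # [{'id': 1, 'date': '2025-01-15'}]

# Validation feeds it Unix timestamps, the format the spec never ruled in or
# out. The code is not "wrong"; the specification was incomplete.
try:
    filter_by_date([{"id": 3, "date": "1736899200"}], "1735689600", "1738368000")
except ValueError as err:
    print("spec gap:", err)
```

No amount of story decomposition catches this; only a specification that names the format does.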


Humans fail through misinterpretation. Agents fail through specification incompleteness.

Different failure modes require different artifacts.

You’re using artifacts designed for one failure mode to address a completely different one.


Stop Writing Stories. Start Writing Perfect Specifications.

Old way:

  1. Write epic, decompose into 8 stories (3 hours)
  2. Story pointing and sprint planning (2 hours)
  3. Development sprint with agent assistance (2 weeks)
  4. Code review, QA phase, security review
  5. Deploy

Timeline: 6-8 weeks


New way:

  1. Spend 4-6 hours writing complete specification WITH the agent
  2. Agent implements code + tests + docs (4 hours)
  3. Validation reveals specification gaps (2 hours)
  4. Refine specification (2 hours), agent regenerates (2 hours)
  5. Deploy

Timeline: 2-3 days


Same time investment. 10x better outcome.

What a complete specification looks like:

Instead of four separate stories (“As a user, I want to filter by date…” + test story + security story + QA story), write one complete specification:

Feature: Transaction Search
Investment Theme: Customer retention efficiency

API Contract:
POST /search/transactions | p95 latency < 200ms
Parameters: date_range, amount_range, merchant, status

Behavior:
GIVEN 10K transactions
WHEN filtering by date_range + amount_range
THEN return matching transactions with pagination

Security: SQL injection prevention, rate limiting, no PII in logs
Performance: Indexes on date/amount/status, 1M transactions per user
Tests: Date edge cases, boundaries, empty results, concurrent requests

Time with agent: 4-6 hours

Agent implements: Everything—code, tests, docs—in 4 hours

No separate test stories. No separate security review. No separate QA phase.
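As a sketch of what “no separate QA phase” means in practice, here is one hypothetical way the spec’s GIVEN/WHEN/THEN clause could compile straight into an executable check. The function name, field names, and pagination defaults are illustrative assumptions, not the article’s actual implementation:

```python
# Minimal in-memory stand-in for the Transaction Search behavior clause.
def search_transactions(txns, date_range=None, amount_range=None, page=1, per_page=50):
    """Filter txns by optional date_range / amount_range, then paginate."""
    results = txns
    if date_range:
        lo, hi = date_range
        results = [t for t in results if lo <= t["date"] <= hi]  # ISO dates sort lexically
    if amount_range:
        lo, hi = amount_range
        results = [t for t in results if lo <= t["amount"] <= hi]
    start = (page - 1) * per_page
    return {"items": results[start:start + per_page], "total": len(results)}

# GIVEN 10K transactions
txns = [{"id": i, "date": f"2025-01-{i % 28 + 1:02d}", "amount": i % 500}
        for i in range(10_000)]
# WHEN filtering by date_range + amount_range
result = search_transactions(txns,
                             date_range=("2025-01-01", "2025-01-07"),
                             amount_range=(0, 100))
# THEN return matching transactions with pagination
assert len(result["items"]) <= 50
assert result["total"] >= len(result["items"])
```

The behavior clause is the test; there is no hand-off to a later phase that re-derives it.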


Replace Six Layers with Three

Your current hierarchy exists to decompose work for human cognitive limits:

Portfolio → Program → Epic → Feature → Story → Sub-task

If agents execute complete features from specifications in hours, why six layers?

Replace with three:

Investment Theme → Software Feature → Executable Specification

Investment Themes = Where you place capital (Customer acquisition efficiency, Platform resilience)

Software Features = What you’re building (Multi-currency payments, Fraud detection)

Executable Specifications = What done looks like (Written WITH agent in 4-6 hours, complete enough that agent implements everything)

No epics. No stories. No decomposition.
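For illustration, the three layers can be held as plain data. This is a minimal sketch with class and field names of my own choosing, not a prescribed schema:

```python
from dataclasses import dataclass, field

# Three layers, three types: nothing between capital allocation and "done".
@dataclass
class ExecutableSpecification:
    feature: str
    api_contract: str
    behavior: str                        # GIVEN/WHEN/THEN clauses
    security: list = field(default_factory=list)

@dataclass
class SoftwareFeature:
    name: str
    specs: list = field(default_factory=list)

@dataclass
class InvestmentTheme:
    name: str
    features: list = field(default_factory=list)

theme = InvestmentTheme(
    "Customer retention efficiency",
    [SoftwareFeature("Transaction Search", [ExecutableSpecification(
        feature="Transaction Search",
        api_contract="POST /search/transactions | p95 latency < 200ms",
        behavior="GIVEN 10K transactions WHEN filtering THEN paginated results",
        security=["SQL injection prevention", "rate limiting", "no PII in logs"],
    )])],
)
print(len(theme.features))  # 1
```

Notice what is absent: no epic, story, or sub-task types, because nothing decomposes work for human working memory.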


The Strategic Learning Velocity Gap

Your process: 6-8 weeks per feature = 6.5 features/year

Competitor process: 2-4 days per feature = 90 features/year

When they test 10 product hypotheses while you test 1:

  • They find product-market fit faster
  • They learn what customers want faster
  • They adapt to market shifts faster
  • They waste less capital on wrong directions

Year 3: Their product is fundamentally better because they had 14x more at-bats.

This isn’t a productivity gap. This is a strategic learning velocity gap.
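The arithmetic behind those numbers is simple; the working-week and working-day counts below are rough assumptions:

```python
# Features per year at each cycle time, using worst-case incumbent cycles.
WORK_WEEKS, WORK_DAYS = 52, 252

incumbent = WORK_WEEKS / 8      # 6-8 week cycles, worst case
challenger = WORK_DAYS / 2.8    # 2-4 day cycles, ~2.8 days average

print(incumbent)                      # 6.5 features/year
print(round(challenger))              # 90 features/year
print(round(challenger / incumbent))  # 14x more at-bats
```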


Why Specifications Work Now: Rapid Waterfall

Waterfall failed:

  • 2-month spec → 6-month implementation → Catastrophic errors

Rapid Waterfall with agents:

  • 6-hour spec → 4-hour implementation → Immediate gap revelation → Trivial to fix

When implementation takes 4 hours instead of 4 months, specification incompleteness is cheap to fix.

You get waterfall’s comprehensive specifications + agile’s rapid feedback loops.


This Cannot Be Delegated

Your transformation office will propose “AI-enhanced story writing.” Your PMO will create governance that protects the old system. Your agile coaches’ jobs depend on epics and stories surviving.

None of them will say: “These artifacts are broken. Replace them.”

That call sits with you.


What personal leadership means:

Week 1: YOU select 3 teams, brief them: Investment Theme → Specification → Agent → Deploy. No epics, no stories, no sprints.

Weeks 2-13: YOU review their specifications weekly. Are specs complete? Testable? What’s cycle time vs. traditional teams?

Week 14: YOU present results to board with hard data.

Non-negotiable: Your personal involvement.

This is a capital allocation model change, not a process optimization. Only you can make that call.


The Board Will Ask About AI ROI

Answer A (Optimization):

“Yes, strong ROI. 92% AI adoption. Productivity up 11%. Roadmap includes expanded capabilities.”

Board: Approves budget.

Reality: Your competitor ships another 50 features before the board meets again. They’re building moats, and your slowness is their wedge.

18 months later: Board asks why competitor is gaining market share.


Answer B (Transformation):

“No. We’re using AI to optimize a 2001 process instead of building a 2025 process.

Our SDLC was designed for human constraints. Agents don’t have those constraints.

Competitors who replaced epics and stories with specifications see 90% cycle time reduction. They test 14 hypotheses while we test 1.

That’s strategic learning velocity, not productivity.

I need support for 3 pilot teams. Investment Theme → Specifications → Agent Implementation. I’ll review their specs weekly. 90 days to hard data.

We cannot keep optimizing for a constraint that no longer exists.”

Board: Harder questions. Your commitment visible.

Reality: In 90 days, you have data. Lead industry transition or learn definitively.


Which answer protects shareholder value?


What You’ll Do

Next week, someone will propose “AI-enhanced story writing.”

You’ll say:

Option 1: “Great. Let’s optimize our stories with AI.”

  • Result: 10-15% improvement, competitor ships 10x faster, board asks hard questions in 18 months

Option 2: “No. Stories are the problem. We’re replacing them.”

  • Result: 3 pilot teams, 90 days, hard data, lead or learn

One is comfortable. One is leadership.

One is delegatable. One requires your personal commitment.

Only one will work.


The Truth

We spent 24 years getting faster at buying down human risk.

Then in 2022, we built agents that don’t generate those risks.

And we kept running the same risk mitigation process.


Your teams sit in retrospectives every two weeks asking: “We have all these AI tools. Why isn’t velocity dramatically better?”

Because you’re making agents:

  • Write user stories they don’t need
  • Wait for code reviews to catch bugs they don’t make
  • Go through separate QA for tests they generate with code

You’re running a 2025 workforce through a 2001 process.


The shift is simple:

Stop: Writing epics, stories, sprint planning

Start: Writing perfect specifications with agents

Same time investment. 10x better outcome.

Kiss story writing goodbye.


Every agile artifact was built to derisk humans writing code.

Agents don’t have those risks.

The constraint changed. The artifacts didn’t.

What will you do?
