Cookies

We use cookies to analyze traffic and embed scheduling tools. Choose what you're OK with.

WorkAI LabGalleryServicesAboutBlog
Anil PervaizHire me
Shipping Claude Code for real work
Engineering

Shipping Claude Code for real work

Anil Pervaiz
Anil Pervaiz·March 14, 2026·7 min read

A field report from running Claude Code on production codebases — the patterns that scale, the failure modes that look like success, and the rituals I keep.

On this page
  1. Treat it like a contractor, not autocomplete
  2. The prompt patterns that actually scale
  3. The failure mode that looks like success
  4. Review is the new bottleneck
  5. The infrastructure that turns it into a teammate
  6. Where I don't use it

After six months of running Claude Code as a daily driver across three production codebases, I stopped reaching for autocomplete-style copilots. The difference isn't the model. It's how you work with it.

Here is the field report: the prompt patterns that scale, the failure mode that looks like success, and the small infrastructure investments that turn a clever assistant into an actual teammate.

Treat it like a contractor, not autocomplete

Autocomplete finishes your line. An agent does a task. The mental shift is to stop thinking in keystrokes and start thinking in work orders: "add a rate limiter to these three routes, match the existing error shape, and show me the diff." The closer your request looks to a ticket you would hand a competent contractor, the better the result.

The prompt patterns that actually scale

Three habits did most of the work:

  • Point at examples. "Match the pattern in this file" beats any amount of description.
  • Constrain the surface. Tell it which files to touch and which to leave alone. Unbounded tasks produce unbounded diffs.
  • Ask for the plan first on anything non-trivial. A 30-second plan review catches the wrong approach before it writes 300 lines.

The failure mode that looks like success

The dangerous output isn't the obviously broken one. It's the confident, plausible diff that passes a glance and quietly does the wrong thing: a test that asserts nothing, an error path swallowed, a security check skipped because it "wasn't in scope." This is the same lesson as building agents that don't hallucinate: the model is most dangerous when it is wrong and smooth about it. The fix is simple and non-negotiable — never merge what you didn't read.

Review is the new bottleneck

When the agent writes faster than you can read, your review becomes the constraint. I leaned into it: smaller diffs, clearer commits, and a habit of asking "what would make this wrong?" before merging. The teams that get burned by AI coding tools are the ones that let velocity outrun review.

The infrastructure that turns it into a teammate

A few cheap investments compounded:

  • A tight project doc stating conventions, so I'm not re-explaining the stack every session.
  • Fast tests and a type-checker it can run itself, so it gets a feedback loop instead of guessing.
  • Small, reviewable commits, so when something is wrong I can see exactly where.

None of this is exotic. It is the same hygiene that makes human teams fast — the agent just rewards it more obviously.

Where I don't use it

I still write the hard parts myself: the security-sensitive code, the gnarly state machine, the thing where being 95% right is worse than not shipping. The agent is fast on the well-defined 80%. The remaining 20% is where the judgment lives, and that judgment is exactly what clients pay an AI architect for.

Claude Code didn't replace engineering judgment. It moved where I spend it: less typing, more reviewing and deciding. If you want this kind of velocity wired into your own stack, the AI Lab shows what I've shipped, and you can book a call to talk about it.

ShareLinkedInX / Twitter
Newsletter

Get the build log

One email a month with what I shipped, what broke, and what I learned. No spam, unsubscribe in one click.

Anil Pervaiz
Anil Pervaiz
AI Agents & Automation Engineer

I ship production AI for startups and teams — agents, RAG, automations — on a decade of design & Webflow craft.

About me →
← Newer
The shape of an AI architect
Older →
Websites that grow with the brand
← All articlesWork with me
Related reading

Keep going.

AI agency vs. in-house vs. fractional: how to staff your AI work
AI Architecture

AI agency vs. in-house vs. fractional: how to staff your AI work

The real trade-offs between hiring an AI agency, building an in-house team, and bringing in a fractional AI lead — and which fits your stage.

May 26, 2026·7 min read
How to add AI to your SaaS (without a rebuild)
AI Architecture

How to add AI to your SaaS (without a rebuild)

A practical sequence for shipping your first real AI feature into an existing product — what to build first, what to skip, and how not to break what already works.

May 26, 2026·7 min read
What does an AI consultant cost in 2026?
AI Architecture

What does an AI consultant cost in 2026?

Real 2026 pricing for AI audits, builds, retainers, and fractional leads — what drives the number, and how to avoid overpaying.

May 23, 2026·8 min read

London, UK — GMT/BST

hello@anilpervaiz.com

Async across US · UK · EU

Studio

  • Work
  • AI Lab
  • Services
  • About

Resources

  • Blog
  • FAQ
  • Newsletter

Contact

  • Email
  • Twitter / X
  • LinkedIn
Get started

An independent AI agents & automation engineer building production AI for startups & teams.

© 2026 Anil Pervaiz·
Terms & ConditionsPrivacy Policy
Anil Pervaiz