Playbook · 10 min read

How to Automate Workflows with AI

A six-step playbook we run for every new client. Skip the hype, ship something that earns its keep in week one.

Every AI automation project that fails has the same shape: someone picked a glamorous workflow, tried to do it end-to-end with an agent, and discovered week three that the LLM hallucinates 4% of the time and 4% of an important workflow is catastrophic.

The projects that work follow a boring playbook. Here it is.

Step 1

Pick the right workflow

The best first automation is repetitive, high-volume, and has a forgiving error budget. Aim for workflows that hit all three:

  • Runs at least 10× per week.
  • Currently costs a human 5+ minutes each time.
  • A wrong answer can be caught and corrected — not pushed to a customer's bank account.

Good first picks: inbox triage, lead enrichment, meeting note distribution, support tier-1 deflection.

Bad first picks: outbound sales emails, code deploys, anything billing-related.

Step 2

Map the current process

Get the actual humans on a call. Walk the workflow step by step. Write down:

  • The trigger (what tells someone to start?).
  • Every manual step in order.
  • Every tool touched (CRM, inbox, sheet, etc.).
  • The exception cases — "if X, then we ping Y on Slack".
  • The handoff (where does the work end?).

If you can't draw it on one page, you're not ready to automate it.

Step 3

Decide where AI is actually needed

Most steps don't need an LLM. Mark each step as one of:

  • Deterministic: fetch, write, transform — use plain workflow nodes.
  • Needs reading: understand free-text email, parse a PDF — use an LLM.
  • Needs judgment: classify, prioritise, route — use an LLM with a short prompt.

Fewer LLM calls = cheaper, faster, more reliable. Use AI only where it earns its keep.

Step 4

Pick a small stack and stop

You need three layers and that's it:

  • Orchestrator: n8n, Make, or Zapier.
  • Brain: OpenAI, Anthropic, or Gemini.
  • System of record: whatever CRM/helpdesk/database you already use.

Resist adding vector DBs, fine-tuned models, or new SaaS until the v1 has proven it's worth the complexity. Most workflows never need them.

Step 5

Ship behind a human, then remove the human

Don't deploy autonomous on day one. Three phases:

  1. Shadow: the automation runs and logs what it would have done; a person still does the work. Compare.
  2. Suggest: the automation drafts; a person approves with one click.
  3. Autonomous: safe paths run end-to-end; risky cases escalate to a human with full context.

Most clients are at "autonomous on safe paths" within two weeks.

Step 6

Measure and iterate

Track three numbers from week one:

  • Time saved per week (hours × hourly cost = dollars).
  • Accuracy on the steps the agent touches.
  • Escalation rate — what % of runs need a human.

If accuracy drops below your threshold, tighten the prompt, add an example, or carve off the failure cases as deterministic exceptions. Don't expect 100% — aim for "better than the human on average".

What "ROI" actually looks like

For a typical 10–50-person business, the first AI automation pays back in 4–8 weeks. We see numbers like:

  • Lead-to-CRM agent: 6 hours/week saved per sales rep.
  • Support deflection: 40–70% of tier-1 tickets resolved without a human.
  • Invoice processing: 8–12 hours/week reclaimed for a 15-person ops team.
  • Inbox triage: founder inbox time drops from 60 min/day to ~10.