Playbook · 10 min read
How to Automate Workflows with AI
A six-step playbook we run for every new client. Skip the hype, ship something that earns its keep in week one.
Every AI automation project that fails has the same shape: someone picked a glamorous workflow, tried to do it end-to-end with an agent, and discovered week three that the LLM hallucinates 4% of the time and 4% of an important workflow is catastrophic.
The projects that work follow a boring playbook. Here it is.
Step 1
Pick the right workflow
The best first automation is repetitive, high-volume, and has a forgiving error budget. Aim for workflows that hit all three:
- Runs at least 10× per week.
- Currently costs a human 5+ minutes each time.
- A wrong answer can be caught and corrected — not pushed to a customer's bank account.
Good first picks: inbox triage, lead enrichment, meeting note distribution, support tier-1 deflection.
Bad first picks: outbound sales emails, code deploys, anything billing-related.
Step 2
Map the current process
Get the actual humans on a call. Walk the workflow step by step. Write down:
- The trigger (what tells someone to start?).
- Every manual step in order.
- Every tool touched (CRM, inbox, sheet, etc.).
- The exception cases — "if X, then we ping Y on Slack".
- The handoff (where does the work end?).
If you can't draw it on one page, you're not ready to automate it.
Step 3
Decide where AI is actually needed
Most steps don't need an LLM. Mark each step as one of:
- Deterministic: fetch, write, transform — use plain workflow nodes.
- Needs reading: understand free-text email, parse a PDF — use an LLM.
- Needs judgment: classify, prioritise, route — use an LLM with a short prompt.
Fewer LLM calls = cheaper, faster, more reliable. Use AI only where it earns its keep.
Step 4
Pick a small stack and stop
You need three layers and that's it:
- Orchestrator: n8n, Make, or Zapier.
- Brain: OpenAI, Anthropic, or Gemini.
- System of record: whatever CRM/helpdesk/database you already use.
Resist adding vector DBs, fine-tuned models, or new SaaS until the v1 has proven it's worth the complexity. Most workflows never need them.
Step 5
Ship behind a human, then remove the human
Don't deploy autonomous on day one. Three phases:
- Shadow: the automation runs and logs what it would have done; a person still does the work. Compare.
- Suggest: the automation drafts; a person approves with one click.
- Autonomous: safe paths run end-to-end; risky cases escalate to a human with full context.
Most clients are at "autonomous on safe paths" within two weeks.
Step 6
Measure and iterate
Track three numbers from week one:
- Time saved per week (hours × hourly cost = dollars).
- Accuracy on the steps the agent touches.
- Escalation rate — what % of runs need a human.
If accuracy drops below your threshold, tighten the prompt, add an example, or carve off the failure cases as deterministic exceptions. Don't expect 100% — aim for "better than the human on average".
What "ROI" actually looks like
For a typical 10–50-person business, the first AI automation pays back in 4–8 weeks. We see numbers like:
- Lead-to-CRM agent: 6 hours/week saved per sales rep.
- Support deflection: 40–70% of tier-1 tickets resolved without a human.
- Invoice processing: 8–12 hours/week reclaimed for a 15-person ops team.
- Inbox triage: founder inbox time drops from 60 min/day to ~10.