BRIEFING 003

Replacing the spreadsheet someone updates at 11pm.

Every operation has one. The Tuesday-morning roll-up, the Friday compliance digest, the Sunday-night inventory reconciliation. The spreadsheet someone secretly maintains at 11pm because if they don't, Monday breaks. Here's how to retire it without breaking what already works.

Find the spreadsheet first.

In every operation we've looked at, there's one document, usually a Google Sheet or an Excel file, that quietly carries more weight than its appearance suggests. It's the spreadsheet that gets opened at 11pm on Sunday because the operations lead remembers Monday's leadership meeting needs the numbers. Or it's the Friday-afternoon scramble to roll up the week's hours before payroll closes. It's the reconciliation between two systems that supposedly talk to each other but actually don't.

This is the right place to start with AI. Not your chatbot. Not your customer service. Not your "AI-powered" anything. The spreadsheet.

Why this is the right starting point

Three reasons:

  1. The work is well-defined. Whoever maintains it can describe it in two minutes. "I pull X from system A, Y from system B, drop them into these columns, this formula calculates the delta, then I email it to the leadership team." That's a job spec, and it took 30 seconds to write.
  2. The output is internal. Low-stakes. If the AI version is wrong once in the first month, you fix it and try again. Nobody got fired. No customer was harmed.
  3. The savings are immediate. Whoever was doing it at 11pm gets that hour back. They become enthusiastic supporters of further AI projects. Without that internal advocate, the second project never happens.

How to spot YOUR 11pm spreadsheet

Sometimes you know what it is the moment you read this. If not, ask three people in your operation a single question:

"What's the recurring task you do every week that you wish you didn't have to do?"

You'll get a few different answers. Look for the overlap. The thing that two or more people mention, that's your candidate. If everyone mentions a different thing, pick the one that runs on the most predictable schedule. Schedule beats stakes for a first project.

Good candidates share these properties

  • The data sources are tools that have APIs (any modern SaaS, your CRM, accounting tool, project tracker, cloud database).
  • The output is a document, a Slack message, or an email, not a phone call or a meeting.
  • The structure of the output doesn't change week to week. (The numbers change. The shape doesn't.)
  • It currently takes 1–4 hours per cycle. Less than 1 hour: not worth the project. More than 4 hours: scope it down before you start.
  • Someone could explain how to do it correctly to a new hire in 30 minutes.

Bad candidates share these properties (skip these for now)

  • "It depends on what's happening that week", too much judgment, too little structure.
  • The task involves contacting customers in any way (much higher stakes, leave it for project 4 or 5).
  • The data sources are PDFs, scanned documents, or things in someone's email inbox that they manually triage. Solvable, but harder; not a first project.
  • "It's faster to just do it than explain it", usually a sign the job isn't actually well-defined.

The scoping conversation (60 minutes)

Once you have a candidate, sit down with the person who currently does it. One hour, maximum. Cover these six questions and write the answers in a shared doc:

  1. What triggers it? ("Every Monday at 7am" / "When a new lead enters CRM" / "When QuickBooks closes the month")
  2. Where does the data come from? List every source, with names of specific tables/views/folders/sheets.
  3. What transformations happen? Be exact. "If column X is empty, skip the row. If column Y > column Z, flag it as 'overdue.'"
  4. What's the output? A Slack message? An email? A doc? Show an example of the most recent one.
  5. Who reads it? Names. Roles. What action do they take after reading?
  6. What can go wrong? What's the worst-case error and how would someone notice?

If at the end of this hour you can't answer all six questions, the task isn't ready to be automated. That's not a failure, it's information. Sharpen the spec, or pick a different candidate.

Building it (the technical bit, kept short)

The implementation looks roughly the same for every operation:

1. MCP servers for each data source
   (existing ones if available, custom for internal tools)

2. A scheduled trigger
   (cron job, GitHub Actions, or your existing scheduler)

3. An agent script that:
   - Calls each MCP server to pull data
   - Applies the transformations from the spec
   - Generates the output document/message
   - Posts it where it needs to go

4. Error handling that pings a human if anything fails

The whole thing is typically 200-500 lines of code, depending on complexity. Building it from scratch with AI assistance: 2-3 days of focused work. Hiring a specialist: 1-2 weeks elapsed. Either way, far cheaper than the consulting-firm "AI roadmap" that every operation gets pitched.

The transition period

Don't kill the old spreadsheet on day one. For the first month, run the agent in parallel with the manual process. The 11pm operator still does their thing; the agent also runs. Compare the outputs. If they match for 4-5 cycles in a row, you have green light. Retire the manual version.

This step is non-negotiable for two reasons:

  1. It catches edge cases the spec missed. There are always edge cases the spec missed.
  2. It builds trust. The team needs to see the agent be correct over a real period of time before they'll let go of the manual safety net.

What changes after

Three things, in this order:

  1. The output gets better, not just faster. An agent that runs reliably every week tends to produce more consistent output than a human doing it at 11pm. The format stabilizes. Edge cases get handled the same way every time. Numbers stop drifting because of last-minute manual tweaks.
  2. The schedule becomes load-bearing. Once leadership knows the report shows up every Monday at 7am without fail, they start using it differently. Decisions get made earlier in the week. The data stops being something you ask for and becomes something that arrives.
  3. The 11pm operator is freed up. Often this person is one of your most valuable employees, that's why they were doing the manual roll-up in the first place. Now their hours go to the work that actually requires their judgment. This is the biggest dollar-value win, and the hardest to put on a project ROI sheet.

Don't make these mistakes

  • Don't build the perfect agent. Build the boring agent that exactly replicates the current spreadsheet. Improvements come later, after it's running reliably.
  • Don't hide the agent. Tell the team what it does, where it runs, who owns it. Mystery automation is the fastest way to lose trust the next time you propose a project.
  • Don't pick the spreadsheet that's the most painful. Pick the one that's the most predictable. Pain doesn't predict automation success, predictability does.

What we do

WildBreeze specializes in exactly this category of project: custom scheduled agents for small operations, lives in your cloud, three weeks, fixed price. If your 11pm spreadsheet has a name, we'd like to hear about it. Tell us about your operation.


Related: What is an AI agent? · AI for small business: a realistic 90-day plan

Keep reading

More from the Field Guide