LLM systems that do real work — without becoming a money pit.

I build and optimize business processes with AI agents, then measure the cost per outcome so you can see the return before you scale it. Production systems, traced to the dollar — not pilots that quietly bleed.

20+ years shipping production software · AI agents running in production today · Toronto-based

Konstantin Borovik

the problem

Every business runs on a layer of repetitive, lookup-heavy work that quietly consumes skilled time. Answering the same product questions over and over. Pulling a number out of a document and writing it into a reply. Triaging inbound email. Filling the same fields from the same forms. None of it is hard — that’s exactly why it’s expensive. It’s a knowledgeable person, on a loaded salary, spending hours a week on tasks that follow a pattern.

The cost is hidden because it never shows up as a line item. It’s spread across people who could be doing higher-value work, and across customers who wait hours for an answer that could take seconds.

Repetitive, lookup-heavy work is what LLM systems are genuinely good at — high-volume, rule-following tasks, done instantly and consistently. The promise is simple: take that repetitive load off your team, answer it in seconds instead of hours, and free your skilled people for the work that actually needs a human.

The catch is that it has to be built so you can trust it and afford it — grounded in real sources, designed to decline rather than guess, and measured to the dollar. To show what that looks like in practice, here’s a real one I built and the math behind it.

what I do

01

Build

A new business process, powered by an LLM agent and built to be measured from day one. High-volume, lookup-heavy, repetitive work — pre-sales questions, RFP responses, document processing, tier-one support — answered instantly and consistently, with the cost per outcome traced before you scale it.

02

Optimize / Rescue

An AI initiative that’s stalled or running up a bill nobody can explain. I instrument what it’s actually doing, find where the money goes, cap the runaway paths, and add the grounding and guardrails that should have been there first — so it earns its keep instead of becoming the budget line you regret.

Either way you get one senior owner end to end: the retrieval, the guardrails, the observability, the deployment. No layer of account managers between you and the person building it.

proof

I built MailPilot to solve a business problem of my own (not a product I sell) — a production AI agent running on my company’s inbox. It reads an incoming business email, searches a knowledge base, and replies with a sourced answer in under a minute. If the question is outside its knowledge base, it declines and says so rather than guessing.

Here is the real proof — not just words, not empty promises →

Every model call is traced, so the economics aren’t a guess:

MailPilot (measured) A person (assumption)
Cost per sourced reply ~$0.03 ~$5.00
Response time ~20 seconds hours
Out-of-scope questions declines, no guessing varies
At 1,000 emails / month ~$30 ~$5,000

More than a 100× difference in cost, hours-to-seconds in speed. The point isn’t to replace people — you can’t fire 1/100th of an employee. It’s that a category of repetitive work stops consuming thousands of dollars of skilled time and frees that person for work that needs them.

The cost is not the only risk — it’s an agent confidently inventing a price or a spec and sending it over your name. MailPilot grounds every answer in a source document and cites the file it used, so any reply is checkable. Out-of-scope detection and fabrication checks run as mechanical tests in every build cycle — behavior measured in the logs, not asserted in a slide.

Read the full cost-per-outcome breakdown →

how the money stays under control

None of this is exotic — it’s just the discipline most pilots skip:

That’s the difference between the 5% and the 95%, and it’s mostly a question of how the system is built — not which model it uses.

why me

You pay wholesale, not retail.

When you hire a software agency, most of your bill never reaches the people writing your software. It pays for the sales team that closed the deal, the account manager who forwards your emails, the project manager who runs the standups, and the margin stacked on top of all of it. The engineer doing the actual work often sees only a fraction of the rate you’re billed — by common estimates as little as a quarter of it. Put the other way: the rate an agency charges is frequently three to four times what reaches the person who writes your code.

Working with me, that overhead doesn’t exist. No sales layer, no management tier, no markup on a subcontractor. You pay the person building the system directly — and you get that senior person on every call, not a junior assigned after the pitch.

It’s also why the strongest engineers don’t stay at agencies: the model keeps most of what they earn. Going direct gets you both the more senior person and the lower rate at once — the two things an agency’s structure is built to keep apart.

The credentials behind that:

Built on: Anthropic Claude · Google Gemini · Python · FastAPI · Pydantic AI · RAG · MCP · Google Cloud / Azure · Terraform · Kubernetes · OpenTelemetry

closing

If you’re evaluating whether AI can do real work in your business without becoming a money pit — that’s precisely the question I build for, and measure.

Want the cost-per-outcome math on a workflow in your own operation?