About Agent Harnesses

A brief overview of agent harnesses such as Claude Code, Codex, Opencode, and Pi.


Overview

You may have heard the term agent harness thrown around here and there, but what exactly is an agent harness?

In simple words, it's just a wrapper over the LLM API. This wrapper consists of things like the SYSTEM_PROMPT, a context manager that manages your conversation history, tools that can be called to perform certain tasks, and so on. The LLM (the brain) does the thinking and the harness (the body) performs the work.

At its core it's just a loop, running until the LLM thinks it has done a good enough job finishing the task you gave it, or until you interrupt it.
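That loop can be sketched in a few lines. The `call_llm` and `run_tool` functions below are hypothetical stand-ins for a real API client and tool dispatcher, not any particular harness's implementation:

```python
# Minimal agent-loop sketch. call_llm and run_tool are hypothetical
# placeholders standing in for a real LLM API client and tool executor.

def call_llm(messages):
    # Placeholder: a real harness would call the provider's API here.
    # We fake one tool call, then a final answer, to show the flow.
    if not any(m["role"] == "tool" for m in messages):
        return {"role": "assistant",
                "tool_calls": [{"id": "t1", "name": "bash",
                                "args": {"command": "ls"}}]}
    return {"role": "assistant", "content": "done"}

def run_tool(call):
    # Placeholder tool execution; a real harness would run bash/read/etc.
    return {"role": "tool", "tool_call_id": call["id"], "content": "main.go"}

def agent_loop(user_prompt, system_prompt="You are a coding agent."):
    messages = [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": user_prompt},
    ]
    while True:                      # run until the model stops calling tools
        reply = call_llm(messages)
        messages.append(reply)
        if not reply.get("tool_calls"):
            return reply["content"]  # model considers the task finished
        for call in reply["tool_calls"]:
            messages.append(run_tool(call))

print(agent_loop("what does this project do?"))
```

Everything a harness does (system prompt, context management, tools) plugs into some part of this loop.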

Let's go over each of its components one by one.

Components

SYSTEM_PROMPT

It's just some text that tells the model how to behave and certain guidelines it needs to follow. It also lists the tools (and skills) the model has access to.

For reference, below is part of Pi's system prompt. It's very minimal and token efficient.

At the API level it is sent as follows:

{
  "messages": [
    {
      "role": "system",
      "content": "unnecessarily long system prompt"
    }
  ]
}
You are an expert coding assistant operating inside pi, a coding agent harness. You help users by reading files, executing commands, editing code, and writing new files.

Available tools:

- read: Read file contents
- bash: Execute bash commands (ls, rg, find, etc.)
- edit: Make precise file edits with exact text replacement, including multiple disjoint edits in one call
- write: Create or overwrite files

In addition to the tools above, you may have access to other custom tools depending on the project.

Guidelines:

- Use bash for file operations like ls, rg, find
- Use read to examine files instead of cat or sed.
- Use edit for precise changes (edits[].oldText must match exactly)
- When changing multiple separate locations in one file, use one edit call with multiple entries in edits[] instead of multiple edit calls
- Each edits[].oldText is matched against the original file, not after earlier edits are applied. Do not emit overlapping or nested edits. Merge nearby changes into one edit.
- Keep edits[].oldText as small as possible while still being unique in the file. Do not pad with large unchanged regions.
- Use write only for new files or complete rewrites.
- Show file paths clearly when working with files
- Be extremely concise in your responses, sacrifice grammar for the sake of concision.
- Never use emojis.
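The "matched against the original file" rule in those guidelines is worth pausing on. Here is a rough sketch of what applying multiple disjoint edits that way could look like; this is my own approximation of the semantics described above, not Pi's actual implementation:

```python
# Sketch of edit-tool semantics: each oldText is located in the ORIGINAL
# file, overlaps are rejected, and all replacements are applied at once.
# This approximates the guidelines above; it is not Pi's actual code.

def apply_edits(original: str, edits: list[dict]) -> str:
    spans = []
    for e in edits:
        if original.count(e["oldText"]) != 1:
            raise ValueError("oldText must match exactly once")
        start = original.index(e["oldText"])
        spans.append((start, start + len(e["oldText"]), e["newText"]))
    spans.sort()
    # Reject overlapping edits, as the guidelines require.
    for (_, end_a, _), (start_b, _, _) in zip(spans, spans[1:]):
        if start_b < end_a:
            raise ValueError("overlapping edits")
    out, last = [], 0
    for start, end, new in spans:
        out.append(original[last:start])  # unchanged region before the edit
        out.append(new)                   # the replacement
        last = end
    out.append(original[last:])
    return "".join(out)

src = 'func main() {\n    fmt.Println("hello")\n}\n'
print(apply_edits(src, [
    {"oldText": "hello", "newText": "hello world"},
    {"oldText": "func main", "newText": "// entry point\nfunc main"},
]))
```

Because every oldText is resolved against the original file, the order of entries in edits[] doesn't matter, which is exactly why nearby changes must be merged rather than nested.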

I have made some changes to the system prompt; you can check the original using the link below.

Pi system prompt

Some parts are included dynamically:
  • Today's Date
  • Current Working Directory
  • Extra Repo level context files (like CLAUDE.md or AGENTS.md)
  • Available Skills
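A rough sketch of how a harness might assemble those dynamic parts at startup; the section layout and wording here are my assumptions, not Pi's actual format:

```python
import os
from datetime import date
from pathlib import Path

# Hypothetical sketch: build a system prompt with dynamic context.
# BASE_PROMPT and the section headers are assumptions for illustration.
BASE_PROMPT = "You are an expert coding assistant operating inside a harness."

def build_system_prompt(cwd: str) -> str:
    parts = [
        BASE_PROMPT,
        f"Today's date: {date.today().isoformat()}",
        f"Working directory: {cwd}",
    ]
    # Repo-level context files, if present (CLAUDE.md / AGENTS.md).
    for name in ("CLAUDE.md", "AGENTS.md"):
        ctx = Path(cwd) / name
        if ctx.is_file():
            parts.append(f"## {name}\n{ctx.read_text()}")
    return "\n\n".join(parts)

print(build_system_prompt(os.getcwd()))
```

The point is that the "static" system prompt is really a template: the harness re-renders it per session with the date, working directory, and any repo context it finds.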
Claude Code's system prompt, on the other hand, is quite verbose.

It includes instructions such as:

Refuse requests for destructive techniques, DoS attacks, mass targeting, supply chain compromise, or detection evasion for malicious purposes. Dual-use security tools (C2 frameworks, credential testing, exploit development) require clear authorization context: pentesting engagements, CTF competitions, security research, or defensive use cases.

You can have a look at it here.

Claude Code's system prompt weighs in at roughly 25,000+ tokens, whereas Pi's is around 2,600 tokens.

Context Manager

LLMs are stateless machines: they do not know what you sent them in previous messages.

The state management needs to be done by someone else. The harness maintains a messages array with all the data exchanged during a conversation. The entire messages array is passed to the LLM via the API on every single request.

This means the bigger the messages array grows, the longer the LLM takes to respond, since it needs more time to process all those tokens; network latency also increases because more and more tokens are sent over the API. This happens on every single request: when you send a message, and when a tool call returns a result. It can cause significant delays in responses.

This is how almost all harnesses work, at least over the standard API.
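You can see the growth directly with a toy model of the harness's side of the exchange (character counts stand in for tokens here):

```python
# Illustration: with a standard (stateless) API, every request re-sends
# the FULL messages array, so the payload grows with conversation length.
# Character counts are a crude stand-in for tokens.

messages = [{"role": "system", "content": "system prompt"}]

def request_payload(new_message: dict) -> int:
    """Append the new message, then 'send' the entire history."""
    messages.append(new_message)
    return sum(len(m["content"]) for m in messages)

sizes = [request_payload({"role": "user", "content": f"turn {i}: " + "x" * 100})
         for i in range(5)]
print(sizes)  # strictly increasing: each turn re-sends everything before it
```

Every turn pays again for every previous turn, which is why long agent sessions get slower and more expensive as they go.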

To solve this, OpenAI has a websocket mode via the Responses API, which maintains a persistent websocket connection and stores the messages array on their side.

The client only needs to send the new message or tool-call result instead of the entire conversation, which reduces network latency.
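As a back-of-the-envelope comparison (a conceptual illustration of stateless vs. server-side state, not OpenAI's actual wire protocol):

```python
# Conceptual comparison, NOT OpenAI's actual protocol: when the server
# keeps the conversation state, the client transmits only the new item.

history = [{"role": "user", "content": "x" * 500} for _ in range(20)]
new_message = {"role": "user", "content": "and one more question"}

# Stateless API: the whole history plus the new message goes over the wire.
stateless_bytes = sum(len(m["content"]) for m in history + [new_message])

# Server-side state: only the delta goes over the wire.
stateful_bytes = len(new_message["content"])

print(stateless_bytes, stateful_bytes)
```

The gap widens with every turn, since the stateless payload includes the entire past while the stateful one stays the size of a single message.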

For example, here is what a short conversation's messages array looks like as it grows:

system
user: what does this project do?
assistant: [tool call: bash, id tooluse_Ub1uBCp1PJurLiSN4z5HAq]
  {"command": "ls -la"}
tool result (tooluse_Ub1uBCp1PJurLiSN4z5HAq)
assistant: [tool call: read, id tooluse_XG5PMLAtBXCiQUgHDtJtSK]
  {"path": "/Users/harshwardhan/temp/demo/main.go"}
tool result (tooluse_XG5PMLAtBXCiQUgHDtJtSK)
assistant: This is a minimal Go project that simply prints "hello world".