SECTION 01 / OPENING

Notes on harness engineering

Where reliable AI agent capability actually comes from — and how to build it.

In engineering, AI capability does not live in the model alone. It is distributed across tools, verifiers, control flow, and the design of the operating environment. The Harness is a working notebook on that problem — writing from the intersection of agentic AI and architecture, engineering, and construction.

Written by Theodoros Galanos.

SECTION 02 / TOPICS

Harness Engineering

The tools, verifiers, orchestration, and process design that make agentic work reliable. Not the model — the system around it.

10 articles

Agent Evaluation

Benchmarking agents on real engineering tasks. What to measure, how to measure it, and why most evals miss what matters.

5 articles

AI in AEC

Applying agentic AI to architecture, engineering, and construction — where the work is instance-bound, constraint-heavy, and intolerant of generic answers.

9 articles

SECTION 03 / LATEST

2026-07-15 task-worlds

Fluent, But Unsafe

How 150 supposedly finished tasks and perfect model scores hid a weak engineering benchmark—and how auditable reviews exposed what the numbers missed.

SECTION 04 / RECENT