#ai-in-aec

Posts in this topic thread.

2026-04-03

Why domain-specific agent harnesses, not bigger models, are what close the AI performance gap on real engineering tasks — and why the AEC industry needs proper benchmarks to prove it.

benchmarks aec ai-agents harness-design evaluation ai-in-aec 2026-03-14

What If the Harness Could Improve Itself?

Applying the autoresearch pattern to self-improve an engineering agent harness. Automated prompt optimisation across HVAC audit tasks on Claude and GPT-4.1-mini, showing how harness engineering compounds when the improvement loop runs itself.

harness-engineering autoresearch agentic-ai ai-in-aec ai-benchmarks agent-evaluation prompt-optimisation design-review 2026-03-12

Benchmarking Agents on Real Engineering Work Is Already Teaching Us Something Important

Benchmarking AI agents on real HVAC engineering tasks across Claude and GPT models. Results on harness-dependent capability, agent evaluation design, and why AEC-domain benchmarks reveal what general benchmarks miss.

harness-engineering agentic-ai ai-in-aec ai-benchmarks agent-evaluation hvac-ai design-review mep-automation 2026-03-10

Where Capability Actually Lives in Agentic Engineering

In AEC and domain-specific engineering, AI agent capability lives not in the model alone but in harness engineering — the tools, verifiers, orchestration, and process design that make agentic work reliable.

harness-engineering agentic-ai ai-in-aec engineering-ai orchestration agent-reliability construction-ai