Foundations

Foundations

Where You Can Afford to Be Surprised

"Agent" means at least three different things right now, and the confusion isn't just semantic. Underneath the terminology sits a design decision with real operational consequences: who picks the next step, your code or the model? Code-determined paths produce bounded, explainable failures. Model-determined paths produce creative ones you didn't plan for. Most production systems need both. The hard part is drawing the line between them, and it's a risk-tolerance decision that happens to look like an architecture one.

Where You Can Afford to Be Surprised
"Agent" means at least three different things right now, and the confusion isn't just semantic. Underneath the terminology sits a design decision with real operational consequences: who picks the next step, your code or the model? Code-determined paths produce bounded, explainable failures. Model-determined paths produce creative ones you didn't plan for. Most production systems need both. The hard part is drawing the line between them, and it's a risk-tolerance decision that happens to look like an architecture one.
What a Website Chooses to Tell an Agent

A new browser protocol lets websites publish structured tool definitions that agents can call directly, skipping the page entirely. A site registers a product search with four categories in the enum. The actual catalog has fifty. The agent receives that simplified view as the complete picture and has no mechanism to ask what else exists. When a website decides what to tell an agent about itself, it's also deciding what disappears.
What a Website Chooses to Tell an Agent
Anew browser protocol lets websites publish structured tool definitions that agents can call directly, skipping the page entirely. A site registers a product search with four categories in the enum. The actual catalog has fifty. The agent receives that simplified view as the complete picture and has no mechanism to ask what else exists. When a website decides what to tell an agent about itself, it's also deciding what disappears.

Further Reading




Past Articles

Before a browser agent reasons about a task, it needs a representation of the page. Raw HTML, a screenshot, or the acces...

A developer's coding agent worked perfectly for thirty minutes, then started contradicting its own decisions. The prompt...

A login form needs zero intelligence. A Playwright selector finds the username field, fills it, clicks "Sign In" in mill...

A system with 95% reliability at each model-directed step delivers 36% reliability over twenty steps. The math is just m...
