Beyond Screen Scraping: Transforming Web Pages Into Callable Tool Surfaces
The era of flaky screen scraping is over. We are entering the age of the 'Tool Surface', where websites explicitly define their capabilities for the agents that navigate them.

The Fragility of Visual Inference
For years, AI agents have navigated the web much as humans do: by 'looking' at rendered pages, or parsing their markup, and inferring what each element does. This approach, known as screen scraping or visual inference, is notoriously brittle. A renamed CSS class or a reordered layout can be enough to break an agent that worked the day before.
We are building a more resilient internet. By defining 'Tool Surfaces', we are giving agents a way to interact with functionality directly, bypassing the presentation layer entirely.
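To make the brittleness concrete, here is a minimal sketch of a selector-based scraper that locates a checkout button by its CSS class. The markup, the class names, and the `find_buttons` helper are all hypothetical and chosen only for illustration; the point is that a cosmetic rename leaves the page working for humans while the agent finds nothing.

```python
from html.parser import HTMLParser


class ButtonFinder(HTMLParser):
    """Collects the text of <button> elements that carry a given CSS class."""

    def __init__(self, css_class):
        super().__init__()
        self.css_class = css_class
        self._inside_match = False
        self.labels = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value may be None.
        classes = (dict(attrs).get("class") or "").split()
        if tag == "button" and self.css_class in classes:
            self._inside_match = True

    def handle_endtag(self, tag):
        if tag == "button":
            self._inside_match = False

    def handle_data(self, data):
        if self._inside_match and data.strip():
            self.labels.append(data.strip())


def find_buttons(html, css_class):
    """Hypothetical helper: return labels of buttons matching css_class."""
    finder = ButtonFinder(css_class)
    finder.feed(html)
    return finder.labels


# Hypothetical markup before and after a purely cosmetic redesign.
PAGE_V1 = '<form><button class="btn-checkout primary">Place order</button></form>'
PAGE_V2 = '<form><button class="cta-main">Place order</button></form>'

found_v1 = find_buttons(PAGE_V1, "btn-checkout")  # the button is found
found_v2 = find_buttons(PAGE_V2, "btn-checkout")  # same button, no match

print(found_v1)
print(found_v2)
```

The button still says "Place order" in both versions, yet the second lookup comes back empty because the scraper was coupled to presentation details rather than to the capability itself.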

From Buttons to Capabilities
Think about a checkout flow. For a human, this is a series of forms and buttons. For an agent, it should be a single 'checkout' tool with typed inputs for address, payment, and shipping. When the UI is treated as a capability rather than a visual obstacle, much of the friction of automating the web disappears. The benefits are concrete:
- Reduced Token Cost: Agents don't need to parse thousands of lines of HTML/CSS.
- Deterministic Outcomes: Actions succeed or fail with clear error codes, not visual guesses.
- Zero-UI Workflows: Tasks can be completed in the background without the browser ever opening.
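As a rough illustration of the idea, the sketch below models a 'checkout' tool with typed inputs and explicit result codes. Nothing here is a real standard or library API: `CheckoutInput`, `ToolResult`, `TOOL_SURFACE`, `call_tool`, and the error code strings are assumptions invented for this example, and a real site would call its order backend where the handler simply returns success.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple


@dataclass(frozen=True)
class CheckoutInput:
    """Typed inputs for the hypothetical 'checkout' tool."""
    address: str
    payment_token: str
    shipping: str  # expected to be one of ALLOWED_SHIPPING


@dataclass(frozen=True)
class ToolResult:
    """Structured outcome: a success flag plus a machine-readable code."""
    ok: bool
    code: str
    detail: str = ""


ALLOWED_SHIPPING = {"standard", "express"}


def checkout(args: CheckoutInput) -> ToolResult:
    # Hypothetical handler: validates inputs and reports a clear code.
    if not args.address.strip():
        return ToolResult(False, "INVALID_ADDRESS", "address must not be empty")
    if args.shipping not in ALLOWED_SHIPPING:
        return ToolResult(False, "UNSUPPORTED_SHIPPING", f"unknown option: {args.shipping}")
    if not args.payment_token:
        return ToolResult(False, "PAYMENT_REQUIRED", "payment token missing")
    return ToolResult(True, "OK", "order placed")


# Assumed registry shape: tool name -> (input type, handler).
TOOL_SURFACE: Dict[str, Tuple[type, Callable[..., ToolResult]]] = {
    "checkout": (CheckoutInput, checkout),
}


def call_tool(name: str, **kwargs) -> ToolResult:
    """Dispatch a call by tool name, turning bad input into an error code."""
    entry = TOOL_SURFACE.get(name)
    if entry is None:
        return ToolResult(False, "UNKNOWN_TOOL", name)
    input_type, handler = entry
    try:
        args = input_type(**kwargs)
    except TypeError as exc:  # missing or unexpected fields
        return ToolResult(False, "INVALID_ARGUMENTS", str(exc))
    return handler(args)


ok = call_tool("checkout", address="1 Main St", payment_token="tok_test", shipping="express")
bad = call_tool("checkout", address="1 Main St", payment_token="tok_test", shipping="drone")
missing = call_tool("checkout", address="1 Main St")

print(ok.code, bad.code, missing.code)
```

The agent never touches HTML in this sketch: it passes three named fields and receives a code it can branch on, which is the kind of clear success or failure the list above describes.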
The 'Tool Surface' is the next evolution of the API. Its promise is software that is interoperable by default, because agents call declared capabilities instead of guessing at interfaces.