The agent mode is really disappointing. I thought OpenAI would try to be more in...

The agent mode is really disappointing. I thought OpenAI would try to be more innovative with how the agent interacts with webpages, but it looks like it's the same DOM parsing and screenshot workflow the rest of the AI browser agents use. Giving the agent full access to the page is a recipe for disaster.

We have better tools for this now. This is a draft video I put together for the W3C demoing WebMCP. It blows their agent mode out of the water, and you can even use in-browser models for inference (see the end of the video)

https://screen.studio/share/hbGudbFm

I've been working on this full-time after putting out the MCP-B/WebMCP Hacker News post.

https://news.ycombinator.com/item?id=44515403