Show HN: Open-source browser for AI agents(github.com/theredsix)
76 points by theredsix 6 hours ago | 20 comments
- Retr0id 5 hours ago> As proof, ABP with opus 4.6 as the driver scores 90.5% on the Online Mind2Web benchmark
And what does opus score with "regular" browser harnesses?
[-]- 9wzYQbTYsAIc 4 hours ago90% easy or 90% average?[-]
- theredsix 3 hours ago90% average with 85.51% hard![-]
- 9wzYQbTYsAIc 3 hours agoNice! Will take a look at this for my homelab - was debating using crawl.cloudflare.com to try it out, as browser rendering was my next stretch goal.
- esafak 5 hours ago[-]
- Retr0id 4 hours agoHm I can't see Opus 4.6 on there[-]
- theredsix 3 hours agoI tweeted at the OSUNLP and they're backed up on eval validation. In the meantime, here's the benchmark repo with the saved runs and also instructions on how to run it locally. https://github.com/theredsix/abp-online-mind2web-results
- appcustodian2 28 minutes agohow do you know when a page is "settled"?[-]
- theredsix 16 minutes agoGood question! ABP keeps a list of all same/parent/sibling network request and wait for them to complete within a timeout. If the timeout hits, it'll still freeze and screenshot back to the agent. There's a browser_wait() that the agent can call with increased timeouts to wait for network requests + DOM changes.
- nobrains 8 minutes agoload event or "DOMContentLoaded" event. No?
- giancarlostoro 5 hours agoInteresting, I wonder if this would help with other projects too, one project that comes to mind is archivebox, I don't know if they still have the issue I'm thinking of, but archivebox eventually had the Chrome instances (as the meme goes) basically consume all available RAM. If by freezing execution this could stop that, it could be useful for more than just AI agents.[-]
- theredsix 3 hours agoYeah, I noticed CPU use goes to near zero during the pausing phase. You can also trigger pause via REST/MCP so a script can take advantage of these abilities as well.
- theredsix 6 hours agoOp here, happy to answer any question![-]
- esafak 5 hours agoHow does it compare with https://agent-browser.dev/ ? It would be great if you could add it to your table: https://github.com/theredsix/agent-browser-protocol?#compari...[-]
- octoclaw 3 hours ago[dead]
- bhekanik 4 hours ago[dead]
- webpolis 2 hours ago[dead][-]
- sebmellen 1 hour agoDoes it feel good to be botting HN with ads for your own product?
I'm so sick of reading OpenClaw comments! No activity for 7 months, and then in the past day, five comments from an LLM pitching your tool. What are you doing man? This degrades the quality of HN so badly.
- theredsix 2 hours agoGreat insight! ABP exposes display resolution controls right now. I've noticed almost zero reCAPTCHAs during testing compared puppeteer stealth or other packages. Regarding the freezing mechanic, virtualtime is paused as well and the entire browser clock is captured so it would be very hard for a page's JavaScript to notice the time drift unless they were querying an external API clock.