HACKER Q&A
📣 galaxyeye

Is a JVM/CDP-based browser-agent stack fundamentally a bad idea?


Hi HN,

We built a very early prototype: a browser-agent / browser-automation runtime using Kotlin/JVM and raw CDP. Before investing further, we'd like advice from anyone who has worked on browser agents, AI browsers, large-scale automation, crawling, or browser farms, or who has deep knowledge of Chromium/CDP. We ourselves suspect many of our design assumptions may be flawed, so sharp criticism is very welcome.

---

TL;DR

We're building an open-source runtime:

• AI planning/reasoning/logic lives on the JVM
• Browser actions are driven via raw CDP
• High concurrency via Kotlin coroutines
• A small ML agent learns page structure

But we're not sure any of this is actually meaningful. Feedback, especially negative feedback, is appreciated.

---

1. JVM + CDP: possibly the wrong abstraction layer

AI planning/reasoning/logic runs on the JVM; browser actions are sent through CDP (a minimal sketch of what "raw CDP on the JVM" means to us appears after section 4). Some doubts we cannot resolve internally:

• Is the JVM too heavy for this domain? Will GC pauses and scheduling cause tail latency?
• Is CDP inherently unsuitable for high-throughput automation?
• Does anyone actually need a JVM-native browser agent?
• Would Go, Node, or Python be a more sensible choice?

If the answer is "no, this is the wrong direction," we'd really like to hear it.

---

2. High-concurrency runtime: likely to fall apart in real workloads

We're trying to push single-machine throughput on real, complex pages by relying on (the second sketch after section 4 shows the skeleton we mean):

• Kotlin coroutines
• Minimizing DevTools round-trips
• Raw CDP with multi-tab concurrency

Our doubts here are even larger:

• Can Chromium realistically survive this scale (render-process contention, GPU-thread limits, compositor stalls, etc.)?
• Are multi-tab workloads doomed to event interference, reordering, and deadlocks?
• Will CDP scheduling become the true bottleneck?
• Is raw CDP unavoidably more brittle than Playwright?

If you've seen similar attempts fail, we'd especially like to know how they failed.

---

3. Non-LLM page-structure learning: probably not generalizable

We built a small ML module to avoid calling an LLM every time we parse HTML (the third sketch after section 4 shows the flavor of featurization involved). It works well on e-commerce pages, but we strongly suspect it will break elsewhere. Concerns:

• Will it fail outright on news sites, forums, SaaS dashboards, and other domains?
• Has anyone built DOM-structure-learning systems and then abandoned them? Why?
• Is the long tail of the web fundamentally hostile to non-LLM approaches?

Failure stories are particularly valuable.

---

4. Some questions we have zero confidence about

• Does the world actually need yet another browser-automation stack?
• Do "browser agents" have long-term practical value at all?
• Do coroutine-style concurrency models provide real benefits under heavy CDP I/O?
• Should we drop the "agent" layer entirely and just build a runtime?
• What fatal issues exist around resource isolation, multi-tenancy, event storms, or long-tail page behavior?
• Do all high-concurrency browser runtimes eventually die for the same reasons?

If the answer is "yes, stop now," we'd prefer to know early.
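---

Rough sketches (for concreteness)

These are simplified illustrations, not our actual code. First, what "raw CDP on the JVM" means to us: a plain WebSocket carrying JSON envelopes, here via Java's built-in java.net.http client. The ws:// URL is a placeholder; in practice you fetch webSocketDebuggerUrl from http://localhost:9222/json after starting Chrome with --remote-debugging-port=9222.

    import java.net.URI
    import java.net.http.HttpClient
    import java.net.http.WebSocket
    import java.util.concurrent.CompletableFuture
    import java.util.concurrent.CompletionStage

    fun main() {
        // Placeholder target; real code discovers this via the /json endpoint.
        val wsUrl = "ws://localhost:9222/devtools/page/REPLACE_WITH_TARGET_ID"
        val done = CompletableFuture<Unit>()

        val listener = object : WebSocket.Listener {
            override fun onText(ws: WebSocket, data: CharSequence, last: Boolean): CompletionStage<*>? {
                // Command responses and unsolicited events share one socket;
                // frames may also arrive split (last == false), ignored here.
                println("CDP <- $data")
                if (data.contains("\"id\":1")) done.complete(Unit)
                ws.request(1) // ask for the next frame
                return null
            }
        }

        val ws = HttpClient.newHttpClient()
            .newWebSocketBuilder()
            .buildAsync(URI.create(wsUrl), listener)
            .join()

        // Every CDP command is a JSON envelope: id + method + params.
        ws.sendText("""{"id":1,"method":"Page.navigate","params":{"url":"https://example.com"}}""", true)
        done.join()
        ws.sendClose(WebSocket.NORMAL_CLOSURE, "done").join()
    }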
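Second, the coroutine skeleton behind "multi-tab concurrency over one socket" (kotlinx.coroutines; error frames, timeouts, and event routing are omitted, and sendRaw stands in for the actual socket writer):

    import kotlinx.coroutines.CompletableDeferred
    import java.util.concurrent.ConcurrentHashMap
    import java.util.concurrent.atomic.AtomicLong

    class CdpMux(private val sendRaw: (String) -> Unit) {
        private val nextId = AtomicLong(1)
        private val inflight = ConcurrentHashMap<Long, CompletableDeferred<String>>()

        // One call per tab coroutine: suspends without blocking a thread, so a
        // small thread pool can keep many tabs in flight at once.
        suspend fun call(sessionId: String, method: String, params: String): String {
            val id = nextId.getAndIncrement()
            val reply = CompletableDeferred<String>()
            inflight[id] = reply
            // sessionId comes from Target.attachToTarget (flatten mode), which
            // is how one socket multiplexes many tabs.
            sendRaw("""{"id":$id,"sessionId":"$sessionId","method":"$method","params":$params}""")
            return try { reply.await() } finally { inflight.remove(id) }
        }

        // Invoked by the single WebSocket reader loop after parsing "id" from a frame.
        fun onResponse(id: Long, frame: String) {
            inflight.remove(id)?.complete(frame)
        }
    }

One suspending caller per tab, a shared id-to-Deferred map, and a single reader loop completing the deferreds; whether this survives real event storms is exactly what we're unsure about.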
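Third, the flavor of the non-LLM structure learner: hand-rolled per-node features feeding a cheap classifier. jsoup is used here only to make the sketch self-contained, and the feature set is deliberately toy-sized; the real module uses many more signals.

    import org.jsoup.Jsoup
    import org.jsoup.nodes.Element

    // Each DOM node becomes a small numeric vector that a cheap classifier
    // scores as price/title/image/other, with no LLM call per page.
    fun features(el: Element): DoubleArray = doubleArrayOf(
        el.parents().size.toDouble(),                 // depth in the tree
        el.children().size.toDouble(),                // fan-out
        el.ownText().length.toDouble(),               // direct text mass
        if (el.ownText().contains(Regex("""[€¥£$]\s*\d"""))) 1.0 else 0.0, // currency hint
        if (el.tagName() in setOf("h1", "h2", "h3")) 1.0 else 0.0,         // heading tag
        el.className().split(' ').count { it.contains("price", ignoreCase = true) }.toDouble()
    )

    fun main() {
        val html = """<div class="product"><h1>Widget</h1><span class="price">$9.99</span></div>"""
        val doc = Jsoup.parse(html)
        doc.select("h1, span").forEach { println("${it.tagName()} -> ${features(it).toList()}") }
    }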
---

Prototype status

We'll open-source a very early version (missing docs, missing examples, and possibly flawed designs). Known issues include:

• Deadlocks on certain complex sites that are hard to reproduce
• CDP event reordering under high concurrency
• Worse-than-expected memory behavior
• A structure-learning module that is inaccurate on non-e-commerce pages

If you've built systems with heavy browser interaction, automation, or data extraction, or that treat the browser as a runtime, we'd love to hear about the bottlenecks you hit, so we don't optimize in the wrong direction.

---

Finally

Any single sentence of criticism may save us months.

— Browser4 Team


  👤 grizzles Accepted Answer ✓
Open-source it and you'll get all the feedback you desire.