- Cerebral Valley
- Posts
- Browserbase gets AI Agents on the Web đ
Browserbase gets AI Agents on the Web đ
Plus: Founder Paul Klein on why he believes AI-powered browsing is only just beginning...

CV Deep Dive
Today, weâre talking with Paul Klein, founder and CEO of Browserbase.
Browserbase offers a powerful cloud-based headless browser infrastructure designed for both AI-driven and traditional automation workflows. If an application needs to navigate the web like a human wouldâfilling forms, clicking buttons, interacting with pagesâBrowserbase provides the underlying infrastructure so you donât have to maintain your own fleet of headless browsers.
Paulâs experiences working at Twilio + Mux shapes his belief that infrastructure companies are uniquely powerful, offering the âbuilding blocksâ that power countless use cases.
After raising their seed round in June 2024, Browserbase raised a $21M Series A in October 2024 co-led by Kleiner Perkins and CRV with participation from Okta Ventures.
Key Takeaways
Web Browser for AI: Browserbase is building the infrastructure that lets AI agents (and any automation) interact with websites as though they were human users.
Stagehand, the AI Web Browsing Framework: An extension of Playwright that allows natural-language instructions for web actions, making generative browser automation seamless.
Not Just AI: While AI automation is a major driver, Browserbase also supports any headless browser workflows for testing, web scraping, or automation.
Hands-off Scalability: The team has scaled to tens of thousands of browser sessions, handling tough Kubernetes and cloud infrastructure challenges so customers donât have to.
In this conversation, Paul explains how Browserbase came to be, how it fits into the rapidly expanding web automation landscape, and why he believes AI-powered browsing is only just beginning.
Letâs dive in âĄď¸
Read time: 8 mins
Our Chat with Paul đŹ
Welcome Paul - give us a bit of background on yourself and what led you to found Browserbase?
Iâve been working for infrastructure companies basically my whole career. I actually started out as an intern at Twilio during their IPO, which was a wild experience because everyone around me seemed to be getting rich overnight, and we could drink for free at the bar. I ended up staying for three years and watched Twilio go from about $100 million in revenue to over a billion, which really convinced me that building developer tools and infrastructure is incredibly powerfulâgiving devs the âLego blocksâ to build anything.
After Twilio, I co-founded Stream Club to create a browser-based live-streaming platform during COVID. Existing livestream software felt clunky, and we wanted to make it dead simple for creators to stream right from their browsers. We ran it for a couple of years, then sold to Mux, another infrastructure company that provides video APIs. It made sense because we constantly ran into video infrastructure headaches, and Mux solved those at scale. All of that led me to realize I get the most joy from building these horizontal platforms. Twilio does it for communications, Stripe does it for payments, and I wanted to do that for browsersâso I started Browserbase.
Can you give me an elevator pitch for Browserbase for the uninitiated AI developer.
The simplest way to explain it is: humans use browsers to go to websites and fill out forms or click on buttons. In the world of AI, we believe a lot of those routine tasks will be handled by AI on our behalf. But the browsers we use every day arenât designed for AI; theyâre designed for people on laptops. AI runs on servers in the cloud, so it needs something different.
Browserbase is building the infrastructure layer that runs thousands of browsers in the cloud. On top of that, we have Stagehand, which is the framework for telling a browser what to do using natural language. If you say âClick the Buy buttonâ in plain English, Stagehand translates that into code that can drive a headless browser. Thatâs a perfect match for LLM-based agents because they speak and interpret natural language, so weâre essentially giving AI its own specialized browser to interact with the web.
I see, so youâre not an alternative to automation frameworks like Selenium or Playwrightâyouâre a complementary tool?
Exactly. Selenium, Playwright, and Puppeteer are automation frameworks that let you script browser actions, but you still have to run the actual browser somewhere. If youâre using Selenium or Playwright on your own, you might spin up containers in the cloud, handle scaling, logging, debuggingâit can get complicated. Browserbase is that managed environment, so you can bring your Selenium, Puppeteer, or Playwright scripts, and weâll run them at scale. Or, if you prefer, you can use Stagehand, which is our own extension of Playwright thatâs built for AI agents.
We also record each session so you can watch what happened if something breaks. We log console output, network traffic, all those details that help you see why a script failed. That level of observability is essential when youâre running thousands of browsers, because the web is messy and unpredictable.
Who are your customers today, and what kind of people are finding the most value in Browserbase?
Itâs a mix of AI startups and more traditional automation use cases. We have a lot of new AI companies building agents that need to do multi-step tasks on websites. For example, you might have an âAI SDRâ tool that logs into LinkedIn, searches for prospects, and sends connection requests. Or you might have a workflow automation app in healthcare or insurance that logs into complex portals. All of that needs robust headless browsing.
Then we also see customers who are not explicitly AI-driven but have legacy scripts for web testing or scraping. They used to maintain their own browser fleet but realized how much time that eats up. For them, Browserbase is an infrastructure-as-a-service for browsers. So we end up powering everything from e-commerce automation to data collection. The common thread is: you need to run a browser reliably and at scale without spending half your engineering cycles on it.
A great success story is Structify, which built a web research agent that navigates sites lacking an API. The AI decides where to go, how to fill out forms, what to parse on each page, and that all happens on Browserbase. Without our infrastructure, they wouldnât be able to spin up enough sessions or handle the complexity of real-world web flows at scale.
Another is Benny, which helps people on food stamps get rebates for healthy groceries. The normal process involves going to a painful web form, categorizing items, uploading receipts. Benny automates that on behalf of the user. Itâs not even using AI âitâs just a specialized automation that Browserbase makes super simple to scale. They can run thousands of concurrent sessions without building their own headless environment. Itâs a clear example of saving people real time and effort.
This is a really exciting and emerging market â how do you think about the competitive landscape, and how does Browserbase stand out?
I was a customer of older headless browser services and felt they never had the polish of a Vercel or a Stripe for browser automation. I saw the installation metrics for things like Playwright or Puppeteer going exponentialâmillions of downloads and growing fast. That signaled a real, underserved need. So we set out to build a best-in-class developer experience for headless browsers, with great logging, replay, reliability, and support for multiple frameworks.
We differentiate ourselves, frankly, by being a really great product. We handle concurrency, weâre stable, we offer detailed logs, session recordings, and we also have Stagehand specifically for AI use cases. Itâs a wide net: people can do old-school statically-coded scripts, or they can do cutting-edge LLM-driven interactions. In both cases, you get a robust environment that scales and is easier to debug.
Web Agentic AI is accelerating fast. How do things like Operator or Anthropicâs âcomputer useâ factor into Browserbaseâs plan?
In many ways, they boost demand for Browserbase. Operator and Anthropicâs approach let an LLM call external tools to do complex tasks. If those tasks involve websitesâclicking through, filling out forms, navigating multi-page flowsâthey still need a browser. Operator is basically a web agent living inside ChatGPT, and we power similar web agents. We released something called Open Operator as a reference project, to show you exactly how you can build something like Operator, but embedded in your product. Similarly, you can wire up Anthropicâs âcomputer useâ to Browserbase so an AI can step through real websites.
The net effect is that as these agent frameworks get better, they need an infrastructure layer to power actual browsing. Weâre not competing with them; weâre complementary. If Operator or Anthropic wanted to host a massive browser cluster themselves, thatâs a different conversation, but so far, we see them as catalysts driving more interest in headless automation.
As Web Agentic AI becomes more common and AI Agents increasingly need to interact with websites, how do you feel the Internet and web dev might change in the long term?
Itâs going to be a while before AI agents outnumber humans, because billions of people arenât online yet. But yes, over time, more mundane web tasks will be offloaded to AI. That means an increasing volume of automated âusers,â though I doubt websites will build entirely separate versions for bots. If the AI is sufficiently intelligent, it can work with the existing site just as a human does.
Weâll likely see better differentiation between âgoodâ and âbadâ bots. CAPTCHAs are crudeâthey canât tell if youâre an agent doing legitimate tasks or someone trying to spam or scrape without permission. A âgoodâ bot typically represents a real user, logging in with their credentials, performing tasks with consent, and contributing some value. A âbadâ bot is the one scalping tickets, scraping data for shady purposes, or spamming. Weâre working with authentication providers like Okta, Clerk, and Stytch to help authenticate beneficial bots so they can pass through without friction. In theory, that would reduce the arms race of AI vs. CAPTCHAs.
Could websites add more alt tags or specialized endpoints for AI? Possibly, but that might be incremental rather than a total revamp. Accessibility best practices already encourage labeling UI elements thoroughly, and a well-structured HTML page is easier for an agent to parse. I donât expect a wholesale rewrite of the webâthereâs too much legacy out there. Browserbase is built to handle the web as it is, so even if some sites make AI-friendly features, the rest wonât change overnight.
How will Browserbase evolve over the next 12 months?
Weâre doubling down on Stagehand, which has been TypeScript-only but is getting a Python version soon. A lot of AI devs are Python-focused, so that will be a big milestone. Weâre also rolling out a visual web interface for Stagehand, kind of a WYSIWYG for building automation scripts. That should help people prototype or debug browser flows quickly.
We plan to offer more customization for things like cost vs. performance trade-offs. Some customers want extremely cheap concurrency for massive scale; others need bulletproof reliability or specific rendering requirements. Because weâre usage-based, we only succeed if you succeed. So weâre building more knobs you can turn to dial in your exact needs.
Overall, weâll keep adding features that make cloud browser infrastructure easierâmore logging, more debugging, better concurrency. Our goal is to remove every obstacle so teams can launch new products faster without wrestling with the low-level details of running browsers.
You guys seem to be on quite a rocket shipâseed round last summer, Series A by fall. How have you managed that insanity?
Rocket ships are exciting and chaotic at the same time. From the outside, it looks amazing, but inside, it can feel like everythingâs on fire. Iâm grateful, though. Weâve gone from one to fifteen team members, and we plan to be around thirty soon. I also got married last year and started focusing on my fitness again, which is a lot to juggle.
A lot of it was mapped out in this memo I wrote called âAn Internet Browser for AI.â Things have lined up nicely, partly because the AI space is moving at breakneck speed. That can be overwhelming, but itâs also an opportunity. Weâve raised capital to scale quickly to meet demand, and if our customers flourish, our usage-based model means we grow too. We win when our customers win.
Whatâs the culture like at Browserbase? Obviously youâre hiringâwhat kind of environment are you building?
Weâre fully in person, five days a week, which is unusual post-COVID, but we love the energy. We run together on Mondays, share ideas in real time, and basically have a hacker-house vibe, but more organized. Itâs a place where engineers geek out over dev tools and infrastructure challenges.
We emphasize personal growth, so I once asked at an all-hands: âWhoâs done something they never thought they could do in the last six months?â Everyone raised their hand. Thatâs what we want. People step outside their comfort zone and learn rapidly, whether itâs mastering Kubernetes scaling or building open-source tooling. Many of us are former founders or might become founders in the future. We share that passion for building real products people love.
If youâre someone whoâs excited about dev tools and wants to solve challenging infra problems, weâre looking for you. Itâs a rocket ship, but we try to keep it fun and supportive. We want a place where the slope of your learning curve is as steep as possible.
Is there anything else you'd like people to know about Browserbase?
If youâre building AI (or any) automation that needs to interact with websites, fill out forms, or replicate user actions, you should probably check out Browserbase. Instead of reinventing the wheel with your own headless fleet, let us handle the infrastructure. That way, you can focus on building a great product rather than debugging Chromium at scale.
Feel free to email me directly at [email protected] if youâre working on something interesting. We love supporting innovatorsâespecially those using AI to do things that arenât possible with regular APIs or static scripts. Thereâs a huge opportunity in letting AI act on the web, and weâre here to power that.
AI is definitely not zero-sum, so letâs help each other succeed. If we can empower you to ship faster and tackle the hard parts of headless browsing, everybody wins.
Conclusion
Stay up to date on the latest with Browserbase, learn more about them here.
Read our past few Deep Dives below:
If you would like us to âDeep Diveâ a founder, team or product launch, please reply to this email ([email protected]) or DM us on Twitter or LinkedIn.