๐ŸŒ

Browser Agents

AI agents that navigate and interact with websites autonomously

13 Tools in Browser Agents

AGI Inc (MultiOn)

autonomous browser agents

Applied AI lab building AGI-0, a personalized proactive co-worker that gets things done on your smartphone. Redefining human-AI interaction with autonomous task completion.

freemium Autonomous web agentMobile integration

Axiom.ai

no-code browser automation

No-code browser automation and web scraping platform. Save time with browser bots that automate website actions, data entry, and repetitive tasks with ChatGPT integration.

freemium No-code automationChatGPT integration

Browse AI

web scraping

#1 AI web scraper and monitoring platform. Point and click to extract, scrape, and monitor data from any website with no coding required and 7,000+ integrations.

freemium No-code web scrapingChange monitoring

Browser Use

browser agent framework

Open-source Python library that enables LLMs to interact with websites through browser automation. Lets AI agents navigate, click, type, and extract information from web pages autonomously.

open-source LLM browser controlMulti-tab support

Browserbase

cloud browser infrastructure

Web browser infrastructure for AI agents and applications. Scalable, secure browser instances compatible with Playwright, Puppeteer, and Selenium with SOC-2 compliance.

freemium Serverless browsersStealth/anti-detection

Browserbase

browser infrastructure

Serverless browser infrastructure for AI agents that provides scalable, headless browsers in the cloud. Supports Playwright, Puppeteer, and Selenium with built-in captcha solving and proxy management.

freemium Serverless browsersStealth mode

LaVague

ai web agent framework

Open-source framework for building and deploying AI web agents. Uses Selenium driver with natural language commands to automate web interactions programmatically.

open-source Open-source frameworkNatural language automation

LaVague

web agent framework

Open-source Large Action Model framework that automates web browsing using natural language instructions. Converts high-level goals into browser automation code using vision and action models.

open-source Natural language automationVision-based actions

Playwright MCP

browser automation MCP

Model Context Protocol server that enables AI agents to control browsers via Playwright. Provides standardized tools for web navigation, form filling, and data extraction for LLM applications.

open-source MCP integrationPlaywright automation

Playwright

browser automation framework

Microsoft's end-to-end testing framework for modern web apps. Cross-browser, cross-platform automation with one API supporting TypeScript, JavaScript, Python, .NET, and Java.

open-source Cross-browser supportAuto-wait/retry

Skyvern

ai browser automation

AI browser automation platform using computer vision to adapt to any webpage. Execute complex tasks with natural language commands, run thousands simultaneously via API.

freemium Computer vision automationNatural language commands

Skyvern

browser automation

AI-powered browser automation platform that uses vision and LLMs to automate manual browser workflows. Handles dynamic websites without brittle selectors by understanding page content visually.

freemium Vision-based automationNo-selector approach

Stagehand

web agent framework

Open-source TypeScript framework from Browserbase for building robust web agents. Provides high-level abstractions for browser automation with AI-powered element selection and action planning.

open-source AI element selectionAction planning