Browser Control

Overview

PalanK provides comprehensive browser automation through natural language commands. The AI understands context and executes complex multi-step operations automatically.

"Go to amazon.com"
"Navigate to https://github.com/PALAN-K"
"Open google.com in a new tab"
"Go back to the previous page"
"Refresh this page"

Tab Management

"Open a new tab"
"Close this tab"
"Switch to the second tab"

Interactions

Clicking Elements

"Click the login button"
"Click on the search icon"
"Click the first product in the list"
"Double-click on the image"

Typing Text

"Type 'hello world' in the search box"
"Fill the email field with [email protected]"
"Enter my username in the login form"
"Clear the input field and type 'new text'"

Form Handling

"Fill out the contact form with my information"
"Select 'United States' from the country dropdown"
"Check the 'Remember me' checkbox"
"Submit the form"

Page Analysis

Reading Content

"What's the title of this page?"
"Read the main article content"
"List all the links on this page"
"Find the price of this product"

Screenshots

"Take a screenshot"
"Capture the visible area"
"Screenshot the entire page"

Element Finding

"Find all buttons on this page"
"Locate the login form"
"Where is the search bar?"

Scrolling

"Scroll down"
"Scroll to the bottom of the page"
"Scroll up a little"
"Scroll to the comments section"

Advanced Operations

Waiting

"Wait for the page to load"
"Wait until the spinner disappears"
"Wait 3 seconds"

Conditional Actions

"If there's a cookie banner, close it"
"Click 'Load more' until all items are visible"
"If logged in, go to dashboard; otherwise login first"

Data Extraction

"Extract all product names and prices"
"Get the table data as CSV"
"Copy all email addresses on this page"

Best Practices

Be specific about which element you want to interact with. Instead of “click the button”, say “click the blue Submit button at the bottom”.

The AI sees the page as a user would. If an element is hidden or requires scrolling, mention that in your command.

Limitations

Cannot interact with browser chrome (address bar, bookmarks)
Cannot access cross-origin iframes with different security policies
Some heavily protected sites may block automation
File upload dialogs require manual interaction

Get Started

Features

Overview

Navigation

Page Navigation

Tab Management

Interactions

Clicking Elements

Typing Text

Form Handling

Page Analysis

Reading Content

Screenshots

Element Finding

Scrolling

Advanced Operations

Waiting

Conditional Actions

Data Extraction

Best Practices

Limitations

Get Started

Features

​Overview

​Navigation

​Page Navigation

​Tab Management

​Interactions

​Clicking Elements

​Typing Text

​Form Handling

​Page Analysis

​Reading Content

​Screenshots

​Element Finding

​Scrolling

​Advanced Operations

​Waiting

​Conditional Actions

​Data Extraction

​Best Practices

​Limitations

Overview

Navigation

Page Navigation

Tab Management

Interactions

Clicking Elements

Typing Text

Form Handling

Page Analysis

Reading Content

Screenshots

Element Finding

Scrolling

Advanced Operations

Waiting

Conditional Actions

Data Extraction

Best Practices

Limitations