5 Things ChatGPT Agent Can Do for You Right Now (That You're Still Doing Manually)
ChatGPT Agent lets AI browse the web on your behalf. I tested it on shopping, booking, forms, research, and scheduling. Here's what worked and what didn't.
Key Takeaways
- ChatGPT Agent (formerly Operator) lets ChatGPT browse the web, click buttons, fill forms, and complete tasks on your behalf — all inside its own browser.
- It's powered by Computer-Using Agent (CUA), which combines GPT-4o's vision with reinforcement learning to navigate real websites with an 87% success rate.
- Available to Plus ($20/month), Pro ($200/month), and Team subscribers through the tools dropdown.
- I used it for five everyday tasks — shopping, booking, form-filling, research, and scheduling — and it genuinely surprised me on three of them.
- It's not perfect. CAPTCHAs, complex payment flows, and sites with aggressive bot detection still trip it up.
Table of Contents
- What ChatGPT Agent Actually Is (and Isn't)
- How to Turn It On
- 5 Tasks It Handles Better Than You'd Expect
- Where It Still Struggles
- How It Compares to Claude and Google
- Privacy and Safety: What You Should Know
- Frequently Asked Questions
What ChatGPT Agent Actually Is (and Isn't)
Here's a confession: I didn't take ChatGPT Operator seriously when it first launched. An AI that browses the web for you? Sounded like a gimmick. Then I asked it to find me the cheapest roundtrip flight from SFO to Tokyo in April, and it came back with an option $180 cheaper than what I'd found after 20 minutes on Google Flights. That got my attention.
This article is part of our ChatGPT beginner's guide. Start there for a complete overview.
ChatGPT Agent — the current evolution of what OpenAI originally called Operator — is an agentic mode built directly into ChatGPT. When you activate it, ChatGPT gets its own browser. Not a simulated one. A real browser that loads real web pages, sees them the way you see them, and interacts with them: clicking buttons, scrolling pages, typing into search boxes, filling out forms.
The technology behind it is called Computer-Using Agent (CUA). It combines GPT-4o's vision capabilities with reinforcement learning specifically trained to interact with graphical interfaces. CUA identifies buttons, sliders, text fields, and checkout forms by their visual context — not by reading HTML source code. This means it works on almost any website, including JavaScript-heavy single-page apps that would break traditional web scrapers.
What it isn't: it's not autonomous. It asks for your confirmation before completing purchases, submitting forms with personal data, or making any irreversible action. Think of it as a very fast research assistant who does the legwork and checks in before pulling the trigger.
How to Turn It On
If you're on a paid ChatGPT plan, you might not even realize you have access. Here's how to find it:
- Open ChatGPT (web or desktop app).
- Click the tools dropdown in the composer area (bottom of the chat window).
- Select "Agent mode" from the list.
- Type your request as you normally would.
That's it. No separate app, no Operator website (that's been folded into the main ChatGPT interface). The old Operator standalone site is no longer accessible — everything lives inside ChatGPT now.
One thing I wish someone had told me sooner: be specific in your requests. "Find me a good hotel in Paris" gives mediocre results. "Find a hotel in the 6th arrondissement of Paris for March 20-24, under €200/night, with at least 4.2 stars on Google, that includes breakfast" gives you exactly what you need on the first try.
5 Tasks It Handles Better Than You'd Expect
1. Price Comparison Shopping
I asked ChatGPT Agent to find the best price for a specific pair of running shoes (Nike Pegasus 41, size 10, black). It visited six retailer websites, compared prices including shipping, checked for active coupon codes, and came back with a ranked list. Total time: about 90 seconds.
The winning price was $34 less than the first result on Google Shopping. The agent found an active 15% off code on one retailer's site that I never would have discovered scrolling through their homepage.
My favorite trick with shopping: ask it to "find this product and also check if any retailers have price-match guarantees." It'll actually read the policy pages.
2. Restaurant Reservations
This one genuinely surprised me. I said: "Book a table for two at an Italian restaurant in downtown Portland, Saturday at 7pm, outdoor seating preferred." The agent searched OpenTable, found four options with availability, showed me ratings and reviews for each, and asked which one I wanted. When I picked one, it completed the reservation in about 20 seconds.
The old way: open OpenTable, set filters, scroll through results, read reviews on a separate tab, go back, select a time, fill out the form. Seven minutes minimum. The agent did it in under two.
3. Government and Utility Forms
Here's where I was genuinely impressed. I needed to update my address with my utility company — the kind of task that involves logging into a clunky portal, navigating three menu levels, finding the right form, and filling in fields that are never where you'd expect them.
The agent navigated the portal, found the address change form, filled in my new details (which I'd provided in the chat), and stopped right before the submit button to confirm. The whole interaction took about 45 seconds of actual agent work. The website's interface was objectively terrible, with nested iframes and dropdown menus that required hovering. CUA handled it anyway.
4. Competitive Research
I asked it to "visit these three competitors' websites and summarize their pricing tiers, free trial policies, and main feature differences." It opened each site, navigated to pricing pages, extracted the data, and compiled a clean comparison — including features buried in FAQ pages that I would have missed.
This is the task I now use ChatGPT Agent for almost daily. It's not just faster; it's more thorough. The agent reads fine print that I'd normally skip. Last week it caught a competitor's "unlimited" plan that actually had a 10,000-request monthly cap mentioned only in their Terms of Service.
5. Appointment Scheduling Across Multiple Sites
The task: find available appointment slots at three different medical specialists, compare availability, and hold the best option. The agent visited each provider's booking portal, navigated their scheduling systems (each completely different in design), and came back with a side-by-side availability comparison.
This would have taken me 30+ minutes of tab-switching and form-navigating. The agent did it in under 4 minutes. The trick nobody mentions: you can tell the agent to "prioritize morning appointments" or "avoid Mondays" and it filters intelligently, not just by matching text but by understanding the calendar layout on screen.
Where It Still Struggles
ChatGPT Agent isn't magic. Here are the failure modes I've encountered.
CAPTCHA and Bot Detection
Some websites actively detect and block automated browsing. When the agent hits a CAPTCHA, it pauses and asks you to solve it. Cloudflare's bot protection is particularly effective at blocking CUA. About 1 in 5 websites in my testing triggered some form of bot detection.
Complex Payment Flows
The agent can fill in credit card forms, but it gets confused by 3D Secure verification, bank redirects, and multi-step checkout processes. OpenAI intentionally limits payment capabilities for safety reasons — the agent always asks for explicit confirmation before submitting payment information.
Sites That Require Login
The agent can log into sites if you provide credentials, but it can't access accounts you're already logged into in your personal browser. Its browser session is isolated. This means you'll need to re-authenticate for every service, which is a security feature but an annoyance for frequent use.
Real-Time Inventory
For tasks that depend on real-time availability — like grabbing concert tickets or flash sales — the agent is too slow. The screenshot-analyze-act loop takes 2-3 seconds per step, and in situations where milliseconds matter, a human clicking "Buy Now" will always win.
How It Compares to Claude and Google
Every major AI company is building browser agents. Here's where things stand.
| Feature | ChatGPT Agent | Claude Cowork | Google Jarvis |
|---|---|---|---|
| Browser Control | Full (own browser) | Full (desktop + browser) | Chrome only |
| Success Rate | 87% (web tasks) | ~75% (mixed tasks) | Not published |
| Payment Support | With confirmation | Limited | Google Pay only |
| File Management | No | Yes (local files) | Drive only |
| Availability | Plus/Pro/Team | Pro/Max/Team/Enterprise | Preview |
ChatGPT Agent is the most polished option for pure web browsing tasks. Claude's Cowork is stronger for workflows that combine web browsing with local file management. Google's Jarvis has the deepest Chrome integration but remains in limited preview.
My recommendation: if you're already paying for ChatGPT Plus, start with Agent mode. It's included in your subscription and covers 80% of common browser-based tasks. You don't need another tool for everyday web automation.
Privacy and Safety: What You Should Know
Letting an AI browse the web on your behalf raises legitimate privacy questions. Here's what you should know about ChatGPT's data handling.
The agent's browser session is sandboxed — it doesn't have access to your cookies, saved passwords, or browsing history from your personal browser. Every session starts fresh. This is good for privacy but means you'll need to provide login credentials explicitly when needed.
OpenAI states that browsing data from agent sessions is not used for model training on paid plans. Screenshots captured during agent operation are processed in real-time and discarded after the session ends.
The confirmation step before irreversible actions (purchases, form submissions, account changes) is a genuine safety feature, not theater. I tested this extensively — the agent consistently pauses at the right moments, even on websites it hasn't seen before.
Real AI Responses (Tested March 2026)
Frequently Asked Questions
Do I need a separate subscription for ChatGPT Agent?
No. Agent mode is included with ChatGPT Plus ($20/month), Pro ($200/month), and Team plans. You activate it through the tools dropdown in the composer area — no additional setup required.
Can ChatGPT Agent make purchases on my behalf?
It can fill in payment forms and navigate checkout flows, but it always pauses to ask for your confirmation before submitting any payment. You remain in control of the final "buy" click. OpenAI has built this as a hard safety constraint that can't be overridden.
What websites work best with ChatGPT Agent?
Major e-commerce sites (Amazon, Best Buy), travel platforms (Booking.com, OpenTable), and standard business websites work well. Sites with heavy bot protection, custom JavaScript frameworks, or CAPTCHAs may cause issues. Government portals are hit-or-miss — some work perfectly, others block automated access.
How is ChatGPT Agent different from browser extensions like Perplexity?
Perplexity and similar tools search and summarize web content. ChatGPT Agent goes further — it can interact with websites by clicking buttons, filling forms, and completing transactions. It's the difference between reading about a restaurant and actually making a reservation.
Is ChatGPT Agent available on mobile?
Currently, Agent mode is available on the ChatGPT web interface and desktop apps. Mobile support is rolling out gradually, with iOS expected first. The experience is optimized for larger screens where you can see the agent's browser activity alongside the chat.
Give It a Try — Start Small
If there's one thing I've learned from using ChatGPT Agent daily for the past month, it's this: start with the boring stuff. The tasks you dread — price comparisons, form filling, appointment scheduling — are exactly where it shines. Don't start by asking it to do something complex and novel. Start with the task you've been putting off because it's tedious.
For me, that was insurance quote comparison. I'd been meaning to compare car insurance rates for three months. I kept not doing it because it meant visiting five websites and filling out the same 20-field form five times. ChatGPT Agent did all five in under 8 minutes. The cheapest option saved me $420 per year.
That's the real value here. Not that the AI is brilliant — it's that it handles the kind of work that makes you feel like a robot, so you don't have to be one. Give it a try with one task this week. You might be surprised at how much time you get back.