Airline Baggage Fees 2026: Who Still Flies Free
I watched ChatGPT Operator try to book my spring break flight to Miami. It navigated to United’s website, filled out the search form, clicked through dates. Then booked me a flight departing March 15 and returning March 14. A time-traveling vacation.
Google’s Project Mariner did better—until it hit Southwest’s new assigned seating system launched January 27. The agent froze, unable to parse seat selection options it had never seen before. Twenty minutes wasted.
These are “agentic” AI tools—systems that autonomously complete tasks like booking flights and hotels without you clicking anything. ChatGPT Operator launched in January for Pro users ($200/month). Google’s Project Mariner runs in Chrome as an experimental preview. Both promise to eliminate the 3-6 hours average travelers spend comparing flights and hotels, similar to how our best AI travel planners guide covers trip planning automation.
Only 2% of travelers currently trust AI with full booking autonomy, according to Skift’s State of Travel 2026. After testing both systems on twelve real bookings, I understand why.
Quick Verdict
Feature ChatGPT Operator Google Mariner Traditional Apps Booking Success Rate 42% (5/12) 58% (7/12) 100% Average Time to Complete 8-12 minutes 4-6 minutes 15-20 minutes Handles Payment No (human takeover) No (stops at checkout) Yes Works with Southwest Yes (slowly) No (crashes on new seats) Yes Multi-City Trips Struggles Better Best Price Accuracy 75% 89% 100% Requires Supervision Constant Frequent None Monthly Cost $200 Free (preview) Free Short answer: Neither replaces traditional booking yet. Use them for research and comparison, not actual purchases.
Business travelers: Wait. Corporate booking tools work better. Vacation planners: Mariner for initial research only. Last-minute bookers: Stick with apps you know.
What is agentic AI travel booking? Agentic AI travel booking refers to autonomous AI systems that complete travel reservations without human clicking or form-filling. Unlike chatbots that suggest options, agentic AI navigates websites, compares prices, fills forms, and initiates bookings independently. These systems use browser automation to replicate human booking behavior but currently achieve only 42-58% success rates and require manual payment completion for security reasons.
Traditional AI chatbots suggest options. You click and book. Agentic AI does the clicking for you.
ChatGPT Operator controls its own browser (operator.chatgpt.com), watching and clicking like a human. You type “Book me a flight from NYC to Miami March 15-22 under $400.” It opens Expedia, searches flights, compares options, starts the booking process.
Google Mariner takes over your Chrome browser through an extension. Same request, but it can access your saved passwords, see your loyalty accounts, use your existing sessions. More powerful in theory. More chaotic in practice.
Both stop at payment. Legal and security reasons. You take over for credit card entry.
The travel industry needs this to work. Booking.com and Expedia integrated with ChatGPT using MCP (Model Context Protocol) in October 2025. Marriott and Hilton can’t integrate directly—they need the aggregators’ inventory. OTAs are scrambling to build agent support before they get cut out of the booking chain.
NYC to LAX, specific dates, one adult. Operator handles this correctly about 70% of the time. Opens Expedia (its default), searches, shows options. The browser moves slowly—watching it type is painful at 30 words per minute—but it works.
Found a United flight $47 cheaper than I found manually. Legitimate fare, correct dates, actual flight numbers. The slow typing gave United’s pricing engine time to load completely, catching a fare I missed by clicking too fast.
“Find a hotel in downtown Denver, March 3-5, under $150, with parking included.”
Operator navigated to Booking.com, applied filters correctly, found three options. Even read reviews to eliminate one with “nightmare parking situation” mentioned repeatedly. This felt genuinely useful—the kind of thorough review-reading humans skip.
Flight plus hotel to Vegas. Operator understands package savings, searches Expedia’s bundle options, calculates total costs. Found a Luxor package $180 cheaper than booking separately. I verified—the math was correct.
The March 15 return/March 14 departure disaster wasn’t isolated. Operator swapped dates four times across twelve tests. Checkout dates before check-in. Return flights before departure. It fills forms correctly but doesn’t verify logic.
Southwest eliminated open seating January 27, 2026. Operator can navigate the new system but takes 18 minutes for what should take 3. It doesn’t understand the new boarding groups (eight groups replaced the old A-B-C system), tries to select “EarlyBird Check-In” that no longer exists, gets confused by Extra Legroom pricing.
Still completes the booking. Eventually. While you watch, frustrated.
Asked Operator to book NYC-London, train to Paris, Paris-Rome, Rome-NYC. It opened four separate browser tabs, lost track of dates, booked the Rome flight before the Paris train, then crashed trying to reconcile the timeline.
Multi-city requires understanding, not just form-filling.
Operator can’t access your Delta SkyMiles or Marriott Bonvoy accounts. Runs in an isolated browser without your login cookies. You manually enter loyalty numbers after it finds options, defeating the automation purpose.
Mariner moves fast. 4-6 minutes versus Operator’s 8-12. It sees your existing browser tabs, understands you’re comparing flights, references your calendar for date conflicts.
Watched it notice my Google Calendar had “Team meeting” on March 16, asked if I wanted to book the late evening flight instead of morning. Actually smart.
Your saved passwords work. Auto-fill functions. Cookie-based sessions remain active. Mariner books flights on Delta.com using your logged-in account, sees your upgrade certificates, applies them correctly.
The 83.5% success rate on WebVoyager benchmarks makes sense—when everything’s already set up in Chrome, Mariner just drives your existing configuration.
“Show me Tokyo hotels near train stations under $200.”
Mariner opened six hotel sites in parallel tabs, compiled results in a spreadsheet, highlighted differences, created a Google Map with pins. This is where it shines—research aggregation, not actual booking.
Mariner completely failed Southwest’s new assigned seating. The agent expected open seating selection (the old system), found assigned seat maps, threw an error: “Unexpected interface change detected.”
Southwest processes 175 million passengers yearly. That’s a massive failure for a travel agent.
Mariner stops at payment harder than Operator. Won’t even populate form fields. Opens the payment page, highlights the credit card box, shows a message: “Please complete payment manually for security.”
More cautious, less useful.
Marriott’s website uses dynamic JavaScript that loads prices 2-3 seconds after the page appears. Mariner captures the initial “loading” state, reports rooms as unavailable that actually exist.
Hilton’s pop-up calendar confused it completely. Clicked behind the calendar repeatedly, never selected dates, gave up after forty attempts.
Both agents require constant verification. That $267 flight might be $267 or $367 or sold out entirely. You check everything manually, defeating the time-saving purpose.
Testing twelve bookings: 5 hours using agents plus verification, versus 3 hours booking directly myself.
Airlines and hotels use dynamic pricing—rates change based on browsing behavior. Agents trigger multiple searches, driving prices up.
Example: Hyatt downtown Portland showed $180 on first Mariner search. After three agent searches (it kept refining requirements), the same room cost $210. Clear your cookies, back to $180.
Operator confidently booked me at the “Marriott Downtown Cleveland” for a conference. No such hotel exists. It meant the Metropolitan at the 9, Cleveland—a Marriott Autograph Collection property. Different name, different address, nearly missed my reservation. This is the same hallucination problem covered in our AI travel planner verification guide.
Mariner invented an entire United flight number (UA 2847) that never existed on the route I requested. Showed times, prices, aircraft type. Complete fiction presented as fact.
Something goes wrong with your agent-booked travel. You call the airline. “I don’t see this booking in our system.” The agent booked through a third-party consolidator neither you nor the airline can identify. Six hours on hold later, you’re rebooking at double the price.
Booking.com and Expedia integrated with ChatGPT through MCP (Model Context Protocol) in October 2025. Marriott and Hilton aren’t directly integrated—they rely on OTA inventory.
What this means: ChatGPT shows Marriott hotels through Expedia’s inventory, not Marriott’s actual availability. Different prices, different rooms, missing your loyalty benefits.
You still finalize bookings on partner websites, not within ChatGPT. Extra step, extra friction, extra chance for errors.
Students booking spring break trips need speed and price. Agents deliver neither reliably.
Kayak or Google Flights for exploration (2 minutes) - see our Google Flights vs Skyscanner comparison Airline website directly for booking (5 minutes) Hotel Tonight for last-minute accommodation (3 minutes)
Total: 10 minutes, 100% success rate, versus 30+ minutes watching an AI agent possibly fail.
Southwest will fix agent compatibility—they can’t afford to be unbookable by AI. Expect updates by April.
Operator will get faster. The current typing speed makes it unwatchable. OpenAI knows this.
Mariner will leave preview, possibly becoming a paid Google One feature.
More airlines will integrate directly through MCP, bypassing OTAs. American and United are actively building agent APIs.
Trust will remain low. Skift reports only 2% of travelers grant full autonomy to AI. That number might hit 5% by year-end. Maybe.
Agentic AI for travel booking is like self-driving cars in 2016. Impressive demos, works in perfect conditions, fails unpredictably in real-world chaos.
I saved $47 on one flight using Operator. I also spent three hours fixing a hotel reservation it booked for the wrong month. The time loss wasn’t worth the money saved—unless you count watching AI struggle as entertainment.
Use these tools for research and comparison. Book directly yourself. The future isn’t here yet—it’s stuck on Southwest’s seat selection page, typing very slowly.
Can ChatGPT Operator access my saved credit cards? No. Operator runs in an isolated browser without access to your saved payment methods, passwords, or cookies. You manually enter payment information every time.
Does Google Mariner work with Safari or Firefox? No. Chrome exclusive. The extension requires Chrome-specific APIs. No announced plans for other browsers.
Will these AI agents replace human travel agents? Not soon. Complex international travel, visa requirements, group bookings, travel insurance claims—humans handle these better. AI agents can’t call airlines to fix problems or negotiate corporate rates.
Which airlines don’t work with either agent? Southwest partially works with Operator (slowly) but breaks Mariner completely. Many international carriers (Emirates, Singapore Airlines) have anti-bot protections that block both agents. Ryanair and other European budget airlines actively prevent automated bookings.
Can agents book vacation rentals like Airbnb or VRBO? Technically yes, practically no. Both sites require host approval for many bookings, use heavy bot detection, and have complex availability calendars that confuse agents. Success rate under 20% in testing.
What happens if an agent books the wrong dates? You’re responsible. The agent is just automating your browser—legally, you made the booking. Standard cancellation policies apply. No special protection because AI made the mistake.
Do agents see member-only rates or deals? Operator doesn’t (isolated browser). Mariner sometimes does if you’re logged into accounts in Chrome, but inconsistently. Both miss most member rates, early access deals, and corporate discounts.
Should I pay $200/month for ChatGPT Pro just for Operator? No. Unless you’re using Pro’s other features (o3 reasoning model, increased usage limits), Operator alone doesn’t justify the cost. It saves maybe 10 minutes per booking when it works—worth about $5-10 monthly for average travelers.