Back to blog
AI agentsagent protocolsMCPllms.txt

AI agent protocol adoption: we analyzed 8 agents in 2026

Research across 8 agents found 0 confirmed cases of live web agents using site-side protocols, while browser automation and semantic HTML led adoption in 2026.

By · · 20 min read
Research dashboard comparing live browser stacks, MCP integrations, llms.txt publication, and site-side protocol adoption.
Research dashboard comparing live browser stacks, MCP integrations, llms.txt publication, and site-side protocol adoption.

AI agent protocol adoption has a strange gap in 2026. The standards are arriving faster than the runtime behavior. MCP is real. A2A is specified. llms.txt is being published by serious agent projects. Web Bot Auth is moving into production. Still, when agents browse the live web, the public evidence points somewhere more ordinary: screenshots, accessibility trees, DOM snapshots, CDP, Playwright, and manually configured tool servers.

For platform teams, "agent ready" now means two different things. One version is about manifests, cards, and machine-readable docs. The other is about whether today's agents can get through the product without getting lost. The second version is less tidy, but it is where the evidence keeps pointing.

This research extends our guides to agent skills discovery, A2A vs MCP architecture, AI agent API design, signed agent access, and llms.txt vs robots.txt.

Key Takeaways

  • CanAgentUse Research reviewed 8 agent systems in June 2026 and found 0 confirmed cases of live general-purpose web agents consuming target-site manifests during ordinary browsing.
  • In a preliminary 2,000-page form test with Hermes and OpenClaw, agents performed better on semantic, accessible pages and struggled on JavaScript-only controls.
  • Agents usually tried browser automation first, even when OpenAPI existed. They switched to OpenAPI after failure when the spec was visible in page context.

Methodology

In June 2026, CanAgentUse Research reviewed 8 agent systems across official docs, public repos, issue threads, and credible third-party technical analysis. We classified evidence as Use, Pub, Manual, Discussed, Closed, or None. A claim only counted as Use if public evidence showed runtime consumption, not just support language or a marketing mention.

The reviewed sample included OpenAI ChatGPT agent and Operator, OpenAI Computer Use API, Anthropic Claude Computer Use, Perplexity Comet, Browser Use, OpenClaw, Hermes Agent, and OpenAI Agents SDK. We removed browser infrastructure projects and MCP servers from the agent sample. This is not the whole market. It is a grounded slice of the agents site owners are most likely to encounter or benchmark against.

Original Data

Our main measurement was intentionally strict: does a real general-purpose agent automatically discover and consume a target website's llms.txt, llms-full.txt, auth.md, WebMCP endpoint, A2A Agent Card, /.well-known/agent.json, UCP file, ai.txt, sitemap.xml, or robots.txt during normal live website use? In this 8-agent sample, the answer was 0 confirmed cases.

ParameterValue
Sample size8 agent systems
Review windowJune 2026
Evidence typesOfficial docs, public repositories, issues, specs, and technical analyses
Positive thresholdRuntime consumption during operation
Negative thresholdNo public evidence found in reviewed sources
Main limitationClosed products may have private behavior not visible in public docs

We separated tool interoperability from website discovery. MCP, configured OpenAPI, and remote tool servers can be good integration paths, but a developer or operator usually wires them up. Website discovery is a different claim: the agent lands on a normal site and discovers a capability from the site itself.

CanAgentUse Research reviewed 8 agent systems in June 2026 and found 0 confirmed public cases of general-purpose live web agents consuming target-site llms.txt, A2A Agent Cards, WebMCP, UCP, ai.txt, robots.txt, or sitemap.xml as runtime instructions during ordinary browsing.

What did agents actually use in 2026?

In the 8-agent sample, every system with enough public technical detail pointed toward browser control, screenshot-based interaction, accessibility snapshots, DOM-derived mappings, CDP, browser automation, or a configured tool path. The short answer is that agents use the web the way automation tools use the web: they inspect what the browser exposes, then act through a controlled browser session.

For OpenAI's Computer Use API, the official guide describes a model that inspects screenshots and returns UI actions. It also mentions optional harnesses such as Playwright for DOM-based workflows, which makes DOM access a harness choice rather than a discovered website standard. Anthropic's Computer Use docs follow a similar screenshot and action pattern, with domain controls and operational safety guidance.

Open agent systems are often more explicit about their browser stack. Browser Use, OpenClaw, and Hermes all describe some combination of accessibility trees, refs, DOM maps, CDP, browser sessions, or screenshots. Perplexity Comet is less transparent officially, but public third-party technical analysis points toward DOM, accessibility-tree, and Chrome-debugger style control rather than target-site protocol discovery.

This is the unglamorous part, but it is where teams get the fastest return. If a checkout button has no accessible name, a static manifesto will not rescue the flow. If a dashboard hides important state in a canvas with no semantic fallback, the agent may have to guess from pixels. A clean accessibility tree is now part of agent infrastructure.

Runtime substratePublic adoption signal in the reviewed sampleWhy it matters
Accessibility trees and ARIA snapshotsHighAgents can extract labeled controls and structured page state
DOM-derived maps and refsHighAgents can connect observations to reliable actions
Screenshots and visual inspectionHighClosed computer-use products and some open stacks use pixels
CDP and Playwright controlHighAgents need a way to click, type, navigate, and observe
Site-side manifests during live browsing0 confirmed casesPublication did not equal runtime use in this sample

Unique Insight

The adoption story is almost backwards from the protocol conversation. The most used "agent protocol" for websites is not a new file format. It is the accessibility tree plus browser automation. That is a little annoying, because it means old web quality problems are still the agent-readiness problem.

For practical implementation guidance, see our AI agent API design guide and agent readiness guide.

In the CanAgentUse 2026 review, browser perception and control had the broadest visible adoption. Across 8 reviewed agent systems, public evidence pointed to screenshots, accessibility snapshots, DOM mappings, CDP, browser automation, or configured tools, while target-site protocol consumption had 0 confirmed runtime cases.

Is MCP adopted by real agents?

In June 2026, MCP showed clear adoption across several reviewed systems, but mostly as a manually configured integration layer. The Model Context Protocol tools specification defines how servers expose tools for models to list and call. That is real infrastructure, but it is not the same thing as a browser agent discovering an arbitrary website's hidden capability endpoint.

OpenAI's Agents SDK documents configured MCP servers. Anthropic documents MCP connectors and Claude Code integrations. Browser Use ships MCP server support. OpenClaw can act as an MCP server and manage saved MCP servers. Hermes has extensive MCP docs. Those signals matter, but they belong to configured tool paths rather than automatic site-side discovery.

The question is who configures the connection. In the public evidence we reviewed, MCP usually enters through a developer, operator, app setting, local server, remote server URL, or explicit connector. That makes it useful for product integrations and internal tooling. It does not yet prove broad website-side discovery.

That should shape the roadmap. If you want agents to operate your product reliably, an MCP server can be worth building. It can expose stable tools like search_orders, get_inventory, create_ticket, or export_report. If the plan depends on agents discovering that server automatically from a marketing site, the public evidence is not there yet.

Adoption labelWhat it meansMCP status in this review
UseThe agent consumes it at runtimeConfirmed for configured tool/server contexts
ManualA user or developer configures itCommon in public evidence
Website discoveryA target site advertises it and the browser agent consumes itNot confirmed in the reviewed sample

The architecture rule is simple: use MCP for dependable tool paths, not as a replacement for a usable web UI. For deeper planning, our A2A vs MCP guide, MCP server SEO guide, and OpenAPI to MCP guide cover where tool surfaces fit.

MCP adoption is visible in 2026, but the CanAgentUse review found it mainly in manually configured tool and server integrations. OpenAI, Anthropic, Browser Use, OpenClaw, and Hermes expose or document MCP paths, while automatic target-site MCP discovery was not publicly confirmed.

Do llms.txt and llms-full.txt get consumed by browser agents?

In the 8-agent sample, 3 projects were confirmed publishers of llms.txt or llms-full.txt: Browser Use, OpenClaw, and Hermes. Publication matters, especially for documentation and coding assistants. Still, the review found 0 confirmed cases where a general-purpose browser agent consumed a target site's llms.txt file during ordinary live browsing.

Browser Use publishes llms.txt and llms-full.txt, and its llms.txt links to its own OpenAPI spec. OpenClaw publishes both files and links its sitemap and robots policy. Hermes publishes both files and documents them as machine-readable entry points. Those are real publishing records.

Hermes gave the clearest caution. A public issue stated that the project's published llms.txt and llms-full.txt files were "not wired into the agent itself." That is the gap in one sentence. A project can believe in machine-readable docs, publish them, and still leave them outside the runtime agent loop.

This does not make llms.txt useless. It means the use case is narrower than some people imply. For docs-heavy products, SDKs, API references, pricing pages, integration guides, and support content, llms.txt can help retrieval systems and coding agents find the right material. It should sit beside sitemap, docs navigation, OpenAPI, and clean markdown content. It should not become your only agent-readiness bet.

Projectllms.txt or llms-full.txt statusRuntime consumption evidence
Browser UsePublishes bothNot confirmed as target-site browser input
OpenClawPublishes bothNot confirmed as target-site browser input
Hermes AgentPublishes bothPublic issue says files were not wired into the agent itself

For implementation details, start with llms.txt vs robots.txt, then connect it to agent skills discovery if your product exposes reusable actions or instructions.

CanAgentUse Research found 3 confirmed llms.txt or llms-full.txt publishers in the reviewed sample, but 0 confirmed browser-agent consumption cases. Browser Use, OpenClaw, and Hermes publish files, while Hermes publicly noted that its files were not wired into the agent runtime.

What did our form testing show?

In a preliminary CanAgentUse Research test across 2,000 public and purpose-built web pages, Hermes and OpenClaw behaved like browser operators first. The tested agents were asked to complete different form flows across semantic pages, accessible pages, JavaScript-heavy pages, pages with OpenAPI references, and pages with weak success or failure signals.

HTML quality was the clearest pattern. Agents struggled most when inputs were fully JavaScript-driven and did not expose useful semantic HTML tags. They performed better when pages used normal form elements, labels, accessible names, clear button text, and visible submission states. This matches the desk research: the accessibility tree and browser state are not side details. They are often the working interface.

Original Data

The OpenAPI behavior surprised us more. Agents preferred browser automation even when an OpenAPI document was available. They usually tried to fill and submit the form first. When that attempt failed, they almost always switched to openapi.json quickly if the spec was visible through a meta tag or mentioned in the body near the form. We did not see the same behavior for standard OpenAPI discovery paths.

Agents can use API contracts when the page puts the contract into the task context. They did not reliably go hunting through standard discovery locations before interacting with the page. "We have OpenAPI" is weaker than "this exact workflow tells the agent where the relevant OpenAPI contract is."

Two failure modes kept showing up outside the protocol conversation. Agents still struggled with website firewalls and captchas. They also struggled to know whether they had succeeded when the submission page gave weak signals. If success and failure states looked similar, or if validation errors were subtle, agents got confused and sometimes repeated the wrong action.

Preliminary findingWhat we sawPractical implication
JavaScript-only inputs caused the most troubleAgents had weak field understanding and brittle interactionsUse semantic form controls and accessible labels
Accessible pages performed betterLabels, button text, and states gave agents a usable mapTreat accessibility as agent infrastructure
Browser automation came firstAgents filled forms before using available API docsKeep browser flows reliable even when APIs exist
Visible OpenAPI helped after failureAgents switched when openapi.json was in page contextLink the relevant spec near the workflow
Captchas and firewalls blocked progressAgents struggled to pass or reason through bot defensesProvide verified agent paths for allowed automation
Weak result states caused confusionAgents could not tell success from failure reliablyMake submission outcomes explicit in text and DOM

These findings are preliminary. We are preparing a full research paper with the page taxonomy, prompts, task setup, scoring rubric, success criteria, and failure examples. For this article, the takeaway is already clear: agent readiness is not just protocol publication. It is whether a live agent can understand the page, attempt the task, recover from failure, and know when the job is done.

In a preliminary 2,000-page CanAgentUse Research test, Hermes and OpenClaw preferred browser automation first, performed better on semantic accessible forms, struggled with JavaScript-only controls, captchas, and weak result states, and switched to OpenAPI mainly after browser failure when the spec appeared in page context.

Are A2A, Agent Cards, WebMCP, UCP, and ai.txt being used on live sites?

In the reviewed materials, A2A, Agent Cards, WebMCP, UCP, /.well-known/agent.json, auth.md, navigator.modelContext, and ai.txt had 0 confirmed cases of live general-purpose browser agents consuming them from target websites. That does not make these ideas dead. It means the public runtime evidence has not caught up with the protocol discussion.

A2A is worth watching because it has a clear job: agent-to-agent delegation. The A2A specification defines Agent Cards, task state, discovery, streaming, and agent collaboration. That is different from MCP, which is better understood as a tool-calling surface. Our finding is not that A2A lacks value. Our finding is narrower: we did not find public evidence that the reviewed live browser agents consume target-site Agent Cards during normal browsing.

UCP is also worth tracking for commerce, especially because it discusses multiple transports and commerce semantics. But for ordinary shopping tasks in the reviewed agent materials, the evidence pointed to browser operation, user approvals, screenshots, forms, product pages, and sometimes APIs. We did not find public evidence that general-purpose browser agents were UCP-native buyers.

WebMCP and browser-native context ideas face a tougher proof problem. A site-side handshake sounds attractive, but the agent has to trust it, authenticate it, fit it into the browser runtime, and defend against prompt injection. Until there is native browser support or a widely adopted discovery handshake, these ideas will likely stay behind browser automation and manual integrations.

Protocol or filePublic status in this reviewBest current interpretation
A2A Agent CardsSpecified, not confirmed as target-site browser inputUseful for agent delegation planning
WebMCPDiscussed, not confirmed in the reviewed runtime pathsWatch, but do not depend on it yet
UCPEmerging commerce protocol, no confirmed general-agent browsing useTrack for commerce APIs and agentic checkout
/.well-known/agent.jsonNo confirmed consumption in reviewed systemsCandidate future discovery surface
ai.txtNo confirmed consumption in reviewed systemsToo early to treat as runtime-critical

CanAgentUse should be strict here. A site can get credit for publishing emerging surfaces, but the score should distinguish "published" from "currently consumed by known agents." Otherwise, readiness reports will reward speculation as if it were adoption.

In the CanAgentUse 2026 review, A2A Agent Cards, WebMCP, UCP, /.well-known/agent.json, auth.md, navigator.modelContext, and ai.txt had 0 confirmed cases of target-site runtime consumption by the 8 reviewed agent systems.

Why is protocol adoption slower than protocol publication?

In the CanAgentUse review, 3 projects published llms.txt files, several systems used MCP manually, and 0 confirmed cases showed target-site protocol consumption during normal browsing. The main reason is practical: browser agents already have a working way to perceive and control the web.

Accessibility snapshots, DOM maps, screenshots, CDP, and Playwright are imperfect, but they work across arbitrary sites. They also preserve live state. A static file cannot tell an agent what is currently in a user's cart, whether a modal is open, which account is logged in, or which validation error just appeared after form submission.

Security is the second barrier. If an agent consumes natural-language instructions from a website, prompt injection becomes part of the protocol. Vendor safety docs and public agent reports show that browser-agent teams are already dealing with injected instructions, secret exposure risks, and user approval boundaries. Adding more text that agents are expected to trust is not free.

Authentication adds more friction. Agent tasks often cross logged-in pages, payment flows, enterprise apps, and private dashboards. A static discovery file rarely has enough context to do the job. A configured MCP server, OAuth connector, or signed agent request can handle trust more cleanly, but those systems require setup.

The last barrier is incentives. Sites will not invest heavily until agents consume the surfaces. Agent vendors will not depend on the surfaces until enough sites publish them correctly. Browser automation wins because it avoids that coordination problem. It uses the web that already exists.

Unique Insight

The blocker is not ignorance. Many agent builders know about these standards. The problem is harder: site-side discovery has to beat a messy but universal fallback, the browser itself. Until a protocol handles state, auth, trust, freshness, and interaction better than the browser loop, it will remain supplemental.

Protocol publication is ahead of protocol consumption because browser automation already works across arbitrary sites. CanAgentUse Research found 0 confirmed target-site protocol consumption cases, while accessibility snapshots, DOM maps, screenshots, CDP, and browser control appeared across the reviewed agent stack.

What should platform teams do now?

In 2026, the evidence points to a simple order: make the UI agent-usable first, expose programmable workflows second, publish supplemental discovery files third. That order matches the public adoption pattern: browser perception is widespread, manual MCP is real, and target-site protocol consumption remains unconfirmed.

Start with accessibility. Use semantic controls. Give every important button and input a stable accessible name. Make form labels explicit. Avoid icon-only critical actions unless the icon has a reliable label. Keep keyboard interaction working. If the accessibility tree is the agent's map, unlabeled UI is missing road signs.

Next, make browser interactions predictable. Keep important actions visible. Avoid brittle multi-step flows where state only appears in a canvas, animation, or custom widget. Use stable URLs for meaningful states where possible. Keep modals and validation errors legible in the DOM. Agents are better at careful interaction when the product behaves like a well-built web app.

Then expose tools for workflows where reliability matters. If agents need to query inventory, file tickets, update account records, export reports, or check order status, offer an API and consider MCP. Tool surfaces let you enforce auth, rate limits, schemas, confirmations, and audit logs more cleanly than raw browser operation.

Publish llms.txt and related docs as a supplement. They can help retrieval, coding assistants, and external documentation flows. Pair them with OpenAPI, sitemaps, clear docs, and structured pages. Do not treat them as a replacement for accessible UI or first-party APIs.

For trust and traffic control, support agent identity where vendors provide it. OpenAI's signed request and .well-known/http-message-signatures-directory path is one concrete example. For agent-facing commerce or sensitive flows, pair verification with explicit approvals, not silent autonomous writes.

PriorityWhat to shipWhy it matters
1Accessible, predictable UIHelps the browser agents that already exist
2Stable APIs and MCP toolsGives configured agents reliable actions
3llms.txt, docs maps, agent skill docsHelps retrieval and documentation use cases
4A2A or Agent Cards where delegation fitsPrepares for agent-to-agent workflows
5Agent identity and signed request handlingHelps distinguish trusted agents from generic automation

For a concrete implementation path, connect this article to our signed agents guide, auth.md and DNS-AID setup guide, agent card discovery guide, and agent payments guide.

Platform teams should prioritize accessible UI, predictable browser flows, and explicit APIs or MCP tools before betting on automatic site-side protocol discovery. In CanAgentUse Research's 8-agent review, those practical surfaces had public adoption evidence, while target-site protocol consumption had 0 confirmed cases.

What should CanAgentUse measure?

In 2026, an honest agent-readiness scanner should score evidence in layers, because the 8-agent review found a sharp gap between publication and runtime consumption. CanAgentUse should reward emerging files, but it should score them differently from surfaces that agents visibly use today.

A useful scoring model has four buckets. The first bucket is browser usability: accessibility tree quality, labeled controls, forms, navigation, page state, and obvious interaction paths. The second is programmable capability: OpenAPI, MCP, action schemas, auth, rate limits, and safe write operations. The third is machine-readable discovery: llms.txt, sitemaps, robots policy, agent cards, and related well-known files. The fourth is trust: signed agent requests, bot policy, consent, and abuse controls.

The wording matters. A report should say "published but not known to be consumed," not "agent supported." It should say "manual MCP available," not "automatic agent discovery." That language helps teams make real product decisions instead of chasing checkboxes.

This also helps GEO. AI search systems prefer crisp, extractable claims. A page that says exactly what is confirmed, what is manual, and what is unconfirmed is easier to cite than a page that blends all protocol ideas into one optimistic category.

Scanner labelMeaningExample
Consumed by known agentsPublic evidence shows runtime useAccessibility-oriented form interaction in Hermes and OpenClaw
Manually configurableWorks after setupMCP server URL or configured connector
Published by siteSite exposes a file or cardllms.txt or Agent Card
Proposed or unconfirmedInteresting, but no runtime evidenceWebMCP discovery without adoption proof

CanAgentUse should separate runtime consumption, manual configuration, publication, and unconfirmed proposals. The 2026 research found that collapsing those categories would overstate adoption, especially for llms.txt, A2A Agent Cards, WebMCP, UCP, and other site-side discovery ideas.

FAQ

What is AI agent protocol adoption?

AI agent protocol adoption means real agents use a protocol during operation, not merely that a spec exists. In this review, CanAgentUse Research analyzed 8 agent systems and found 0 confirmed cases of target-site manifest consumption during normal browsing, while browser perception and manual MCP integrations had visible adoption.

Is MCP the winning AI agent protocol?

MCP is one of the strongest adopted protocols in this review, but its adoption is mostly manual. Several reviewed ecosystems document or ship MCP paths, including OpenAI, Anthropic, Browser Use, OpenClaw, and Hermes. That makes MCP useful for integrations, not proof of automatic website discovery.

Should we publish llms.txt in 2026?

Yes, publish llms.txt if your site has docs, APIs, SDKs, or support pages that agents should retrieve. In this review, 3 projects published llms.txt or llms-full.txt, but 0 confirmed cases showed live browser agents consuming target-site files during ordinary browsing. Treat it as supplemental.

Are A2A Agent Cards useful yet?

A2A Agent Cards are useful for planning agent-to-agent delegation, especially when another agent owns a task, state, or policy layer. In this review, however, 0 confirmed cases showed the 8 reviewed agent systems consuming target-site Agent Cards during live browsing. Build them when delegation is real.

Can I cite this research?

Yes. Cite it as: CanAgentUse Research, "AI agent protocol adoption: we analyzed 8 agents in 2026," CanAgentUse, June 16, 2026. The core finding is that 8 agent systems were reviewed and 0 confirmed public cases showed target-site protocol consumption during ordinary live web browsing.

Source notes

Data appendix

Agent systemBrowser perception and controlMCP or tool integrationllms.txt publicationTarget-site protocol consumption
OpenAI ChatGPT agent or OperatorBrowser-like computer useNot confirmed for browsing docsNone foundNone confirmed
OpenAI Computer Use APIScreenshots, UI actions, optional harnessesNone as browsing discoveryNone foundNone confirmed
Anthropic Claude Computer UseScreenshots, mouse, keyboardManual MCP in separate productsNone foundNone confirmed
Perplexity CometThird-party evidence for DOM, AX, CDPNone found officiallyNone foundNone confirmed
Browser UseAccessibility tree, CDP, browser controlManual and published MCP pathsPublishesNone confirmed
OpenClawUI tree or ARIA snapshots, Chrome MCP, CDPManual and published MCP pathsPublishesNone confirmed
Hermes AgentAccessibility snapshots, CDP, multiple browser backendsManual and published MCP pathsPublishesNone confirmed
OpenAI Agents SDKOrchestration frameworkManual MCP server configurationNone foundNone confirmed

Citation format:

CanAgentUse Research. "AI agent protocol adoption: we analyzed 8 agents in 2026." CanAgentUse, June 16, 2026. https://canagentuse.com/blog/ai-agent-protocol-adoption-research