Architecture

NLWeb Website Endpoints: Machine-Readable Interfaces for Agents

NLWeb gives websites natural-language endpoints for humans and AI agents. Learn how it uses Schema.org, RSS, MCP, and site data.

Updated June 28, 2026

NLWeb is an open-source Microsoft project for adding natural-language interfaces to websites. It matters for agent-ready sites because it turns existing site data, such as Schema.org and RSS, into endpoints that can serve humans and AI agents. It is a practical bridge between content, search, and MCP-style tools.

What NLWeb is#

The Microsoft NLWeb repository describes NLWeb as a way to build conversational interfaces for websites. Its README says it natively supports MCP, uses Schema.org and RSS-like semi-structured formats, and provides open protocols plus implementation code.

That makes NLWeb relevant for sites that want agents to ask structured questions instead of scraping pages blindly.

NLWeb should not be treated as a magic SEO layer. It is closer to a website interface pattern: a way to expose the site’s own knowledge and records through natural-language access.

Why NLWeb is a new keyword opportunity#

Most websites still think in three layers:

  1. HTML pages for humans.
  2. Sitemaps for search engines.
  3. APIs for developers.

NLWeb points to a fourth layer: natural-language endpoints for agents and AI-assisted users.

LayerMain userTypical format
WebsiteHuman visitorHTML
SitemapSearch crawlerXML
Structured dataSearch and AI systemsSchema.org JSON-LD
NLWeb endpointHuman or AI agent asking a task questionNatural-language interface backed by site data
MCP toolAgent that needs a structured actionTool schema and endpoint

For related architecture, see MCP Resources vs Tools vs Prompts and MCP vs API for Agents.

Which sites should watch NLWeb first#

NLWeb is most interesting for sites with rich, queryable data:

  1. Ecommerce catalogs.
  2. Travel inventory.
  3. Real estate listings.
  4. Job boards.
  5. Documentation portals.
  6. Local directories.
  7. Support knowledge bases.
  8. Event databases.

These sites already contain structured facts. NLWeb gives them a path toward conversational and agent-readable retrieval.

How to prepare for NLWeb#

Even if you do not implement NLWeb yet, prepare the data layer:

  1. Use consistent entity names across pages and structured data.
  2. Keep Schema.org markup aligned with visible content.
  3. Maintain clean feeds for products, articles, events, listings, or docs.
  4. Create stable canonical URLs for every important entity.
  5. Document update frequency and availability rules.
  6. Decide which data agents may query and which data requires authentication.
  7. Map possible questions to source data fields.

This overlaps with Programming Websites for AI Agents and Agent-Ready Web Apps.

NLWeb vs scraping#

QuestionScrapingNLWeb-style endpoint
Does the site control the answer shape?Usually noYes
Can the agent ask a focused question?IndirectlyYes
Is the output tied to source data?Often fragileMore controllable
Is access policy easier to enforce?HarderEasier if designed well
Does it replace normal pages?NoNo

The strategic benefit is control. If agents are going to query your site anyway, a structured endpoint can be safer and more accurate than forcing them through visual pages.

FAQ#

Is NLWeb an official web standard?#

NLWeb is an open-source Microsoft project with open protocols and implementation code. Treat it as an emerging approach, not a settled universal standard.

Does NLWeb replace Schema.org?#

No. NLWeb builds on site data such as Schema.org and RSS-like formats. Good structured data still matters.

Is NLWeb the same as MCP?#

No. NLWeb can support MCP, but MCP is a protocol for tools and context exchange. NLWeb is focused on natural-language website interfaces.

Should every website implement NLWeb now?#

No. Start with sites where users or agents need to ask detailed questions against structured site data.

Sources#

Primary and reference sources: Microsoft NLWeb GitHub repository, NLWeb README raw source, Model Context Protocol documentation, Schema.org, and Google structured data documentation.