AI Crawlability Test
GuidesBy Soorya· April 1, 2025· 7 min read

What is llms.txt and Why Every Website Needs One

Learn about the llms.txt specification — the emerging standard that helps AI agents and large language models understand and navigate your website's content.

In the age of AI-powered search, a new file is rapidly becoming as essential as robots.txt: llms.txt. While robots.txt tells crawlers what they can access, llms.txt tells large language models (LLMs) how to understand and work with your site. If you've never heard of it, now is the time to pay attention.

What Is llms.txt?

llms.txt is a plain-text Markdown file placed at the root of your website (/llms.txt). It provides AI agents with a structured, human-readable overview of your site: what it does, what content it contains, and where to find key pages. Think of it as a table of contents written specifically for artificial intelligence.

The specification was introduced by fast.ai and documented at llmstxt.org. It defines a minimal format: a title, an optional description, and a list of sections containing links with short descriptions. This simple structure allows LLMs to quickly orient themselves within your site's content — even before crawling individual pages.

The llms.txt Format

A valid llms.txt file looks like this:

# My Company

> We build developer tools for the AI era.

## Documentation

- [Getting Started](/docs/start.md): How to set up and run your first project.
- [API Reference](/docs/api.md): Complete reference for all API endpoints.

## Blog

- [What is llms.txt](/blog/what-is-llms-txt.md): Introduction to the llms.txt standard.

The # heading is the site name. The > blockquote is a one-line description. Sections (##) group related links, and each link entry includes a short description after the colon.

Why AI Agents Need llms.txt

When a large language model like Claude or ChatGPT visits a website, it often needs to answer a specific question quickly. Without guidance, it must crawl multiple pages, parse HTML, ignore navigation and ads, and piece together a mental model of your site — all within a limited context window.

llms.txt eliminates this friction. By providing a curated map of your most important content, you help AI agents find the right information faster, produce more accurate answers about your products or services, and represent your brand correctly to users who ask about you.

  • AI agents can answer questions about your site accurately without guessing
  • Reduces the chance of LLMs hallucinating incorrect information about your brand
  • Helps AI-powered search surfaces (Perplexity, ChatGPT, Claude) cite your content
  • Works alongside per-page .md files for granular content access
  • Future-proofs your site as AI browsing becomes mainstream

llms.txt vs. Other Crawl Files

You might wonder how llms.txt fits alongside the files you already have. Here's the distinction:

  • robots.txt — Controls access: which bots can visit which paths
  • sitemap.xml — Controls discovery: lists all URLs so crawlers know what exists
  • llms.txt — Controls comprehension: explains what your site is about and where key content lives

All three serve different purposes and all three should coexist. A well-prepared site has all of them.

Per-Page Markdown Companions

llms.txt often links to Markdown versions of your pages (e.g., /about.md alongside /about). These per-page Markdown files give AI agents clean, distraction-free content — no navigation, no ads, no JavaScript. Pure text.

If your site has a blog post at /blog/how-to-start, adding a companion at /blog/how-to-start.md means AI agents can read the content directly, without executing JavaScript or stripping HTML boilerplate. This dramatically improves the quality of AI-generated answers that cite your work.

How to Create Your llms.txt Today

  1. Open a text editor and create a file named llms.txt
  2. Add a # heading with your site or brand name
  3. Add a > blockquote with a one-sentence description of your site
  4. Create ## sections for major content areas (Docs, Blog, Products, etc.)
  5. List your most important pages as Markdown links with descriptions
  6. Place the file at the root of your site (accessible at yoursite.com/llms.txt)
  7. Optionally add /.well-known/llms.txt as a mirror

Test your llms.txt right now: enter your URL in the AI Crawlability Test tool above to see if your file is detected and properly formatted.

The Bigger Picture

llms.txt is one piece of a larger AI visibility puzzle. Combined with a proper robots.txt that explicitly allows AI crawlers, a sitemap.xml for page discovery, structured data (JSON-LD) for machine-readable context, and Open Graph tags for rich previews, it forms a comprehensive foundation for AI-era web presence.

Early adopters who implement llms.txt today are positioning their sites to be cited, referenced, and recommended by AI systems as those systems become the primary interface between people and information on the web.

llms.txtAI crawlabilityLLMAI agentssite manifest

More articles

Check your site's AI crawlability

Free, instant, live checks — no account required.

Run free test →