What an llms.txt validator does
This validator fetches the `/llms.txt` file (and optionally `/llms-full.txt`) from your domain, parses it as Markdown per the llmstxt.org spec, and shows you exactly what an AI agent will see: H1 title, blockquote summary, H2 sections with their bullet links.
`llms.txt` is essentially a site map written for language models, similar to how `robots.txt` works for search crawlers. ChatGPT, Claude, Perplexity and other AI tools use it to understand a site's structure without grinding through the full HTML. A good `llms.txt` measurably improves the quality of AI answers about your product.
The validator does three jobs: it shows the raw file, it parses it into a tree (title, summary, sections, links), and it HEAD-probes the first 20 links (with a GET-range fallback) so you instantly see if any reference returns a 404.
How to use it
- Enter a domain (e.g. `example.com`) or a full URL ending in `/llms.txt`. We auto-prepend `https://` if missing.
- Pick a file: `llms.txt` (concise navigation), `llms-full.txt` (full content in one file) or Both - we fetch both so you can diff them.
- Click "Check". We fetch with an 8-second timeout, parse the Markdown, and probe the links.
- The Parsed tree panel shows the title, summary, and H2 sections with their bullets. Each link gets a colored health badge (200 / 3xx / 4xx / timeout).
- The Issues panel flags missing pieces: no H1, no sections, malformed bullet, suspicious Content-Type, HTTP error. Each issue carries a line number.
- Raw view in a collapsible section shows exactly what we received from the server - handy for diffing against your CMS output.
- Fix, redeploy, hit "Check" again. The validator never caches; every request fetches fresh.
When this is useful
Six situations where the `llms.txt` validator saves you a debugging session:
- First-time `llms.txt` rollout. You just shipped one and want to confirm the format is correct before AI agents discover it. The validator catches missing H1s, malformed bullets, and relative URLs that some parsers reject.
- CMS migration aftermath. Your link list in `llms.txt` may now point to dead pages. The 4-second health probe shows you every bullet that 404s.
- AI-SEO audit deliverable. A client wants a "how does your site look to ChatGPT and Claude" report - the validator output is a clean, copy-pasteable starting point.
- Diffing variants: the "Both" mode pulls `llms.txt` and `llms-full.txt` in parallel so you can confirm they didn't drift out of sync (different sections, stale dates, missing pages).
- Your build-step generator started shipping a broken file. Anthropic's parser refuses it but the error message is opaque. The validator points you at the exact line.
- Competitive research. Drop in a competitor's domain, study their `llms.txt` structure, learn what they think is worth surfacing.
Related: robots.txt + sitemap.xml validator, OpenGraph preview, DNS lookup, HTTP headers inspector.