Knowledge base
All checks
49 signals AI agents use to decide whether to cite your website. Each one has a dedicated page explaining what it is, why it matters, and how to fix it.
Agent Readiness
(5)- AUTO
skill.md file present
A markdown file at the root of your site that describes your product to AI agents in plain language.
- AUTO
skill.md provides a name
Your skill.md file should declare a product name in a YAML frontmatter block so agents can identify it unambiguously.
- AUTO
skill.md name format valid
The name field in your skill.md must contain only lowercase letters, numbers, and hyphens.
- AUTO
/skill JSON endpoint
A dynamic API endpoint that returns structured JSON about your product's capabilities, audience and pricing.
- AUTO
.well-known/agent.json
The standard agent discovery file, analogous to robots.txt but written specifically for AI agents.
Machine Access
(5)- AUTO
Homepage returns HTTP 200
Your homepage must respond with a 200 status code when an AI crawler fetches it.
- AUTO
Core content visible without JavaScript
Your key content must be present in the initial HTML response. Most AI crawlers do not execute JavaScript.
- AUTO
robots.txt does not block key pages
Your robots.txt must allow crawlers to reach your homepage and main content.
- AUTO
llms.txt file present
A markdown file at /llms.txt that gives AI models a curated map of your most important content.
- AUTO
Sitemap present and reachable
An XML sitemap at /sitemap.xml that lists all your indexable pages.
Content
(7)- AUTO
Logical heading hierarchy
Your page should have exactly one H1 and a nested H2/H3 structure that reflects the content outline.
- AUTO
Meta description present
A meta description tag between 50 and 160 characters that summarizes the page.
- AUTO
Sufficient content length
Pages should have at least 300 words of substantive content. Thin pages rarely get cited.
- SEMI
Clear, plain language
Your content should be readable by a general audience without industry jargon.
- SEMI
No keyword stuffing
Avoid repetitive keyword-heavy phrases. Keyword stuffing reduces AI visibility by around 30%.
- SEMI
Content includes statistics
Adding specific statistics to your content boosts AI visibility by around 37%.
- SEMI
Content includes quotations
Including customer quotes or expert testimonials boosts AI visibility by around 40%.
Structure & Architecture
(5)- AUTO
Healthy internal link structure
Your pages should link to each other with descriptive anchor text so crawlers can discover them.
- AUTO
Canonical URL set correctly
Each page should declare a canonical URL so crawlers know which version to index.
- AUTO
Mobile-friendly meta tag
Your pages should have a viewport meta tag and responsive layout.
- AUTO
Page loads in acceptable time
Slow pages get skipped by AI crawlers. Target under 2 seconds to first content.
- AUTO
Clean, descriptive URL slugs
URLs should be human-readable and reflect the page content.
Entities & Semantics
(5)- AUTO
Schema.org structured data
JSON-LD markup that tells crawlers exactly what kind of entity your page describes.
- AUTO
Open Graph tags present
og:title, og:description, and og:image meta tags so your pages preview well when shared.
- AUTO
About page with entity info
A dedicated About page that tells crawlers who you are, where you are, and why you exist.
- SEMI
Business info findable
An AI crawler should be able to identify your name, what you do, and how to contact you from your homepage alone.
- SEMI
Consistent NAP (Name, Address, Phone)
Your Name, Address, and Phone must be identical everywhere they appear.
Trust & Authority
(4)- AUTO
HTTPS enabled
Your site must be served over HTTPS with a valid TLS certificate.
- AUTO
Privacy policy present
A linked privacy policy is a baseline trust signal and legally required in most jurisdictions.
- SEMI
E-E-A-T signals present
Experience, Expertise, Authoritativeness and Trustworthiness signals measurably boost AI citations.
- SEMI
Outbound links to authoritative sources
Linking out to trusted external sources raises your own trustworthiness.
Conversational Readiness
(4)- AUTO
FAQ section or Q&A structured data
An FAQ section answers the questions users actually ask — exactly what AI assistants are looking for.
- AUTO
Clear contact and call-to-action options
A visible way to contact you or take the next step, linked from the homepage.
- SEMI
Clear service or product descriptions
An AI should be able to answer 'what does this business offer?' from your homepage text alone.
- SEMI
Unique value proposition identifiable
An AI should be able to answer 'why choose this over competitors?' from your content.
Crawl Permissions
(3)- AUTO
GPTBot not blocked in robots.txt
OpenAI's crawler must be allowed to access your site if you want to appear in ChatGPT answers.
- AUTO
ClaudeBot not blocked
Anthropic's ClaudeBot must be allowed to access your site to appear in Claude's answers.
- AUTO
PerplexityBot not blocked
Perplexity's crawler must be allowed if you want to appear in Perplexity search results.
Extractability
(4)- AUTO
llms.txt lists key pages with descriptions
An llms.txt that exists is good. An llms.txt with clear page descriptions is much better.
- AUTO
Content updated within 3 months
There is a measurable freshness cliff at three months. Older content gets cited less.
- AUTO
Content structure converts cleanly to markdown
A page whose HTML can be extracted to clean markdown is easier for models to consume.
- SEMI
Self-contained, quotable paragraphs
Write key facts as 1-3 sentence paragraphs that stand alone without surrounding context.
Authority Signals
(4)- SEMI
Brand name consistent and prominent
Brand authority is the single strongest predictor of AI citations, with a 0.334 correlation.
- SEMI
Author bios with credentials
Named authors with visible credentials are a major authority signal.
- SEMI
Social proof and testimonials
Customer testimonials, case studies, and reviews establish you as a real, used product.
- SEMI
Certifications and awards
Industry certifications, awards, and partnerships are durable authority markers.
Technical Performance
(3)- AUTO
Server response time under 500ms
Your server should return the first byte within 500 milliseconds of the request.
- AUTO
Clean HTML without critical errors
Your HTML should parse cleanly without unclosed tags, duplicate IDs, or structural errors.
- AUTO
Good text-to-HTML ratio
At least 25% of your page's raw size should be actual content text, not markup.