AI / GEO
AI robots.txt Generator
Control which AI crawlers can access your site. Block LLM training bots while allowing retrieval agents — or find your own balance.
What gets checked
- 18 AI and LLM crawlers across OpenAI, Anthropic, Google, Meta, and more
- Preset modes: block training only, allow all, block all, or selective
- Distinguishes training crawlers from retrieval agents
- Sitemap URL injection for complete robots.txt output
- One-click copy and download as robots.txt
Quick preset
AI bot permissions
BotCompanyPurposeStatus
GPTBot
GPTBot
ChatGPT-User
ChatGPT-User
OAI-SearchBot
OAI-SearchBot
ClaudeBot
ClaudeBot
Claude-Web
Claude-Web
Google-Extended
Google-Extended
PerplexityBot
PerplexityBot
Bytespider
Bytespider
CCBot
CCBot
PetalBot
PetalBot
Diffbot
Diffbot
ImagesiftBot
ImagesiftBot
cohere-ai
cohere-ai
AI2Bot
AI2Bot
anthropic-ai
anthropic-ai
Meta-ExternalAgent
meta-externalagent
FacebookBot
FacebookBot
Applebot-Extended
Applebot-Extended
2 blocked · 16 allowed
Sitemap URL (optional)
robots.txt
# robots.txt — generated by SEOCheckPilot # Standard crawlers (always allowed) User-agent: * Allow: / # AI / LLM crawlers — blocked User-agent: Bytespider Disallow: / User-agent: ImagesiftBot Disallow: / # AI / LLM crawlers — allowed User-agent: GPTBot Allow: / User-agent: ChatGPT-User Allow: / User-agent: OAI-SearchBot Allow: / User-agent: ClaudeBot Allow: / User-agent: Claude-Web Allow: / User-agent: Google-Extended Allow: / User-agent: PerplexityBot Allow: / User-agent: CCBot Allow: / User-agent: PetalBot Allow: / User-agent: Diffbot Allow: / User-agent: cohere-ai Allow: / User-agent: AI2Bot Allow: / User-agent: anthropic-ai Allow: / User-agent: meta-externalagent Allow: / User-agent: FacebookBot Allow: / User-agent: Applebot-Extended Allow: /
Upload as robots.txt to your site root at
example.com/robots.txt. The file must be served with Content-Type text/plain.