Free browser tool

AI Robots.txt Generator

Choose whether GPTBot, OAI-SearchBot, ClaudeBot, Google-Extended, PerplexityBot, and other crawlers may access your site. Nothing is uploaded.

Runs locally

1. Site settings

Enter your public domain and choose a starting policy preset.

Site URL

Policy preset

Search open, training blocked: Allow search and user-requested fetches; block training crawlers.

Block AI training: Block training-oriented AI crawlers while keeping Google Search open.

Block most AI crawlers: Use when access policy is stricter than search coverage.

Allow crawlers: Allow major AI and search crawlers, while private paths stay blocked.

Protected paths

Quick add

One path per line. These are added under User-agent: *.
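
As a rough illustration of the one-path-per-line input described above, the sketch below turns a block of text into a `User-agent: *` group. The function name `wildcard_group` and the cleanup rules (adding a leading slash, dropping duplicates) are assumptions made for this example, not the tool's actual code.

```python
def wildcard_group(paths_text: str) -> str:
    """Turn one-path-per-line input into a `User-agent: *` group."""
    lines = ["User-agent: *"]
    seen = set()
    for raw in paths_text.splitlines():
        path = raw.strip()
        if not path:
            continue  # skip blank lines
        if not path.startswith("/"):
            path = "/" + path  # assumed cleanup: root the path at "/"
        if path in seen:
            continue  # assumed cleanup: drop duplicate paths
        seen.add(path)
        lines.append(f"Disallow: {path}")
    return "\n".join(lines) + "\n"


# Example input: two distinct paths, one blank line, one duplicate.
print(wildcard_group("/account/\ncheckout/\n\n/account/"))
```

With that input the group contains two `Disallow` lines, one for `/account/` and one for `/checkout/`.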

2. Crawler controls

Decide which crawler tokens are allowed or blocked.

3. Generated robots.txt

Publish this as /robots.txt at the root of the exact host you want to control. Each subdomain needs its own file.
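
Because the file applies per host, it can help to work out where it must live for a given page URL. The following is a minimal sketch using Python's standard `urllib.parse`; the helper name `robots_url` and the example hosts are hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(site_url: str) -> str:
    """Return the robots.txt location for the host that serves site_url."""
    parts = urlsplit(site_url)
    if not parts.scheme or not parts.netloc:
        raise ValueError("expected an absolute URL such as https://example.com")
    # Keep scheme and host (including any port); replace path, query, fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


print(robots_url("https://www.example.com/blog/post?id=1"))
# https://www.example.com/robots.txt
print(robots_url("https://shop.example.com"))
# https://shop.example.com/robots.txt
```

The two example hosts resolve to different files, so rules published on one do not cover the other.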

Robots.txt validator

Find obvious rule problems
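
To give a sense of what an "obvious rule problem" can look like, here is a small sketch of a line-based check. It is not the validator this page runs; the function `lint_robots`, the set of accepted fields, and the message wording are all assumptions chosen for illustration.

```python
def lint_robots(text: str) -> list[str]:
    """Report a few obvious problems in a robots.txt body."""
    problems = []
    in_group = False  # becomes True once a User-agent line has been seen
    for number, raw in enumerate(text.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        if ":" not in line:
            problems.append(f"line {number}: missing ':' separator")
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            in_group = True
            if not value:
                problems.append(f"line {number}: empty user-agent")
        elif field in ("allow", "disallow"):
            if not in_group:
                problems.append(f"line {number}: rule before any user-agent")
            if value and not value.startswith(("/", "*")):
                problems.append(f"line {number}: path should start with '/'")
        elif field not in ("sitemap", "crawl-delay"):
            problems.append(f"line {number}: unknown field '{field}'")
    return problems


sample = "Disallow: /tmp\nUser-agent: GPTBot\nDisallow: private\n"
for problem in lint_robots(sample):
    print(problem)
```

For the sample above, the sketch flags line 1 (a rule with no preceding user-agent) and line 3 (a path without a leading slash).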

Publish check

Check the robots.txt before uploading it

Generate rules to see the checks.

What this tool controls

Robots.txt tells crawlers which paths they should avoid. Some AI and search products publish their own user-agent names, so you can write separate rules for training, search, and user-requested fetches. It is public guidance, not access control.

Safer default

For most public sites, keep normal search crawlers open, block account or checkout paths, and decide separately whether training crawlers can read public pages. The default preset follows that pattern.
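
The pattern described above can be sketched as a short script that builds the file and checks it with Python's standard `urllib.robotparser`. The token list, the protected paths, and the function name `build_default_policy` are assumptions for this example; the preset's real output may differ, and crawler names should be verified against provider docs.

```python
from urllib.robotparser import RobotFileParser

TRAINING_TOKENS = ["GPTBot", "ClaudeBot"]      # assumed training crawlers
PROTECTED_PATHS = ["/account/", "/checkout/"]  # example protected paths


def build_default_policy() -> str:
    """Block training tokens everywhere; block protected paths for everyone else."""
    groups = [f"User-agent: {token}\nDisallow: /" for token in TRAINING_TOKENS]
    wildcard = ["User-agent: *"] + [f"Disallow: {p}" for p in PROTECTED_PATHS]
    groups.append("\n".join(wildcard))
    return "\n\n".join(groups) + "\n"


policy = build_default_policy()
parser = RobotFileParser()
parser.parse(policy.splitlines())

print(parser.can_fetch("GPTBot", "https://example.com/blog/"))                # False
print(parser.can_fetch("OAI-SearchBot", "https://example.com/blog/"))         # True
print(parser.can_fetch("OAI-SearchBot", "https://example.com/checkout/cart")) # False
```

One design point worth noting: a crawler that matches its own named group follows only that group, so protected paths listed under `User-agent: *` would need to be repeated in any group written for a specific search crawler.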

Before publishing

Crawler names and provider rules change. Check the official docs before replacing a production robots.txt file, especially on publisher, SaaS, or ecommerce sites.

Official crawler references

Use these provider pages to verify crawler names and policy behavior before changing production robots.txt.

OpenAI crawlers

Google crawlers

Anthropic crawler guidance

Perplexity robots.txt

Guides and examples

Practical AI crawler guides

Use these short guides when you need to review a policy before changing production robots.txt.

How to block GPTBot

Block training access while keeping search and user-requested fetches separate.

Open guide

Google-Extended vs Googlebot

Avoid blocking Google Search when you only want Gemini-related control.

Open guide

AI crawler user-agent list

Review common crawler tokens by training, search, and user-triggered use.

Open guide

Robots.txt examples

Compare common policies before generating and publishing your own file.

Open examples

FAQ

Should I block all AI crawlers?

Not always. A full block can stop training bots, but it can also block search or user-requested fetches. Use separate rules when you still want search traffic.
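
The difference between a full block and separate rules can be checked side by side. The sketch below uses Python's standard `urllib.robotparser`; the two policies, the helper `blocked_tokens`, and the grouping of tokens into training versus search are assumptions for illustration and should be confirmed against each provider's documentation.

```python
from urllib.robotparser import RobotFileParser

# Assumed example: every listed AI token is blocked, search crawlers included.
FULL_BLOCK = """\
User-agent: GPTBot
User-agent: OAI-SearchBot
User-agent: ClaudeBot
User-agent: PerplexityBot
Disallow: /
"""

# Assumed example: only the tokens treated here as training crawlers are blocked.
TRAINING_ONLY = """\
User-agent: GPTBot
User-agent: ClaudeBot
Disallow: /
"""

TOKENS = ["GPTBot", "OAI-SearchBot", "ClaudeBot", "PerplexityBot"]


def blocked_tokens(robots_txt, tokens, url="https://example.com/"):
    """Return the tokens that may not fetch url under robots_txt."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [token for token in tokens if not parser.can_fetch(token, url)]


print(blocked_tokens(FULL_BLOCK, TOKENS))
# ['GPTBot', 'OAI-SearchBot', 'ClaudeBot', 'PerplexityBot']
print(blocked_tokens(TRAINING_ONLY, TOKENS))
# ['GPTBot', 'ClaudeBot']
```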

Does Google-Extended affect Google Search rankings?

Google describes Google-Extended as a Gemini-related control token, not a Google Search ranking signal. Keep Googlebot open if you rely on organic search.
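
A minimal sketch of that separation, evaluated with Python's standard `urllib.robotparser`, is shown here. The rules are an assumed example, and the parser only simulates how the two tokens match groups; it says nothing about ranking behavior.

```python
from urllib.robotparser import RobotFileParser

# Assumed example policy: opt out via Google-Extended, leave Googlebot open.
ROBOTS_TXT = """\
User-agent: Google-Extended
Disallow: /

User-agent: Googlebot
Allow: /
"""

google_parser = RobotFileParser()
google_parser.parse(ROBOTS_TXT.splitlines())

page = "https://example.com/pricing"
print(google_parser.can_fetch("Google-Extended", page))  # False
print(google_parser.can_fetch("Googlebot", page))        # True
```

The two tokens are matched as separate groups, so the `Disallow` under `Google-Extended` does not carry over to `Googlebot`.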

Can robots.txt protect private data?

No. Sensitive data must be protected with authentication, authorization, and server-side access controls. Robots.txt is a crawl preference file.

Does this site upload my robots.txt?

No. The generator, validator, and publish checks run in your browser.