How to Block Google-Extended
Complete guide to blocking Google-Extended (Google) from crawling your website using robots.txt, server configuration, and Switch workflows.
Should You Block Google-Extended?
Google-Extended collects data for AI model training. Blocking it prevents your content from being used in Google's AI products without affecting your search visibility.
This is a common and recommended action for sites that want to control how their content is used in AI training.
Blocking Methods
1robots.txt
High for cooperative crawlersAdd a Disallow rule for Google-Extended's user-agent string in your robots.txt file. This is the standard, cooperative method that well-behaved crawlers respect.
2Server-side UA filtering
HighConfigure your web server (nginx, Apache, Cloudflare) to reject requests matching Google-Extended's user-agent patterns. This blocks at the network level before your application processes the request.
3Switch Journey Workflows
Highest — granular, real-time controlCreate a custom journey in Switch that detects Google-Extended and routes it to a block action, challenge, redirect, or modified content — without touching your server configuration.
robots.txt — Block Google-Extended
Add the following to your robots.txt file (at the root of your domain) to block Google-Extended:
User-agent: Google-Extended Disallow: /
robots.txt — Allow with Restrictions
Alternatively, allow Google-Extended on most pages while blocking specific directories:
User-agent: Google-Extended Disallow: /private/ Allow: /
Google-Extended User-Agent Strings
Use these patterns to identify Google-Extended in your server logs or firewall rules:
Frequently Asked Questions
Does blocking Google-Extended affect my Google search rankings?
No. Blocking Google-Extended does not affect your Google search rankings. Only blocking Googlebot impacts Google Search visibility.
Does Google-Extended respect robots.txt?
Yes, Google-Extended respects robots.txt directives. Adding a Disallow rule for its user-agent will prevent it from crawling blocked paths.
Can I allow Google-Extended on some pages but not others?
Yes. Use robots.txt to disallow specific directories, or use Switch journey workflows for granular page-level control with conditional logic.
Go beyond robots.txt
Switch detects Google-Extended in real-time and lets you build custom journey workflows — block, challenge, redirect, or serve modified content. No server changes required.
Get Started Free