Applebot-Extended
Apple's AI training token controlling how Applebot data is used for Apple Intelligence.
What is Applebot-Extended?
Applebot-Extended is not a separate crawler but a robots.txt token that controls how Apple uses content already crawled by Applebot for AI training purposes. Specifically, it governs whether your content feeds into Apple Intelligence, Siri improvements, and other machine learning features.
This is similar in concept to Google-Extended — it provides granular control over AI training without affecting standard search functionality. By disallowing Applebot-Extended while allowing Applebot, you maintain visibility in Siri and Spotlight while preventing your content from being used to train Apple's AI models.
As Apple Intelligence expands across iPhones, iPads, and Macs, the content governed by Applebot-Extended reaches a massive audience. Understanding this distinction is critical for any AI content governance strategy.
User-Agent Strings
These are the known user-agent patterns used by Applebot-Extended. Use them to identify this crawler in your server logs or configure robots.txt rules.
robots.txt example:
User-agent: Applebot-Extended Disallow: /private/ Allow: /
How to Manage Applebot-Extended
Block Applebot-Extended to prevent AI training while keeping Siri/Spotlight visibility.
This is a robots.txt token, not a separate crawler — it controls Applebot data usage.
Use Switch to monitor all Apple-related crawler activity in one view.
Important for sites with premium content that want to limit AI training use.
Start managing Applebot-Extended today
Switch detects, tracks, and lets you build custom journeys for Applebot-Extended and 35+ other AI agents and crawlers. Set up in five minutes.
Get Started FreeRelated Agents
Applebot
Search EnginesApple
Apple's search crawler powering Siri, Spotlight, and Safari Suggestions.
AI2Bot
Commercial CrawlersAllen AI
Allen Institute for AI's research crawler for academic AI development.
Amazonbot
Commercial CrawlersAmazon
Amazon's web crawler powering Alexa, Amazon search, and AI services.
Bytespider
Commercial CrawlersByteDance
ByteDance's web crawler for TikTok AI and LLM training data.
CCBot
Commercial CrawlersCommon Crawl
Common Crawl's open-source web archive used by multiple AI companies for training.
ClaudeBot
Commercial CrawlersAnthropic
Anthropic's web crawler collecting training data for Claude models.