# Siglieri: robots.txt
# Standard search, AI / answer-engine, and social-preview crawlers may access
# all public pages. Every agent is listed in one group so that the Disallow
# rules at the end of the group apply to each of them: a crawler that matches
# a named group does not also read the * group.

User-agent: *
# Search engines
User-agent: Googlebot
User-agent: Bingbot
User-agent: DuckDuckBot
# AI / answer-engine crawlers (citability for ChatGPT, Claude, Perplexity, Gemini, etc.)
User-agent: GPTBot
User-agent: ChatGPT-User
User-agent: OAI-SearchBot
User-agent: ClaudeBot
User-agent: Claude-Web
User-agent: anthropic-ai
User-agent: PerplexityBot
User-agent: Perplexity-User
User-agent: Google-Extended
User-agent: Applebot
User-agent: Applebot-Extended
User-agent: CCBot
User-agent: Meta-ExternalAgent
User-agent: Meta-ExternalFetcher
User-agent: Bytespider
# Cohere (Command R / Coral answer engine)
User-agent: cohere-ai
# Amazon / Alexa
User-agent: Amazonbot
# You.com
User-agent: YouBot
# Diffbot (structured web data, powers many LLM knowledge graphs)
User-agent: Diffbot
# AI2 / Allen Institute
User-agent: AI2Bot
# Timpi
User-agent: Timpibot
# iAsk.ai
User-agent: iaskspider
# Brave Search AI
User-agent: Brave-Search
# Social previews
User-agent: Twitterbot
User-agent: facebookexternalhit
User-agent: LinkedInBot
Allow: /
# Block authenticated / non-public surfaces
Disallow: /dashboard
Disallow: /session/
Disallow: /journal
Disallow: /api-keys
Disallow: /admin
Disallow: /reset-password
Disallow: /upgrade

# AI discovery files: canonical references for LLMs and answer engines
# LLM reference (plain text): https://siglieri.com/llms.txt
# LLM reference (HTML): https://siglieri.com/llm.html
# AI usage policy: https://siglieri.com/ai.txt
# Entity definition page: https://siglieri.com/what-is-siglieri.html
# Machine-readable entity facts: https://siglieri.com/siglieri-facts.json
# AI agent plugin manifest: https://siglieri.com/.well-known/ai-plugin.json
# Security contact: https://siglieri.com/.well-known/security.txt

Sitemap: https://siglieri.com/sitemap.xml