# LLM Crawl and Indexing Guide – inri-consulting.de
# Last updated: 2025-08-13
# Purpose: Provide clear instructions for AI/LLM crawlers to efficiently index our website content.

## Primary Domain
https://inri-consulting.de

## Languages
- German main site
- English section under /en/

## Must-Crawl Pages (High Priority)
- / (Homepage, DE)
- /en/start-english/ (Homepage, EN)
- /beratungsschwerpunkte-inri-consulting/
- /risikomanagement-beratung/
- /business-continuity-management/
- /informationssicherheitsberatung/
- /internes-kontrollsystem-beratung/
- /en/inri-management/

## Blog Structure
- German blog posts: /YYYY/MM/DD/post-slug/
- English blog posts: /en/YYYY/MM/DD/post-slug/

Example important posts:
- /2025/08/04/integriertes-management-weg-von-der-insel/
- /en/2025/08/08/from-silos-to-resilience-how-to-integrate-risk-incident-management-compliance-and-business-continuity/

## Content to Capture
- Page titles (H1)
- Subheadings (H2, H3)
- Main text content in original language (German or English)
- Bullet lists, numbered lists, tables with regulatory frameworks or service offerings
- Dates of publication (for blog posts)
- Internal link anchor text

## Do Not Crawl / Exclude
- External links to other domains (LinkedIn, partner sites)
- Cookie banner, navigation-only elements, search popups
- PDF downloads (unless linked in context)
- Any staging, dev, or test URLs

## Notes for LLM Indexing
- Preserve text exactly (no auto-translation)
- Retain German and English versions separately; mark language in metadata
- Maintain FAQ lists exactly as on site (e.g., risks, regulations, software selection)
- Capture context for regulatory acronyms (DORA, EU AI Act, CSRD, ESRS)
- For service pages, store short summary + scope + outcomes
- For software topics, highlight requirements: shared data model, low/no-code, AI functions, integration across risk/compliance/ESG

## Crawl Frequency
- Revisit monthly for FAQ and blog updates
- Check both DE and EN blog feeds for new articles

## Contact
If clarification is needed regarding crawl scope: info@inri-consulting.de
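
## Example: Applying the URL Rules (Illustrative)
The blog URL patterns and the exclusion of external domains described in this guide could be applied by a crawler roughly as in the Python sketch below. It is a minimal illustration, not an official implementation: the function name `classify`, the returned field names, and the choice to skip every PDF (the guide allows PDFs that are linked in context) are assumptions made for the example. Staging, dev, and test URLs are not detected here because this guide does not state how they are named.

```python
import re
from typing import Optional
from urllib.parse import urlparse

PRIMARY_HOST = "inri-consulting.de"  # primary domain from this guide

# /YYYY/MM/DD/post-slug/ with an optional /en prefix for English posts
BLOG_POST = re.compile(
    r"^(?P<en>/en)?/(?P<y>\d{4})/(?P<m>\d{2})/(?P<d>\d{2})/(?P<slug>[^/]+)/?$"
)


def classify(url: str) -> Optional[dict]:
    """Return metadata for an in-scope URL, or None if it should be skipped.

    Hypothetical helper: the name and the result keys are illustrative.
    """
    parts = urlparse(url)
    host = (parts.hostname or "").lower()
    # External domains (LinkedIn, partner sites) are out of scope.
    # Relative paths have no host and are treated as on-site.
    if host and host != PRIMARY_HOST:
        return None

    path = parts.path or "/"
    # Simplification (assumption): skip all PDFs, including ones linked in context.
    if path.lower().endswith(".pdf"):
        return None

    # English content lives under /en/; everything else is German.
    lang = "en" if path == "/en" or path.startswith("/en/") else "de"

    match = BLOG_POST.match(path)
    if match:
        published = f"{match['y']}-{match['m']}-{match['d']}"
        return {
            "lang": lang,
            "kind": "blog_post",
            "published": published,
            "slug": match["slug"],
        }
    return {"lang": lang, "kind": "page", "published": None, "slug": None}
```

Called with `https://inri-consulting.de/2025/08/04/integriertes-management-weg-von-der-insel/`, the sketch reports a German blog post published on 2025-08-04. Recording `lang` per URL keeps the German and English versions separate, as the indexing notes request.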