Skip to content

Agent Access

This site is intentionally agent-friendly. The published content is public reference material and one of the project goals is to make it easily consumable by AI assistants, MCP clients, search bots, and similar tooling.

This page documents the agent-facing surfaces, what they contain, what is excluded, and how they are maintained.

SurfaceURL patternPurpose
Curated index/llms.txtShort, hand-picked index of canonical entry points + pointers to bundles
Compact bundle/llms-small.txtHierarchy + summaries — for token-tight agents
Full bundle/llms-full.txtEntire site as a single Markdown blob — for one-shot RAG ingestion
Raw page source/<slug>.mdAny rendered page also served as raw Markdown
Crawler policy/robots.txtExplicit allow rules for known AI crawlers

All four are emitted as part of the standard build and deployed alongside the rendered HTML. Their absolute URLs at production are:

For preview builds the prefix is https://arda-cards.github.io/documentation/....

The curated index follows the llms.txt proposal. It contains:

  • A title and one-paragraph description of the site.
  • Pointers to the small and full bundles.
  • An Optional section listing about 20 hand-picked entry points — the top-level section indexes plus a few high-leverage deep references (architecture patterns, the information model, the authoring guide, the four core technology references).

The index is deliberately short. Agents that need more should follow it into the small or full bundles.

The agent surfaces deliberately exclude roadmap/**. Roadmap content is project-internal — designs, plans, analyses, and session logs that decay in relevance after the project ships. Agents would otherwise pick up superseded designs and cite them as authoritative.

Excluded:

  • roadmap/in-progress/<project>/...
  • roadmap/completed/<project>/...
  • roadmap/backlog/...
  • roadmap/ideas/...

The exclusion only affects the agent surfaces (/llms.txt, /llms-small.txt, /llms-full.txt, and the /<slug>.md raw endpoints don’t render for excluded paths). The human site (/roadmap/... HTML pages) remains fully browsable.

The robots.txt explicitly allows:

VendorTraining crawlerLive fetchSearch index
AnthropicClaudeBotClaude-UserClaude-SearchBot
OpenAIGPTBotOAI-SearchBot
GoogleGoogle-ExtendedGooglebot (default)

All other user-agents are also allowed (User-agent: * Allow: /). The site is reference material; we choose breadth over scarcity.

Two Starlight plugins handle generation; both are configured in astro.config.mjs.

  • starlight-llms-txt emits /llms.txt, /llms-small.txt, and /llms-full.txt. Configuration: project name, description, exclude patterns (roadmap/**), promote patterns (durable section roots), and an optionalLinks array driving the curated index.
  • starlight-dot-md emits a /<slug>.md route alongside every rendered page, returning the raw source.

robots.txt is a static file at public/robots.txt — Astro copies it through to dist/ unchanged.

When a section grows or shrinks, update the optionalLinks array in astro.config.mjs. Keep it short — fewer than 30 entries is the target. The full bundle covers the rest.

When a project completes and its content is durable enough to live outside roadmap/, promote the relevant pages into a permanent section (e.g. current-system/, domain/) before retiring the project’s roadmap directory. The agent surfaces will pick up the promoted content automatically.

If you are building an agent or MCP client that consumes this site:

  • Start at /llms.txt. It is short and tells you which bundles to fetch next.
  • For per-page reads, append .md to any URL. The raw markdown contains frontmatter (title, description, tags, domain, maturity, author) plus the unrendered body.
  • For bulk ingestion, /llms-full.txt is one ~10 MB markdown file with all included pages concatenated. Suitable for a single embedding pass.
  • For sparse retrieval, /llms-small.txt contains hierarchy and per-page descriptions only — useful when a small index needs to fit a tight context.