Sub-agent context gate (shared kernel)
Cross-skill rules for forking work to a sub-agent. Adopted from Anthropic's effective-context-engineering guidance, Tavily's published skills, and the cross-harness norms used by LangChain DeepAgents and the Skills standard.
Each skill names its own triggers. This file is the single source of truth for the things every skill agrees on.
Digest contract
The sub-agent returns roughly 2 KB or less: structured summary, citations, gaps. No raw bodies, no full file dumps, no copy-paste of fetched content. Skills name what the digest contains (claim table, orientation paragraph, root-cause summary, etc.) but never relax the size ceiling.
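A minimal sketch of how a parent might check the size ceiling before accepting a digest. The `Digest` shape, the field names, and the reading of "roughly 2 KB" as 2048 bytes of UTF-8 JSON are all assumptions made for illustration; this file does not prescribe a serialization.

```python
import json
from dataclasses import asdict, dataclass, field

# Assumption: "roughly 2 KB" is interpreted as 2048 bytes of UTF-8 JSON.
DIGEST_MAX_BYTES = 2048


@dataclass
class Digest:
    """Hypothetical digest shape: structured summary, citations, gaps."""

    summary: str
    citations: list[str] = field(default_factory=list)
    gaps: list[str] = field(default_factory=list)


def digest_size(digest: Digest) -> int:
    """Return the serialized size of the digest in bytes."""
    payload = json.dumps(asdict(digest), ensure_ascii=False)
    return len(payload.encode("utf-8"))


def within_ceiling(digest: Digest, limit: int = DIGEST_MAX_BYTES) -> bool:
    """True when the digest respects the size ceiling."""
    return digest_size(digest) <= limit
```

A parent could call `within_ceiling(digest)` on each returned digest and ask the sub-agent to condense anything that fails, rather than reading the oversized payload itself.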
Harness-agnostic sub-agent selection
"Small, fast sub-agent" means whatever cheap-tier or read-only worker the host harness exposes (e.g. an explore-style default, a lighter task-tier worker, or a separate context window). Skills do not name specific model tiers; the harness chooses.
What the parent never delegates
- Severity grading, final verdicts, approval gates.
- Dialogue, contradictions, handshakes, user-facing decisions.
- Writing the canonical artifact (report, spec, claim table): the sub-agent supplies the digest; the parent writes the doc.
What the sub-agent owns
- Bulk fetches, extracts, crawls, multi-source research.
- Many-file reads, dependency / caller graph traversals.
- Anything yielding mostly raw bodies that the parent will not read line by line. About 5K tokens of raw output is the industry rule of thumb for "fork it".
Parallelism
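The fork threshold in the list above can be sketched as a simple predicate. The 4-characters-per-token ratio is a rough assumption rather than a real tokenizer, and `should_fork` and its parameters are hypothetical names; each skill still names its own triggers.

```python
# Assumption: about 4 characters per token. This is a crude estimate,
# not a tokenizer; swap in a real token counter where one is available.
CHARS_PER_TOKEN = 4

# The "about 5K tokens of raw output" rule of thumb.
FORK_THRESHOLD_TOKENS = 5_000


def estimate_tokens(expected_chars: int) -> int:
    """Estimate token count from an expected character count."""
    return expected_chars // CHARS_PER_TOKEN


def should_fork(expected_chars: int, parent_reads_line_by_line: bool) -> bool:
    """Fork when the output is bulky and the parent will not read it closely."""
    if parent_reads_line_by_line:
        # Content the parent must read in full stays in the parent.
        return False
    return estimate_tokens(expected_chars) >= FORK_THRESHOLD_TOKENS
```

For example, a crawl expected to return about 40,000 characters estimates to 10,000 tokens and would be forked, while a 4,000-character file read would stay in the parent.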
When two or more heavy units of work are independent, spawn one small sub-agent per unit in parallel and merge their digests in the parent. One sub-agent doing five things sequentially is the wrong shape.
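The fan-out shape described above might look like the following sketch, using Python's `asyncio.gather`. `run_sub_agent` is a hypothetical stand-in for whatever spawn call the host harness exposes, and the newline join in `merge_digests` is only a placeholder for the parent's real merge step.

```python
import asyncio


async def run_sub_agent(unit: str) -> str:
    """Hypothetical stand-in for a harness call that returns one digest."""
    await asyncio.sleep(0)  # placeholder for the sub-agent's real work
    return f"digest for {unit}"


async def fan_out(units: list[str]) -> list[str]:
    """Spawn one sub-agent per independent unit and collect the digests.

    asyncio.gather runs the coroutines concurrently and returns results
    in the same order as the inputs.
    """
    return await asyncio.gather(*(run_sub_agent(unit) for unit in units))


def merge_digests(digests: list[str]) -> str:
    """Placeholder merge: the parent combines digests into one view."""
    return "\n".join(digests)


if __name__ == "__main__":
    results = asyncio.run(fan_out(["crawl docs", "trace callers"]))
    print(merge_digests(results))
```

The point of the shape is that each unit gets its own small worker and its own context; the parent only ever sees the merged digests.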