Entities & relationships for structured, relational reasoning.
The trick isn't storing it — it's injecting only what's relevant, every turn, without token bloat.
Delta injection: a memory recalled once isn't re-injected the same session.
Spoke 2 — repeatable behavior
Skills: fixed guides & evolving know-how
Memory stores facts. Skills store how to do something — markdown procedures, recalled by BM25.
📘 Subsystem guides
Authored, fixed. "How to use the email MCP," "how working memory works." Ship with the agent.
🌱 Mutable skills
The agent writes its own — captures a procedure that worked, then refines it over time.
The refine step happens offline — in the dream. →
Spoke 3 — autonomy
Tasks on a schedule
No user required. Cron-driven "patrol" tasks fire on their own — each with its own directive.
Same harness, same LLM — a different directive turns it into a different worker.
Spoke 4 — cost & resilience
LLM routing: tiers & fallback
The LLM is swappable — so RockBot routes work across cost/capability tiers, and falls back when a provider fails.
Balanced is the one requirement. Low & High are levers for cost and capability — and the fallback chain keeps the agent working when a provider is down or out of tokens.
Spoke 4 — cost & resilience
The orchestrator delegates down
The primary agent runs the conversation — and pushes each task to the lightest executor that can do it. Routine work drops to a cheap model, or no model at all.
Primary and subagents both reach for any tier. The savings come below: workers on the Low tier, wisps with no model at all.
Spoke 5 — self-learning
RockBot dreams
A background cycle that runs offline and refactors the agent's own knowledge — no user, no goals changed.
🧠 Consolidate memory
Merge duplicates, decay stale facts, mine anti-patterns from corrections.
🌱 Optimize skills
Refine, merge, and cluster the skills it has written.
🔀 Tune LLM routing
Learn which work belongs in which tier — Low, Balanced, or High — for the best cost/quality.
🪞 Infer preferences
Notice patterns across conversations; learn how you like to work.
Yesterday's experience makes tomorrow's agent sharper — automatically.
Governance — trust & isolation
Principle of least privilege
The core philosophy, restated: nothing trusts the LLM. The core agent holds no keys, no passwords.
A compromised prompt can't leak a secret the agent never had.
Governance — trust & isolation
Untrusted code runs sandboxed
RockBot can run Python and pull web pages — but never in-process.
🐍 Python execution
Runs in an ephemeral, low-privilege container. Spun up per run, destroyed after. No host access, no secrets, minimal blast radius.
🌐 Web search & fetch
Search the web and retrieve pages — but that content is untrusted input, handled with the same suspicion as the LLM's output.
Same pattern everywhere: assume the LLM (and the web) can be wrong or hostile — and contain it.