Summer Yue, Director of Alignment at Meta Superintelligence Labs, told her OpenClaw agent to "confirm before acting" on her email inbox. The agent deleted 200+ emails anyway. She sent stop commands from her phone. It kept going. She had to physically run to her Mac mini to kill the process.
Her quote is probably the best summary of the problem: "Nothing humbles you like telling your OpenClaw 'confirm before acting' and watching it speedrun deleting your inbox."
Why Her Safety Instruction Failed
The cause wasn't a bug. It was context window compaction, a normal part of how LLM agents manage memory in long sessions. When the context filled up, OpenClaw summarized older conversation history to make room. That summary quietly dropped her "confirm before acting" instruction. The agent didn't ignore her. It forgot her.
Any safety rule you put in a prompt can be compacted away. That's not a flaw in OpenClaw specifically. It's how large context windows work across every LLM agent.
What ClawBands Does Differently
ClawBands, released February 9, 2026, by Sandro Munda (CEO of RootCX), hooks into OpenClaw's before_tool_call plugin event. It sits in code, outside the context window. Compaction can't touch it.
Its creator calls it "sudo for your AI agent." Nothing happens without explicit permission.
The policy engine maps every tool call to one of three decisions. File reads get ALLOW. File writes and shell commands get ASK, meaning the agent pauses and waits for human approval. File deletes get DENY. Unknown tools default to ASK, which is the right call. Every decision gets logged to an append-only JSONL audit trail.
On Telegram and WhatsApp, the approval prompt shows up directly in your chat. You respond YES or NO, and the agent proceeds or stops. No terminal access required.
ClawBands vs SecureClaw
These two get compared, but they solve different problems. SecureClaw audits your OpenClaw instance before deployment, scanning for misconfigurations and injecting behavioral rules. ClawBands enforces during runtime, blocking actions in real time. You probably want both.
What This Means for You
If you're running OpenClaw through ClawHosters, you already get built-in monitoring and isolation. But for self-hosted instances, ClawBands is the most practical tool for preventing the exact failure that caught Meta's alignment director off guard. Our security hardening guide covers the full picture, and the safety scanner catches the pre-deployment side.