AI · 8 min read · May 2, 2026
Safe Bilevel Delegation: Runtime Safety Control for Multi-Agent LLM Systems
A formal framework that dynamically adjusts safety-efficiency trade-offs when delegating tasks to specialized AI sub-agents during execution.
SBD is a bilevel optimization framework that dynamically controls how much authority human operators retain when delegating tasks to specialized LLM sub-agents.
- — Outer meta-weight network learns context-dependent safety-efficiency weights during runtime.
- — Inner delegation policy optimizes task execution subject to probabilistic safety constraints.
- — Continuous delegation degree (0 to 1) interpolates between human override and full autonomy.
- — Three theoretical guarantees: safety monotonicity, policy convergence, and accountability propagation.
- — Tested on medical AI, financial risk, and educational supervision domains.
- — Addresses gap between design-time architecture selection and dynamic runtime adjustments.
- — Distributes responsibility across multi-hop delegation chains with provable per-agent ceilings.
Frequently asked
- Alpha is a continuous value between 0 and 1 that controls how much decision authority transfers to a sub-agent. At alpha=0, a human retains full override power. At alpha=1, the sub-agent executes autonomously. Values in between create a graduated trust model where the system adjusts alpha based on task context and safety constraints.