openclaw

OpenClaw’s Media-Fallback PR Shows Why Multi-Agent Delivery Needs Receipts, Not Hope

Anatoliy Kolodkin

19 May 2026 • 4 min read

Agent platforms love to say a task is “complete.” PR #84371 is a useful reminder that complete is not a boolean. A generated image can exist on disk, the media task can have finished successfully, the requester-agent handoff can have fired, and the human can still receive nothing useful in Discord, Telegram, Slack, or wherever the request actually started. That gap is where optimistic agent UX goes to die.

The OpenClaw pull request targets a specific failure mode across image, video, and music generation: generated media completes, but the normal requester-agent or message-tool delivery path fails to attach or deliver the artifact. The proposed patch keeps media tasks active until completion delivery is confirmed, adds duplicate and recent-completion guards, and introduces a direct channel fallback when the usual active wake plus message-tool path misses the artifact. That sounds like plumbing because it is. It is also the product.

The PR was opened on May 20 at 00:17 UTC and is correctly not being treated as a rubber-stamp fix. It carries P1 labels, proof supplied, proof sufficient, and risk labels around compatibility, session state, and message delivery. ClawSweeper accepted the proof quality but flagged a compatibility break: the patch can return an active image task for a different prompt, while current behavior and docs allow distinct image prompts to start separately. That is not a bikeshed. It is the difference between idempotency and accidentally dropping new user intent.

Completion is not delivery

The strongest part of the PR is the mental model: a media task should not become terminal merely because the generator finished. In a real agent system, there are at least four receipts worth tracking. First, generation completed. Second, delivery was attempted. Third, the platform accepted the message or attachment. Fourth, the transcript or audit mirror committed what happened. If any of those receipts is missing, the system may be operationally “done” while being experientially broken.

The proof attached to the PR is unusually concrete. The author tested against a local checkout reporting OpenClaw 2026.5.19, forced the active requester wake to return queued:false with source_reply_delivery_mode_mismatch, forced the requester-agent handoff to return no message-tool or media-delivery evidence, and then allowed the fallback to call the real channel sender. The direct fallback reported delivered: true, path: direct, phase: direct-primary, and phaseDelivered: true.

More importantly, the proof did not stop at “sendMessage returned ok.” Live Discord readback showed the bot message with the content “The generated image is ready,” one attachment, filename PR84371-MEDIA-FALLBACK-fe92ef2-1779237455145.png, content type image/png, and a timestamp of 2026-05-20T00:37:39.608Z. The transcript mirror also committed an assistant message under provider openclaw, model delivery-mirror, with an idempotency key ending in generated-media-direct. That is the kind of receipt chain agent platforms should normalize: channel proof plus internal audit proof.

The existing failure category is familiar if you have operated multi-agent systems for more than a demo. Work happens in one lane. Notification happens in another. The parent session is busy. The channel expects media as an attachment, but the handoff path emits text. A retry sees something “recent” and suppresses itself. A completion event gets accepted by some internal queue, but the human-visible artifact never lands. Every individual component can claim it behaved reasonably. The user still gets a ghost task.

Duplicate guards need a scalpel, not a broom

The compatibility warning is the part practitioners should not skip. Duplicate protection is necessary because media generation can cost money, burn quota, and spam channels. But the guard must distinguish the same request from a different request. “Do not deliver the same completed image twice” is good idempotency. “Do not start a second image because an unrelated image task is active” is accidental single-flight behavior. The first protects users. The second discards intent while looking efficient in logs.

This is where many agent runtimes blur concepts. They store “active task” as a broad state and later use it as if it were a dedupe key. A proper dedupe key should include the requester route, prompt or normalized intent, media kind, task id, and maybe model/provider identity. Recent-completion suppression should be even narrower: suppress the same completion event or resend only if the platform can prove the user already received the artifact. Otherwise, the safe fallback is not silence; it is an explicit “I already have one media job running for X; do you want to queue Y?”

The PR also connects to a broader OpenClaw pattern. Issue #83577 showed subagent completions could be silently dropped when collect-mode batching got poisoned by unresolved delivery origins. PR #83734 dealt with Control UI tool-card rendering for externally started runs. Different surfaces, same contract: a child process, background task, or external runtime completing work is not enough. The completion has to reach the relevant human or orchestrator, in the right channel, with enough evidence to audit later.

For engineers building on OpenClaw, the immediate checklist is boring and high leverage. Test the exact channel you use, not a happy-path local mock. Force the primary delivery route to fail. Confirm exactly one fallback fires. Verify the attachment in the target platform, not just in the runtime. Check the transcript mirror. Inspect idempotency keys. Then repeat while the parent session is active, while the channel is rate-limited, and while a user sends another message mid-run.

The editorial take is simple: agent systems do not need more optimistic “completed” flags. They need durable receipts that prove the thing reached the human who asked for it. PR #84371 may still need compatibility work, but the direction is right. In agent infrastructure, delivery is not a courtesy notification after the real work. Delivery is part of the work.

Sources: OpenClaw PR #84371, OpenClaw issue #83577, OpenClaw PR #83734, OpenClaw v2026.5.19-alpha.1 release

Completion is not delivery

Duplicate guards need a scalpel, not a broom

Sign up for more like this.