Multiplayer: local debugging agent for Claude Code
Multiplayer captures complete, unsampled traces alongside Claude Code so code agents fix real bugs, not partial symptoms.
There is a silent problem that anyone who has run Claude Code against a production codebase knows well: the agent generates a PR that looks correct, passes the tests, gets merged, and fails in production. Not because the agent is incompetent, but because it was working with incomplete data. Traditional observability tools sample, aggregate, and truncate context at service boundaries. What reaches the agent is a blurry photograph of what happened, not what actually happened.
That is precisely the starting point described by the team behind Multiplayer, unveiled this week on Hacker News under the thread Show HN: Multiplayer, a debugging agent to run locally next to your coding agent. Their argument is straightforward: if the code agent inherits the limitations of the existing observability stack, its fixes will inherit those same limitations.
What Multiplayer does
Multiplayer is a debugging agent that runs locally alongside Claude Code. Rather than connecting to an external observability platform, it captures complete, unsampled traces across the entire stack on its own: user actions in the frontend, backend traces, logs, headers, and full request and response content. In other words, exactly what most observability tools discard or never collect in the first place.
The design makes an interesting storage decision: Multiplayer only persists data when something fails. It does not record the continuous flow of a normal session, which avoids the cost of storing everything the system produces at every moment. Only when a problem is detected is the complete session surrounding it saved.
Before that context reaches the code agent, Multiplayer deduplicates locally. If the same bug appears in a hundred different sessions, the agent receives a single issue, a single prompt, and proposes a single PR. The stated goal is for Claude Code to work from a correlated and complete picture of what broke, not a partial reconstructed signal.
Why complete context matters
The phrase the team uses to describe the result of working with incomplete context is "PR slop": pull requests that seem plausible and fail in production. It is a known problem among teams that have enthusiastically adopted code agents. It is not a model failure, it is a failure of the information it receives.
When an agent only sees aggregated metrics or traces sampled at 1%, it might infer there is latency in a service, but it cannot see what specific payload arrived at the moment of failure, what header was missing, or what sequence of frontend events preceded the error. With that incomplete information, the agent writes code that solves the visible symptom, not the underlying cause.
Multiplayer points directly at that gap: if the code agent sees what a senior engineer would see with access to unsampled traces, the fixes should be more precise and less prone to introducing regressions.
Who it makes sense for
The clearest fit is backend or full-stack teams already using Claude Code in bug-fixing workflows who have run into the limitation described: the agent proposes reasonable fixes that do not reach the root cause. Also teams with microservices where context is lost at service boundaries, precisely because standard distributed traces do not always capture the complete payload.
In its current state, integration is available with Claude Code. The team mentions upcoming support for Codex, Copilot, and Cursor, though without a concrete timeline.
Local execution is relevant for environments with privacy requirements or where sending production data to external services is not viable. The fact that deduplication and capture happen locally before anything reaches the agent is a detail some teams will value more than others.
Our take
Multiplayer addresses a real problem and does so with a coherent technical approach: better context produces better fixes. It remains to be seen whether local, unsampled capture scales well in high-load systems and how deduplication behaves with bugs that manifest slightly differently in each session, but the direction is sound.
Sources
Read next
COOCON joins AAIF to connect payments and MCP in AI agents
South Korean fintech COOCON is joining the global AAIF foundation to integrate payments and data business based on MCP within the AI agents ecosystem.
Webull lanza un servidor MCP para trading con IA
El bróker Webull integra el Model Context Protocol de Anthropic para que agentes de IA accedan a datos de mercado en tiempo real desde sus flujos de trabajo.
Vera: AI-Powered Smart Contract Audits Without Third Parties
Vera is an open-source tool that audits smart contracts using AI autonomously, eliminating the need for external audit firms or manual review processes.