ResearchInfra6 min read

Into:Ninja

King-of-the-hill for coding agents

By vaNlabs ResearchJune 28, 2026View as Markdown

Priceτ0.00863

Market cap38.1k τ

Standingi78/ 100

Buildi95/ 100

Unique holders2.06k

Emission+0.76%

Net flow453.9 τ

As of · Jun 29, 21:12 UTC

Subnet 66 turns software engineering into a tournament: coding agents fight to fix real GitHub bugs, and only the reigning champion gets paid.

What is Ninja

Ninja (Subnet 66) runs a continuous contest between AI coding agents. Miners build small programs that read a real software bug and write a patch to fix it, and the subnet pits those programs against each other to find the one that fixes problems best. The single strongest agent at any moment is the "king" and keeps the title until a challenger beats it.

The simple version: It's like a chess ladder, except the players are AI programmers and every match is a real GitHub bug that needs fixing.

Centralized equivalent: Think the SWE-bench leaderboard, or a coding assistant like Cursor or Devin, but run as an open, always-on competition where the winning agent is decided on-chain instead of by a vendor's private scoreboard.

How it works:

Miners build a solver agent: a small Python program that takes a code repository plus an issue description and returns a patch.
Validators generate tasks from real commits, run each agent inside a sandboxed Docker container, and score its patch against the known-good fix using a changed-line similarity check and an LLM judge.

Why This Matters

The problem it solves:

Full Analysis

Category: Code Generation and Development Tools | Centralized Competitor: SWE-bench leaderboards, Cursor, Devin

AI coding agents have moved from autocomplete to systems that open pull requests, and benchmarks like SWE-bench have become the scoreboard everyone cites. Ninja's bet is that the scoreboard itself should be live, adversarial, and decentralized rather than a static dataset run once and published.

Mechanism:

Miners write a solver: a Python agent.py that exposes a solve(repo_path, issue, model, api_base, api_key) function and returns a patch. In production, miners do not open public pull requests. They submit the agent privately to the validator's API, signed by their . Before a submission is queued, the validator runs three checks: Agent Smoke (it must compile and pass a pyflakes lint), a Submission Scope Guard (it cannot break the solve contract or hardcode its own model, provider, or sampling settings), and an OpenRouter Submission Judge that reviews the diff. The validator supplies the model, endpoint, and key, so agents compete on prompting and tool-use strategy rather than on who can afford the bigger model.

Scoring is king-of-the-hill. The validator keeps two fixed pools of 50 tasks each, mined from real commits. A challenger first duels the current king on the primary pool, and if it wins it has to win again on a separate retest pool before it can take the crown. That two-pool rule guards against an agent that got lucky on a single task set. When a challenger is promoted, the validator publishes the winning agent.py to the public base repo and assigns all of its weight to the winning hotkey on the next weight-setting . Reward concentrates on whoever currently holds the title.

The infrastructure is specific. According to the repository, solver, task-generation, and evaluation traffic runs on a self-hosted Qwen3-32B endpoint, while the diff judge and the submission judge run z-ai/glm-5.2 through OpenRouter at temperature 0. Crowned kings are also benchmarked against external suites, SWE-bench Verified, SWE-rebench, DeepSWE, and Terminal-Bench, measured against a fixed baseline agent. That external benchmarking is where the "distilling software agents" framing comes from: the tournament is meant to surface and harden a single best agent rather than reward a crowd.

One note on identity. The article we published for netuid 66 in April covered a project called Tau. The slot was re-registered on 2026-04-27 and now carries the on-chain identity "ninja" at ninja66.ai, built on the same unarbos/tau codebase. That development repo, created on 2026-03-31, has 324 commits on its main branch from 8 contributors, with the most recent on 2026-06-24 (verified by a live GitHub check). On-chain at , the subnet holds roughly 0.75% of network , with an alpha price of 0.00903 TAO, a near 46,581 TAO, and a pool depth around 11,377 TAO. Price was up about 13% over the prior week and down about 12% over the prior month, and the subnet showed 5 active miners.

//What is Ninja

//Why This Matters

//What is Ninja

//Why This Matters

//Full Analysis

//Risk Factors

What is Ninja

Why This Matters

What is Ninja

Why This Matters

Full Analysis

Risk Factors