ResearchSubnet 0675 min read

Into:Harnyx

Better harnesses compound faster than better models.

By vaNlabs ResearchMay 12, 2026View as Markdown

Priceτ0.01040

Market cap4.9k τ

Momentum41/ 100

Unique holders910

Emission+1.32%

Net flow 7d-244.6 τ

As of · Jun 4, 10:37 UTC

A that turns deep research into a competitive harness, with submitting Python agents that get sandboxed, scored against reference answers, and replaced only when a challenger is materially better.

What is Harnyx

Harnyx (SN67) is a Bittensor subnet for deep research. Miners write Python agents that answer research-style questions under a tight tool budget, run those agents in sandboxes against benchmark tasks, and the network keeps whichever agent is currently best as the "champion."

The simple version: It's like a continuous coding competition for research bots, where the winner stays on the throne until a challenger clearly does better.

Centralized equivalent: Think OpenAI Deep Research or Perplexity Pro, except the actual research workflow is built and improved by anyone willing to compete, and every run can be inspected.

How it works:

Miners submit Python agent scripts that take a query and return a research answer, while staying inside a tool budget
Validators execute those scripts in sandboxed containers against a stream of tasks and score the outputs against a stronger reference answer

Why This Matters

The problem it solves: Deep research is not one reasoning step. It is decomposition, retrieval, ranking, cross-checking, and synthesis. Closed product teams iterate on those harnesses behind a wall, and the wider field cannot tell what is actually working.
The opportunity: If the harness around a model matters as much as the model itself, an open arena where harnesses are continuously stress-tested is a real lever on research quality, not a cosmetic one.
The Bittensor advantage: Sandboxed execution, a public benchmark, and on-chain make the competition both legible and economically motivated. Champion replacement rules push for genuine improvement, not noise.
Traction signals: The repo went live in March 2026 and shows daily commits, with the most recent push on May 12, 2026. Live benchmark history is published at dashboard.harnyx.ai/benchmark, and miner and validator guides are in the repo.

Keep exploring

Other research from the same neighborhood of the network.

ResearchSubnet 0675 min read

Into:Harnyx

Better harnesses compound faster than better models.

By vaNlabs ResearchMay 12, 2026View as Markdown

Priceτ0.01040

Market cap4.9k τ

Momentum41/ 100

Unique holders910

Emission+1.32%

Net flow 7d-244.6 τ

As of · Jun 4, 10:37 UTC

A that turns deep research into a competitive harness, with submitting Python agents that get sandboxed, scored against reference answers, and replaced only when a challenger is materially better.

What is Harnyx

The simple version: It's like a continuous coding competition for research bots, where the winner stays on the throne until a challenger clearly does better.

Centralized equivalent: Think OpenAI Deep Research or Perplexity Pro, except the actual research workflow is built and improved by anyone willing to compete, and every run can be inspected.

How it works:

Miners submit Python agent scripts that take a query and return a research answer, while staying inside a tool budget
Validators execute those scripts in sandboxed containers against a stream of tasks and score the outputs against a stronger reference answer

Why This Matters

The problem it solves: Deep research is not one reasoning step. It is decomposition, retrieval, ranking, cross-checking, and synthesis. Closed product teams iterate on those harnesses behind a wall, and the wider field cannot tell what is actually working.
The opportunity: If the harness around a model matters as much as the model itself, an open arena where harnesses are continuously stress-tested is a real lever on research quality, not a cosmetic one.
The Bittensor advantage: Sandboxed execution, a public benchmark, and on-chain make the competition both legible and economically motivated. Champion replacement rules push for genuine improvement, not noise.
Traction signals: The repo went live in March 2026 and shows daily commits, with the most recent push on May 12, 2026. Live benchmark history is published at dashboard.harnyx.ai/benchmark, and miner and validator guides are in the repo.

Keep exploring

Other research from the same neighborhood of the network.

Full Analysis

Category: Search and Information Retrieval | Centralized Competitor: OpenAI Deep Research, Perplexity Pro, Exa

Deep research as a product category took off in 2025, and most of the interesting work happens inside closed labs. Harnyx is one of the first attempts on Bittensor to make the research harness itself the asset, with the model layer treated as one ingredient among many. The thesis is that compounding on harness quality, decomposition, retrieval shape, ranking, cross-checking, beats compounding on raw model size for this task.

Mechanism:

A task is one research-style query plus a stronger reference answer that the platform generates with a more expensive model than miners are allowed to spend on. The platform mixes factual recall, explanation, comparison, and synthesis so memorized outputs do not win.

Miners implement a `query` entrypoint. Validators receive miner-task batches from the platform, run each script-on-task combination inside a sandbox, and score the response. According to the repo, scoring is `total_score = comparison_score`, where `comparison_score` is a pairwise judge against the reference answer, run twice with the order swapped to dampen position bias. Ties go to the run with lower total tool cost. Validators report scored runs back to the platform and submit weights on-chain.

Champion selection is the part worth understanding. The platform keeps an incumbent champion and walks through challengers in batch order. A challenger only takes the throne if it beats the incumbent by a sufficient score margin, or is effectively non-regressing while being materially better on runtime or cost. Inside the tolerance band, small score differences do not trigger a swap. The highest score in the batch is not automatically the new champion.

That design choice flows directly into emissions. Total miner weight is capped at `0.20 * latest champion batch score`, and owner uid 0 receives the remainder, which burns. Today that produces a roughly 80% miner emission burn, by construction, not by neglect. Until a champion materially improves the benchmark, most of the miner share is destroyed rather than paid out. If no champion has been selected for a round, miner emission burns entirely.

The economics, then, are deliberately tight. With about 1,362 in the pool, a around 5,628 TAO, and roughly 1.53% of network emissions on a smoothed basis, the subnet is small but has been moving: roughly +22% on price over seven days and +34% over thirty, with about 272 TAO of net inflow over the last week. Active miner count is currently 1, which is consistent with a champion-driven design where one strong agent dominates until it gets dethroned.

//What is Harnyx

//Why This Matters

//What is Harnyx

//Why This Matters

//Full Analysis

//Risk Factors

What is Harnyx

Why This Matters

What is Harnyx

Why This Matters

Full Analysis

Risk Factors