Into:Harnyx
Better harnesses compound faster than better models.
As of Jun 4, 10:37 UTC
A Bittensor subnet that turns deep research into a competitive harness, with miners submitting Python agents that get sandboxed, scored against reference answers, and replaced only when a challenger is materially better.
What is Harnyx
Harnyx (SN67) is a Bittensor subnet for deep research. Miners write Python agents that answer research-style questions under a tight tool budget, validators run those agents in sandboxes against benchmark tasks, and the network keeps whichever agent is currently best as the "champion."
The simple version: It's like a continuous coding competition for research bots, where the winner stays on the throne until a challenger clearly does better.
Centralized equivalent: Think OpenAI Deep Research or Perplexity Pro, except the actual research workflow is built and improved by anyone willing to compete, and every run can be inspected.
How it works:
- Miners submit Python agent scripts that take a query and return a research answer, while staying inside a tool budget
- Validators execute those scripts in sandboxed containers against a stream of tasks and score the outputs against a stronger reference answer
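The two steps above can be sketched in Python. This is a minimal illustration, not Harnyx's actual interface: the names `ToolBudget`, `BudgetExceeded`, `fake_search`, `example_agent`, and `score_answer` are hypothetical, and the token-overlap score is a toy stand-in for whatever scoring the validators really use.

```python
from dataclasses import dataclass
from typing import Callable


class BudgetExceeded(RuntimeError):
    """Raised when an agent tries to make more tool calls than allowed."""


@dataclass
class ToolBudget:
    """Hypothetical budget: counts tool calls and refuses once the cap is hit."""
    max_calls: int
    used: int = 0

    def call(self, tool: Callable[[str], str], arg: str) -> str:
        if self.used >= self.max_calls:
            raise BudgetExceeded(f"tool budget of {self.max_calls} calls exhausted")
        self.used += 1
        return tool(arg)


def fake_search(query: str) -> str:
    """Stand-in for a real search tool; returns canned text."""
    return f"notes about {query}"


def example_agent(query: str, budget: ToolBudget) -> str:
    """Hypothetical miner agent: decompose, retrieve within budget, synthesize."""
    sub_questions = [part.strip() for part in query.split(" and ") if part.strip()]
    findings = []
    for sub in sub_questions:
        try:
            findings.append(budget.call(fake_search, sub))
        except BudgetExceeded:
            break  # budget spent: stop retrieving and answer with what we have
    return " ".join(findings)


def score_answer(answer: str, reference: str) -> float:
    """Toy validator score: fraction of reference tokens present in the answer."""
    ref_tokens = set(reference.lower().split())
    if not ref_tokens:
        return 0.0
    ans_tokens = set(answer.lower().split())
    return len(ref_tokens & ans_tokens) / len(ref_tokens)
```

With `ToolBudget(max_calls=1)`, the agent can research only the first sub-question of a two-part query, which is the kind of trade-off a tight tool budget forces on the harness design.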
Why This Matters
- The problem it solves: Deep research is not one reasoning step. It is decomposition, retrieval, ranking, cross-checking, and synthesis. Closed product teams iterate on those harnesses behind a wall, and the wider field cannot tell what is actually working.
- The opportunity: If the harness around a model matters as much as the model itself, an open arena where harnesses are continuously stress-tested is a real lever on research quality, not a cosmetic one.
- The Bittensor advantage: Sandboxed execution, a public benchmark, and on-chain incentives make the competition both legible and economically motivated. Champion replacement rules push for genuine improvement, not noise.
- Traction signals: The repo went live in March 2026 and shows daily commits, with the most recent push on May 12, 2026. Live benchmark history is published at dashboard.harnyx.ai/benchmark, and miner and validator guides are in the repo.
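The champion replacement idea mentioned above can be illustrated with a simple margin rule. This is a sketch under stated assumptions: the function name `should_replace_champion`, the comparison of mean scores over shared tasks, and the 0.05 margin are all hypothetical and are not taken from Harnyx's rules, which define their own criteria for "materially better."

```python
from statistics import mean
from typing import Sequence


def should_replace_champion(
    champion_scores: Sequence[float],
    challenger_scores: Sequence[float],
    margin: float = 0.05,  # hypothetical threshold, not Harnyx's actual value
) -> bool:
    """Return True only if the challenger beats the champion by at least `margin`.

    Both sequences hold per-task scores on the same tasks, in the same order.
    """
    if not champion_scores or len(champion_scores) != len(challenger_scores):
        raise ValueError("score lists must be non-empty and the same length")
    return mean(challenger_scores) - mean(champion_scores) >= margin
```

Under this rule, a challenger that averages 0.52 against a champion at 0.50 does not take over, while one that averages 0.70 does. Requiring a margin is one way to keep run-to-run noise from churning the champion.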