Into:Hone
Reasoning benchmarks, executed in a sandbox.
As of · Jun 4, 10:37 UTC
Hone is an ARC-AGI-2 reasoning benchmark on Bittensor. do not run solvers directly during evaluation, they point to a git repo containing their solution. clone the repo, run the solver in a secure GPU sandbox, and score miners by exact match rate on ARC problems. Only the top 5 miners above a 20% accuracy floor receive rewards.
What is Hone
Hone is a Bittensor subnet built around running the ARC-AGI-2 reasoning benchmark. The key idea is that miners provide solutions in git repositories rather than running solvers directly at query time. Validators clone a miners repo, build it inside a secure GPU sandbox, and execute evaluation tasks with network access during prep and no network access during inference.
The simple version: Like a Kaggle competition where you submit code that our judges execute in a clean box.
Centralized equivalent: Multi-party ML evaluation services like benchmarks with reproducible leaderboards.
How it works:
- Miners expose an HTTP endpoint that points to their solution repository.
- Validators query miner info, submit jobs to a Sandbox Runner service, which clones repos, builds container images, and runs ARC problems with controlled environment.
- Scoring uses exact match rate, with a 20% accuracy floor and exponential decay rewards for the top 5 above the floor.
Why This Matters
Other research from the same neighborhood of the network.