# Into: ChronoLLM

A subnet that pays miners to build language models with amnesia: each one is trained only on data up to a single past year, so it cannot cheat by knowing what happened next.

// One model per year, no peeking ahead

---

> New to Bittensor? Start here. Experienced users can skip to the full analysis.

### What is ChronoLLM?

ChronoLLM (Subnet 38) is a Bittensor subnet where miners compete to train language models that are "chronologically consistent." Each model is allowed to learn only from information available up to a specific past year, so a 2018 model knows nothing about 2019 or later. The subnet is run by the team behind CrunchDAO, the data-science competition platform (contact listed on-chain as crew@crunchdao.com).

**The simple version:** It is like training a historian sealed off from the future. Ask the 2018 model about 2018 and it answers using only what was known by the end of that year, with no hindsight about what came after.

**Centralized equivalent:** No clean direct equivalent. The closest idea is the point-in-time, time-aware models quant researchers build to avoid lookahead bias. The competitive tournament structure echoes data-science contest platforms like Kaggle or Numerai, but applied to time-sealed models.

**How it works:**
- **Miners** train a collection of models, one per year, each only on data available up to that year's cutoff, then upload them to HuggingFace and submit on-chain.
- **Validators** run inside a Trusted Execution Environment (a secure hardware enclave) and score each model in two stages: first checking it has not leaked knowledge from after its cutoff year, then judging answer quality in a round-robin tournament refereed by an LLM judge.

---

### Why This Matters

- **The problem it solves:** Lookahead bias. Standard LLMs are trained on data from every era at once, so when they are used for financial backtesting or historical analysis they already "know" what happened next. A model that has read about the 2020 crash cannot honestly simulate a 2015 analyst. ChronoLLM aims to produce models that are sealed to a point in time.
- **The opportunity:** Time-sealed models are useful anywhere you need to reconstruct what was knowable at a past moment: backtesting trading strategies without hindsight contamination, studying how narratives evolved, or evaluating old forecasts fairly.
- **The Bittensor advantage:** Building one trustworthy model per year across more than a decade is a lot of training work. A competitive subnet crowdsources that effort and pays for the best result, while the secure enclave keeps the test set hidden so miners cannot quietly overfit to it.
- **Traction signals:** Early. The repository's initial commit is dated May 15, 2026, the README states the subnet is "currently in testing phase on mainnet," and TaoSwap reported no active miners at the time of this snapshot. Net TAO staking flow over the past week was positive.

---

## Full Analysis

**Category:** Distributed Training | **Centralized Competitor:** No direct equivalent; ML contest platforms (Kaggle, Numerai) for structure

ChronoLLM comes out of CrunchDAO, whose background is running quantitative data-science competitions. That lineage shows in the problem it picks: lookahead bias is a well-known headache in quantitative finance, where a backtest is only honest if the model knew nothing about the future at the simulated moment. The subnet turns "build a model that genuinely cannot see ahead" into a paid, verifiable competition.

**Mechanism:**

Per the project's repository, miners train one language model per year (the README lists a range starting at 2013, and a recent commit caps evaluation at 2024). Each model must use the project's ChronoGPT architecture, be under 2 billion parameters, and ship in safetensors format. Miners upload finished models to HuggingFace and register them on-chain.

Validators do the grading inside a Trusted Execution Environment running on Phala Cloud with Intel TDX hardware. The README gives two reasons for this: the evaluation dataset must stay private so miners cannot overfit to the test, and remote attestation lets anyone verify the validator is running the correct, unmodified code. Scoring runs in two stages. Stage one checks every model for chronological consistency, that it has not absorbed knowledge from after its cutoff year. The top ten miners advance to stage two, where their models answer open-ended questions and an LLM judge compares every pair in a round-robin tournament; the win rate becomes a quality score. The final score combines the two as 0.7 times the consistency score plus 0.3 times the quality win rate. The design is winner-takes-all: only the single highest final score earns emissions in a round, and rounds run weekly, starting each Monday at 12:00 UTC. ChronoLLM cites the ChronoGPT paper (arXiv:2510.11677) and publishes reference models under the manelalab HuggingFace account.

On the market side, the snapshot reads as a young subnet finding its feet. The alpha token traded around 0.02040 TAO with a market cap near 25,426 TAO and a root pool of roughly 4,834 TAO. The 30-day price change was about +108% into this snapshot, with the past week up roughly +46%. Emission share sat near 7.18% with a smoothed (EMA) figure around 4.53%. Consistent with the testing phase the repository describes, the miner portion of emissions was being burned at the snapshot and no active miners were reported yet.

---

### Risk Factors

- **Very early and unproven:** The repository's initial commit is from May 2026, the README itself flags a testing phase, and development sits with two contributors. The mechanism is documented but has not been demonstrated at scale.
- **Winner-takes-all rewards:** By design only the single top miner earns each weekly round. That is a sharp incentive to be best, but it concentrates rewards and may discourage smaller or newer miners from competing.
- **Concentration:** The top-100 gini sits around 0.72, indicating concentrated ownership and stake distribution. Large positions could swing pool dynamics.
- **Enclave dependency:** The anti-gaming guarantee rests on the Phala and Intel TDX execution environment and its remote attestation. The integrity of scoring is only as strong as that enclave and the off-chain backend it talks to.

---

Into the next one.