Chronos
Tokenises time-series values into a fixed vocabulary and trains transformer language models on them with cross-entropy loss. Two generations on the leaderboard: Chronos-Bolt (patch-based encoder-decoder, ~250× faster than the original Chronos) and Chronos-2 (encoder-only, supports univariate, multivariate, and covariate-informed forecasting in one model).
Chronos reframes probabilistic time-series forecasting as language modelling: continuous values are scaled and quantised into a fixed vocabulary of tokens, after which an off-the-shelf transformer (originally based on T5) is trained with cross-entropy loss over next-token prediction. The original Chronos family was pretrained on a large open corpus complemented by Gaussian-process synthetic data and ranged from 20M to 710M parameters.
Chronos-Bolt is the second-generation patch-based variant: instead of one token per timestep, contiguous patches of observations are encoded together, yielding ~250× faster inference at comparable accuracy. The four Bolt sizes (Tiny → Base) trade speed for capacity along a fairly clean Pareto curve.
Chronos-2 (Oct 2025) drops the encoder-decoder structure for an encoder-only design and unifies univariate, multivariate, and covariate-informed forecasting in a single 120M-parameter checkpoint.
Versions on TS-Arena
Each version below corresponds to one registered model id in the leaderboard. Click through to its detail page for per-model rankings, forecasts, and history.
- Chronos-Bolt Tinychronos-bolt-tiny9M params…
Smallest Bolt variant. Encoder-decoder, patch-based.
- Chronos-Bolt Minichronos-bolt-mini21M params…
- Chronos-Bolt Smallchronos-bolt-small48M params…
- Chronos-Bolt Basechronos-bolt-base205M params…
Largest Bolt variant; strong zero-shot at modest cost.
- Chronos-2chronos-2120M params…
Next-generation encoder-only model. Adds in-context covariates and multivariate forecasting.