Backtesting Archive

Offline evaluation snapshots for fast model benchmarking without running live challenges.

The archive provides quarterly snapshots of past challenges, including input context, ground truth, and the pre-registered forecasts from all participating models. This lets you evaluate a new model offline and report results in a paper, e.g. "evaluated on TS-Arena Archive Q1 2026".
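As a rough illustration of what offline evaluation against a snapshot might look like, the sketch below scores a new model's forecast against the ground truth and ranks it alongside archived forecasts. Everything concrete here is an assumption made for illustration: the values are toy data, the model names are hypothetical, the in-memory layout is not the archive's actual schema, and mean absolute error is simply a common choice of metric, not necessarily the one TS-Arena uses.

```python
def mean_absolute_error(y_true, y_pred):
    """Average absolute difference between observed and forecast values."""
    if len(y_true) != len(y_pred):
        raise ValueError("ground truth and forecast must have the same length")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# Toy stand-ins for one archived challenge (hypothetical values and names).
ground_truth = [10.0, 12.0, 11.0, 13.0]
archived_forecasts = {
    "model_a": [9.0, 12.0, 12.0, 13.0],
    "model_b": [11.0, 11.0, 11.0, 15.0],
}

# Forecast produced offline by the model under evaluation.
new_forecast = [10.0, 11.0, 11.0, 13.0]

# Score every archived forecast, then add the new model, marked as self-reported.
scores = {
    name: mean_absolute_error(ground_truth, forecast)
    for name, forecast in archived_forecasts.items()
}
scores["my_model (self-reported)"] = mean_absolute_error(ground_truth, new_forecast)

# Lower error is better, so sort ascending.
ranking = sorted(scores.items(), key=lambda item: item[1])
for name, score in ranking:
    print(f"{name}: MAE = {score:.3f}")
```

Labeling the new entry as self-reported in the output keeps it visually distinct from the forecasts that were pre-registered on the platform.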

The archive is available on Hugging Face.

Usage Guidelines

Temporal split. Set your model's training cutoff strictly before the start of the evaluation period, so that no data from the evaluation window leaks into training.
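A minimal sketch of such a cutoff, assuming training data is held as timestamped records. The `temporal_split` helper, the record layout, and the 1 January 2026 start date are hypothetical and chosen only for illustration; use the actual start of the evaluation period for the snapshot you work with.

```python
from datetime import datetime, timezone


def temporal_split(rows, eval_start):
    """Keep only observations timestamped strictly before eval_start."""
    return [row for row in rows if row["timestamp"] < eval_start]


# Hypothetical start of the evaluation period (an assumption, not an official date).
eval_start = datetime(2026, 1, 1, tzinfo=timezone.utc)

# Toy observations straddling the cutoff.
rows = [
    {"timestamp": datetime(2025, 12, 30, tzinfo=timezone.utc), "value": 1.0},
    {"timestamp": datetime(2025, 12, 31, tzinfo=timezone.utc), "value": 1.2},
    {"timestamp": datetime(2026, 1, 1, tzinfo=timezone.utc), "value": 1.1},
    {"timestamp": datetime(2026, 1, 2, tzinfo=timezone.utc), "value": 0.9},
]

train = temporal_split(rows, eval_start)

# Guard against leakage: fail loudly if any training record reaches the cutoff.
assert all(row["timestamp"] < eval_start for row in train)
print(f"kept {len(train)} of {len(rows)} records for training")
```

The comparison is strict (`<`), so an observation stamped exactly at the start of the evaluation period is excluded from training.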

Self-reported results. Clearly state that results are based on the TS-Arena Archive and are not official rankings from the TS-Arena platform.

Official leaderboard. To appear in the live rankings and in future archive snapshots, a model must participate in challenges at ts-arena.live.

Citation

@misc{meyer2026tsarenaliveforecast,
  title={TS-Arena -- A Live Forecast Pre-Registration Platform},
  author={Marcel Meyer and Sascha Kaltenpoth and Henrik Albers and Kevin Zalipski and Oliver Müller},
  year={2026},
  eprint={2512.20761},
  archivePrefix={arXiv},
  primaryClass={cs.LG},
  url={https://arxiv.org/abs/2512.20761},
}