Backtesting Archive
Offline evaluation snapshots for fast model benchmarking without running live challenges.
The archive provides quarterly snapshots of past challenges, including input context, ground truth, and the pre-registered forecasts from all participating models. This lets you evaluate a new model offline and report results in a paper, e.g. "evaluated on TS-Arena Archive Q1 2026".
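As a rough illustration of offline evaluation, the sketch below scores a new model's forecast against the ground truth and ranks it alongside archived forecasts. Everything in it is an assumption made for illustration: the dictionary layout, the model names, the numbers, and the choice of mean absolute error as the metric. It does not reflect the archive's actual schema or the scoring used by TS-Arena.

```python
from statistics import mean


def mean_absolute_error(forecast, truth):
    """Average absolute difference between two equal-length sequences."""
    if len(forecast) != len(truth):
        raise ValueError("forecast and ground truth must have the same length")
    return mean(abs(f - t) for f, t in zip(forecast, truth))


def rank_models(forecasts, truth):
    """Return (model_name, MAE) pairs sorted from lowest to highest error."""
    scores = {
        name: mean_absolute_error(values, truth)
        for name, values in forecasts.items()
    }
    return sorted(scores.items(), key=lambda item: item[1])


# Hypothetical snapshot of a single challenge (made-up numbers, not archive data).
ground_truth = [10.0, 12.0, 11.0, 13.0]
archived_forecasts = {
    "archived_model_a": [10.5, 11.5, 11.0, 12.0],
    "archived_model_b": [9.0, 13.0, 12.0, 14.0],
}

# Forecast produced offline by the new model under evaluation (also made up).
candidate = {"my_new_model": [10.0, 12.5, 10.5, 13.0]}

ranking = rank_models({**archived_forecasts, **candidate}, ground_truth)
for name, score in ranking:
    print(f"{name}: MAE = {score:.3f}")
```

Because the archived forecasts were pre-registered, only the new model's forecast is produced after the fact, which is why the temporal split guideline matters for a fair comparison.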
The archive is hosted on Hugging Face.
Usage Guidelines
Temporal split. Use a training cutoff strictly before the start of the evaluation period to prevent data leakage.
Self-reported results. Clearly state that results are based on the TS-Arena Archive and are not official rankings from the TS-Arena platform.
Official leaderboard. Inclusion in the live rankings and future archive dumps requires participation in challenges at ts-arena.live.
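The temporal split guideline can be enforced with a small guard before training. The following is a minimal sketch, not part of any TS-Arena tooling: the function names, the `timestamp` field, and the assumption that the Q1 2026 evaluation period starts on 1 January 2026 are all hypothetical.

```python
from datetime import date


def check_training_cutoff(training_cutoff, evaluation_start):
    """Raise if the cutoff is not strictly before the evaluation period."""
    if training_cutoff >= evaluation_start:
        raise ValueError(
            f"training cutoff {training_cutoff} must be strictly before "
            f"evaluation start {evaluation_start}"
        )


def filter_training_rows(rows, training_cutoff):
    """Keep only rows observed on or before the training cutoff."""
    return [row for row in rows if row["timestamp"] <= training_cutoff]


# Assumed start of the evaluation period (hypothetical date for Q1 2026).
evaluation_start = date(2026, 1, 1)
training_cutoff = date(2025, 12, 31)

# Made-up observations; the "timestamp" and "value" keys are illustrative.
rows = [
    {"timestamp": date(2025, 11, 30), "value": 1.0},
    {"timestamp": date(2025, 12, 31), "value": 2.0},
    {"timestamp": date(2026, 1, 15), "value": 3.0},  # inside evaluation period
]

check_training_cutoff(training_cutoff, evaluation_start)
train_rows = filter_training_rows(rows, training_cutoff)
```

Running the guard first means a misconfigured cutoff fails loudly instead of silently leaking evaluation-period data into training.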
Citation
@misc{meyer2026tsarenaliveforecast,
  title={TS-Arena -- A Live Forecast Pre-Registration Platform},
  author={Marcel Meyer and Sascha Kaltenpoth and Henrik Albers and Kevin Zalipski and Oliver Müller},
  year={2026},
  eprint={2512.20761},
  archivePrefix={arXiv},
  primaryClass={cs.LG},
  url={https://arxiv.org/abs/2512.20761},
}