Time series forecasting is important for decision making across industries such as retail, energy, finance, and healthcare. However, developing accurate machine-learning-based forecasting models has traditionally required substantial dataset-specific tuning and model adjustment.
In a paper we recently posted to arXiv, we present Chronos, a family of pretrained time series models based on language model architectures. Like large language models or vision-language models, Chronos is a foundation model, which learns from large datasets how to produce general representations useful for a wide range of tasks.
The key insight behind Chronos is to treat time series data as a language to be modeled by off-the-shelf transformer architectures. To tokenize real-valued time series observations into a fixed vocabulary, we scale the time series by its absolute mean and then quantize the scaled time series into a fixed number of uniformly spaced bins.
In addition to these bin tokens, we add two special tokens, PAD and EOS, to denote padding/missing values and end of sequence, respectively. We can then train standard language models such as T5 on this “language of time series” using the conventional categorical cross-entropy loss function, without changing the model architecture itself.
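To make the scheme concrete, here is a minimal sketch of this tokenization in Python. The number of bins, the bin range, and the special-token ids are illustrative assumptions for this example, not the exact values used by Chronos.

```python
import numpy as np

# Illustrative constants; Chronos's actual vocabulary size, bin range,
# and token ids may differ.
N_BINS = 4094            # hypothetical number of uniform quantization bins
PAD_ID, EOS_ID = 0, 1    # special tokens; bin tokens start after these
LOW, HIGH = -15.0, 15.0  # assumed value range covered by the bins

def tokenize(series: np.ndarray) -> np.ndarray:
    """Map a real-valued time series to a sequence of discrete tokens."""
    # Mean scaling: divide by the mean absolute value of the series.
    scale = np.abs(series).mean()
    scaled = series / (scale if scale > 0 else 1.0)

    # Uniform quantization: clip to the supported range and assign each
    # observation to one of N_BINS equally spaced bins.
    edges = np.linspace(LOW, HIGH, N_BINS + 1)
    bins = np.clip(np.digitize(scaled, edges) - 1, 0, N_BINS - 1)

    # Offset past the special tokens and mark the end of the sequence.
    return np.append(bins + 2, EOS_ID)

print(tokenize(np.array([10.0, 12.0, 13.0, 11.0, 15.0])))
```

Decoding a predicted token back to a value simply reverses these steps: map the token to its bin center and multiply by the stored scale.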
Despite its simplicity, Chronos is remarkably accurate. In a comprehensive evaluation involving 42 datasets, Chronos significantly outperformed classical statistical methods as well as specialized deep learning models on datasets that were part of its training corpus. More importantly, on entirely new datasets, Chronos’s zero-shot performance was comparable, and occasionally superior, to that of models trained directly on those datasets.
A core strength of Chronos is its ability to leverage diverse time series data from different domains to improve generalization. To improve the model’s robustness, we augment the public data used for pretraining with randomly mixed real samples (TSMix) and with a synthetically generated dataset based on Gaussian processes (KernelSynth).
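To convey the flavor of these augmentations, the sketch below implements a simplified TSMix-style mixing step and a simplified KernelSynth-style generator. The kernel bank, mixing weights, and all numeric parameters here are assumptions for illustration, not the paper’s exact configurations.

```python
import numpy as np

rng = np.random.default_rng(0)

def tsmix(dataset, k=3, length=128):
    """TSMix-style sketch: a convex combination of k randomly drawn,
    mean-scaled series (each assumed to be at least `length` long)."""
    idx = rng.choice(len(dataset), size=k)
    weights = rng.dirichlet(np.ones(k))  # convex combination weights
    mixed = np.zeros(length)
    for w, i in zip(weights, idx):
        s = np.asarray(dataset[i][:length], dtype=float)
        mixed += w * (s / max(np.abs(s).mean(), 1e-8))
    return mixed

def kernelsynth(length=128):
    """KernelSynth-style sketch: compose random kernels and sample one
    series from the resulting Gaussian process prior."""
    t = np.arange(length, dtype=float)[:, None]
    d = t - t.T
    rbf = np.exp(-0.5 * (d / rng.uniform(5, 50)) ** 2)
    periodic = np.exp(-2.0 * np.sin(np.pi * np.abs(d) / rng.integers(4, 48)) ** 2)
    # Combine the two gram matrices with a random binary operation;
    # sums and elementwise products of valid kernels remain valid kernels.
    cov = rbf + periodic if rng.random() < 0.5 else rbf * periodic
    cov += 1e-6 * np.eye(length)  # jitter for numerical stability
    return rng.multivariate_normal(np.zeros(length), cov)

synthetic_series = kernelsynth()
```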
Chronos’s impressive zero-shot capabilities position it as a viable general-purpose forecaster that simplifies deployment pipelines. Instead of training separate models for each bespoke application, practitioners can use an off-the-shelf Chronos model to make accurate forecasts immediately, reducing costs and lowering the barrier to adopting advanced forecasting.
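For instance, the open-source package exposes a simple pipeline interface. The snippet below sketches zero-shot usage as of this writing; model names and interface details may evolve, so consult the repository for current instructions.

```python
# pip install chronos-forecasting  (or install from the GitHub repository)
import torch
from chronos import ChronosPipeline

# Load a pretrained Chronos model from the Hugging Face Hub.
pipeline = ChronosPipeline.from_pretrained(
    "amazon/chronos-t5-small",
    device_map="cpu",
    torch_dtype=torch.bfloat16,
)

# Any 1-D tensor of historical observations serves as context.
context = torch.tensor([112.0, 118.0, 132.0, 129.0, 121.0, 135.0,
                        148.0, 148.0, 136.0, 119.0, 104.0, 118.0])

# Sample probabilistic forecasts for the next 12 steps, then summarize
# the samples as a median path with an 80% prediction interval.
forecast = pipeline.predict(context, prediction_length=12)
low, median, high = torch.quantile(
    forecast[0].float(), torch.tensor([0.1, 0.5, 0.9]), dim=0
)
```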
Despite Chronos’s strong empirical results, our investigation only scratches the surface of what can be achieved by adapting language modeling to time series. As the paper discusses, future research could explore more sophisticated time series tokenization schemes, architectures tailored to sequential data, and the explicit incorporation of auxiliary features or domain knowledge.
The use of pretrained models for time series forecasting is an exciting frontier. By reframing forecasting as a form of language modeling, Chronos demonstrates a simpler path to general and accurate prediction. Furthermore, Chronos should be able to seamlessly incorporate future advances in LLM design. We invite researchers and practitioners to try Chronos, now available open source, and join us in developing the next generation of time series models.