Kronos is the first open-source foundation model specifically designed for financial candlesticks (K-lines). It uses a specialized tokenizer to quantize continuous OHLCV (Open, High, Low, Close, Volume) data into hierarchical discrete tokens, then trains autoregressive Transformers on these tokens. The model was trained on data from more than 45 global exchanges, and the paper was accepted at AAAI 2026.

What models are available in the Kronos family?

Kronos offers four model sizes: Kronos-mini (4.1M params, 2048 context), Kronos-small (24.7M params, 512 context), Kronos-base (102.3M params, 512 context), and Kronos-large (499.2M params, 512 context). The first three are open-source on Hugging Face; Kronos-large is not yet released.

How does Kronos handle financial data differently?

Unlike general-purpose time series models, Kronos is designed for the unique, high-noise characteristics of financial data. It uses a specialized tokenizer (Kronos-Tokenizer) that quantizes multi-dimensional K-line data into hierarchical discrete tokens before training, treating financial market data as a specialized language.
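To make "hierarchical discrete tokens" concrete, here is a toy sketch of two-level (coarse, then fine) quantization of a single normalized value. It is an illustration only: the uniform binning, the bin counts, and the function names are assumptions made for this example, not the Kronos-Tokenizer's actual algorithm.

```python
def quantize_hierarchical(x, lo=-1.0, hi=1.0, coarse_bins=4, fine_bins=4):
    """Map a value in [lo, hi] to a (coarse, fine) token pair.

    Toy uniform binning for illustration; not the Kronos-Tokenizer algorithm.
    """
    x = min(max(x, lo), hi)                      # clip to the supported range
    total = coarse_bins * fine_bins
    idx = min(int((x - lo) / (hi - lo) * total), total - 1)
    return divmod(idx, fine_bins)                # (coarse token, fine token)


def dequantize_hierarchical(coarse, fine, lo=-1.0, hi=1.0,
                            coarse_bins=4, fine_bins=4):
    """Return the centre of the bin addressed by a (coarse, fine) pair."""
    total = coarse_bins * fine_bins
    idx = coarse * fine_bins + fine
    return lo + (idx + 0.5) * (hi - lo) / total


tokens = quantize_hierarchical(0.30)             # coarse bucket, then fine bucket
approx = dequantize_hierarchical(*tokens)        # reconstruction from the tokens
```

The coarse token alone gives a rough location; the fine token refines it within that bucket. The reconstruction error is bounded by half a fine-bin width.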

Can I finetune Kronos on my own data?

Yes. Kronos provides a complete finetuning pipeline with examples for the Chinese A-share market using Qlib. The process involves: (1) data preparation with Qlib, (2) finetuning the tokenizer, (3) finetuning the predictor, and (4) backtesting. Multi-GPU training is supported via torchrun.

What's the maximum context length for Kronos?

Kronos-mini supports a context length of 2048 tokens, while Kronos-small, Kronos-base, and Kronos-large support 512. KronosPredictor automatically truncates inputs that exceed the model's context length, so for best results keep the input length (lookback) within these limits.
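If you prefer to trim the history yourself rather than rely on automatic truncation, a minimal sketch looks like this. The helper name `clip_lookback` is hypothetical, and keeping the most recent rows is an assumption of this example rather than a description of KronosPredictor's internals.

```python
def clip_lookback(rows, max_context):
    """Keep only the most recent `max_context` rows of a history window."""
    if max_context <= 0:
        raise ValueError("max_context must be positive")
    return rows[-max_context:]


history = list(range(600))            # stand-in for 600 candles, oldest first
window = clip_lookback(history, 512)  # the 512 most recent candles
```

For a pandas DataFrame the equivalent is `df.iloc[-max_context:]`.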

Does Kronos support batch prediction?

Yes. The predict_batch method enables parallel prediction on multiple datasets simultaneously, leveraging GPU parallelism. All series must have the same historical length and prediction length, and each DataFrame must contain ['open', 'high', 'low', 'close'] columns at minimum.
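The constraints above can be checked before calling `predict_batch`. The following is a sketch under stated assumptions: `check_batch` and `REQUIRED_COLUMNS` are hypothetical names, and the function only relies on each item exposing `.columns` and `len()`, as pandas DataFrames do.

```python
REQUIRED_COLUMNS = ("open", "high", "low", "close")


def check_batch(df_list):
    """Raise ValueError if the series cannot be batched together."""
    if not df_list:
        raise ValueError("df_list is empty")
    expected_len = len(df_list[0])
    for i, df in enumerate(df_list):
        columns = set(df.columns)
        missing = [c for c in REQUIRED_COLUMNS if c not in columns]
        if missing:
            raise ValueError(f"series {i} is missing columns: {missing}")
        if len(df) != expected_len:
            raise ValueError(
                f"series {i} has {len(df)} rows, expected {expected_len}"
            )
```

Calling `check_batch(df_list)` first turns a shape mismatch into a readable error that names the offending series. The prediction length needs no per-series check here because it is passed once as `pred_len`.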

Is there a live demo?

Yes. Kronos has a live demo on Hugging Face that visualizes forecasting results for BTC/USDT over the next 24 hours. Access it at the Hugging Face Spaces link from the official repository.

Kronos: Open-source foundation model for financial | explainx.ai Blog

Model	Tokenizer	Context length	Params	Open-source	Hugging Face
Kronos-mini	Kronos-Tokenizer-2k	2048	4.1M	✅	NeoQuasar/Kronos-mini
Kronos-small	Kronos-Tokenizer-base	512	24.7M	✅	NeoQuasar/Kronos-small
Kronos-base	Kronos-Tokenizer-base	512	102.3M	✅	NeoQuasar/Kronos-base
Kronos-large	Kronos-Tokenizer-base	512	499.2M	❌	Not yet released
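Because each model must be paired with its own tokenizer and context length, it can help to keep the table above as a lookup. This is a sketch: `MODEL_ZOO` and `resolve_checkpoint` are hypothetical names, and the Hub ID `NeoQuasar/Kronos-Tokenizer-2k` is assumed by analogy with `NeoQuasar/Kronos-Tokenizer-base`.

```python
# name -> (model Hub ID, tokenizer Hub ID, max context length)
MODEL_ZOO = {
    # Tokenizer ID for mini is an assumption (see the note above).
    "Kronos-mini": ("NeoQuasar/Kronos-mini", "NeoQuasar/Kronos-Tokenizer-2k", 2048),
    "Kronos-small": ("NeoQuasar/Kronos-small", "NeoQuasar/Kronos-Tokenizer-base", 512),
    "Kronos-base": ("NeoQuasar/Kronos-base", "NeoQuasar/Kronos-Tokenizer-base", 512),
    # Kronos-large is omitted because it is not yet released.
}


def resolve_checkpoint(name):
    """Return (model_id, tokenizer_id, max_context) for a released model."""
    try:
        return MODEL_ZOO[name]
    except KeyError:
        raise ValueError(f"{name!r} is not a released Kronos checkpoint") from None


model_id, tokenizer_id, max_context = resolve_checkpoint("Kronos-small")
```

The returned IDs can be passed to the `from_pretrained` calls, and `max_context` bounds the lookback window.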

python

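from model import Kronos, KronosTokenizer, KronosPredictor

# Load the tokenizer and model from the Hugging Face Hub
tokenizer = KronosTokenizer.from_pretrained("NeoQuasar/Kronos-Tokenizer-base")
model = Kronos.from_pretrained("NeoQuasar/Kronos-small")

# Wrap both in a predictor; max_context=512 matches Kronos-small's context length.
# Constructor arguments follow the upstream README; check them against your version.
predictor = KronosPredictor(model, tokenizer, max_context=512)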

python

import pandas as pd

# Load your data
df = pd.read_csv("./data/XSHG_5min_600977.csv")
df['timestamps'] = pd.to_datetime(df['timestamps'])

# Define context window and prediction length
lookback = 400
pred_len = 120

# Prepare inputs
x_df = df.loc[:lookback-1, ['open', 'high', 'low', 'close', 'volume', 'amount']]
x_timestamp = df.loc[:lookback-1, 'timestamps']
y_timestamp = df.loc[lookback:lookback+pred_len-1, 'timestamps']

python

# Generate predictions
pred_df = predictor.predict(
    df=x_df,
    x_timestamp=x_timestamp,
    y_timestamp=y_timestamp,
    pred_len=pred_len,
    T=1.0,          # Temperature for sampling
    top_p=0.9,      # Nucleus sampling probability
    sample_count=1  # Number of forecast paths to generate and average
)

print("Forecasted Data Head:")
print(pred_df.head())
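The `T` and `top_p` arguments control how each next token is drawn. As a generic illustration of temperature scaling and nucleus (top-p) sampling, and not a copy of Kronos's internal sampler, a minimal sketch could look like this. The function name `sample_top_p` and the example logits are made up for this example.

```python
import math
import random


def sample_top_p(logits, temperature=1.0, top_p=0.9, rng=random):
    """Sample a token index with temperature scaling and nucleus filtering."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [value / temperature for value in logits]
    peak = max(scaled)                            # subtract the max for stability
    weights = [math.exp(s - peak) for s in scaled]
    total = sum(weights)
    probs = [w / total for w in weights]

    # Keep the smallest set of most-likely tokens whose mass reaches top_p.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, mass = [], 0.0
    for i in order:
        kept.append(i)
        mass += probs[i]
        if mass >= top_p:
            break

    # Draw from the kept tokens in proportion to their probabilities.
    return rng.choices(kept, weights=[probs[i] for i in kept], k=1)[0]


token = sample_top_p([2.0, 1.0, 0.1, -3.0], temperature=1.0, top_p=0.9)
```

A lower temperature sharpens the distribution toward the most likely token, and a lower `top_p` discards more of the unlikely tail. Averaging several sampled paths, as `sample_count` does, smooths out the randomness this introduces.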

python

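# Prepare multiple datasets. df1..df3 and the timestamp series are placeholders,
# each built the same way as x_df, x_timestamp, and y_timestamp above.
df_list = [df1, df2, df3]
x_timestamp_list = [x_ts1, x_ts2, x_ts3]
y_timestamp_list = [y_ts1, y_ts2, y_ts3]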

# Generate batch predictions
pred_df_list = predictor.predict_batch(
    df_list=df_list,
    x_timestamp_list=x_timestamp_list,
    y_timestamp_list=y_timestamp_list,
    pred_len=pred_len,
    T=1.0,
    top_p=0.9,
    sample_count=1,
    verbose=True
)

# Results in same order as input
for i, pred_df in enumerate(pred_df_list):
    print(f"Predictions for series {i}:")
    print(pred_df.head())

Metric	Value
Stars	23.3k
Forks	4.1k
Watchers	215
Contributors	18
Language	Python (81.9%), HTML (17.7%), Shell (0.4%)
License	MIT

bibtex

@misc{shi2025kronos,
  title={Kronos: A Foundation Model for the Language of Financial Markets},
  author={Yu Shi and Zongliang Fu and Shuo Chen and Bohan Zhao and Wei Xu and Changshui Zhang and Jian Li},
  year={2025},
  eprint={2508.02739},
  archivePrefix={arXiv},
  primaryClass={q-fin.ST},
  url={https://arxiv.org/abs/2508.02739}
}

What it is	First open-source foundation model for financial K-lines (candlesticks)
Training data	45+ global exchanges (crypto, stocks, commodities)
Architecture	Decoder-only Transformer with specialized tokenizer for OHLCV data
Model family	Mini (4.1M), Small (24.7M), Base (102.3M), Large (499.2M params)
Context length	2048 (mini), 512 (small/base/large)
Conference	Accepted at AAAI 2026
Finetuning	Complete pipeline with Qlib integration for quantitative trading
Batch prediction	Parallel forecasting on multiple time series via `predict_batch`
GitHub stats	23.3k stars, 4.1k forks, 18 contributors, MIT license

Kronos: Open-source foundation model for financial candlesticks accepted at AAAI 2026

TL;DR

Why Kronos matters: financial data as a language

Model zoo: four sizes from mini to large

Getting started: installation and first forecast

Installation

Making your first forecast

Batch prediction: parallel forecasting for multiple assets

Live demo: BTC/USDT 24-hour forecast

Finetuning on your own data: A-share market example

Finetuning workflow (4 steps)

From demo to production: important considerations

1. Raw signals vs. pure alpha

2. Data handling

3. Strategy complexity

4. Backtest fidelity

Architecture: how Kronos works

1. Specialized tokenizer (Kronos-Tokenizer)

2. Autoregressive Transformer

3. De-quantization

Performance and benchmarks

GitHub stats and community

Bottom line

Citation