Reference

Models Reference

How the intelligence is built. Twelve production ML models, plus an algorithmic layer (CTMC win/podium probability with a full position distribution, ECP/ECPA expected championship points, and EKF tyre health), produce the analytics.* and weather.* feeds. This page documents how each model works — its type, inputs, and calibration. For the payload each one emits, see the Analytics feeds →

ℹ

Models are delivered as feed outputs, not as a separate product. You subscribe to a feed (e.g. analytics.strategy); the models behind it run server-side in real time. The model count is the documented source of truth; the live GCS manifest is the runtime truth.

Production ML models

Model	Type	Predicts	Feed
pit stop probability	BiLSTM	P(pit stop this lap)	analytics.strategy
safety car	LightGBM + Hawkes	P(SC/VSC next lap)	analytics.strategy
overtake	XGBoost	P(overtake within 3 laps)	analytics.strategy
LTOE lap-time outlier estimation	LightGBM	Lap Time Over Expected (s)	analytics.strategy
undercut threat	LightGBM	Undercut threat score	analytics.strategy
overcut threat	LightGBM	Overcut threat score	analytics.strategy
qualifying pace	LightGBM + conformal	Per-sector pace delta	analytics.qualifying
fastest lap	XGBoost	P(session fastest lap)	analytics.race-odds
DNF risk	Bathtub hazard	DNF risk score	analytics.strategy
tyre compound	LightGBM (multiclass)	Next-compound probabilities	analytics.tire-strategy
pit-stop count	LightGBM	Expected remaining stops	analytics.strategy
pit-window survival	Survival model	Pit-window survival	analytics.pit-window

Deep dives for the most-integrated models follow. The rest share the same training and calibration pipeline.

Pit stop — BiLSTM

A bidirectional LSTM reads a 20-lap rolling feature window per driver — tyre age, fuel-corrected pace delta, stint length, gap to the cars ahead and behind, and track status — and emits pitStopProbability each lap. pitRecommended trips when the probability clears an era-adjusted threshold (0.53 pre-2026, 0.60 for 2026). It drives analytics.strategy and analytics.tire-strategy.

Safety car — LightGBM + Hawkes

A gradient-boosted classifier scores incident risk from track state, lap, field spread, and recent incidents; a Hawkes self-exciting process layers on the post-incident clustering that a static classifier misses. A 3-class variant separates full safety car from VSC (fullScProbability / vscProbability). Session-level — the same value applies to all drivers.

Overtake — XGBoost

Predicts overtakeProbability — P(position gain via on-track overtake within three laps) — from the gap to the car ahead, pace delta, tyre offset, and a per-circuit overtake index. For 2026 the indices were recalibrated for zone-independent Overtake Mode rather than fixed DRS zones.

Pace — LTOE

Lap Time Over Expected. A LightGBM model predicts the expected lap time from fuel-corrected pace and stint context; the residual (actual − expected) is the LTOE signal — negative means faster than expected. It feeds paceMode classification (PUSH / HOLD / MANAGE / WARM_UP / DELTA). During the 2026 calibration window it returns with reduced confidence (ltoeConfidenceFlag: "2026_early").

Algorithmic intelligence layer

Three non-ML components ship alongside the models and produce the bulk of the race-odds and tyre signals.

Win probability — CTMC

A continuous-time Markov chain over race positions produces a full 20-element positionDistribution per driver, from which podium/points odds, expected points, expected laps led, and forward position-change projections are derived analytically. Powers analytics.race-odds and analytics.constructor (via joint-distribution convolution).

Tyre health — EKF

An extended Kalman filter tracks each driver's lap-delta series to estimate tireHealth (0–1), degradation rate (s/lap), and stint lap — robust to safety-car and traffic noise that fools a naive regression.

Expected championship points — ECP/ECPA

Season-level points expectation (and an adjusted variant) projecting championship outcomes from the current standings and per-race position distributions. Surfaces on analytics.championship-probability.

2026 regulation adjustments

The 2026 F1 regulations — tripled ERS power, turbo lag, active aero replacing DRS — required every model to be re-assessed. What changed:

Model	Feed	Status	Action taken
Tyre degradation	analytics.strategy	Recalibrated	Circuit parameters updated for new thermal loading from turbo lag. Wider confidence intervals until 4+ 2026 races.
Win probability	analytics.race-odds	Updated	Overtake Mode: same 1.0s gap threshold. CTMC transition rate β₃ adjusted 0.90 → 1.10 for electric boost.
Pit stop	analytics.strategy	Adjusted	ERS clipping creates saw-tooth lap-time variance that mimics cliff signatures. Decision threshold raised 0.53 → 0.60.
Safety car	analytics.strategy	Offset applied	Base SC rate +0.10 for the first 6 races of 2026 (driver adaptation period).
Overtake	analytics.strategy	Recalibrated	Circuit overtake indices updated for zone-independent Overtake Mode vs. DRS-zone activation.
LTOE	analytics.strategy	Suppressed	Returns with confidenceScale 0.75 and ltoeConfidenceFlag "2026_early" until 3+ races of 2026 data.

← Analytics feeds API Reference →