AI energy trading price forecasting has become a genuine competitive differentiator for electricity traders, energy retailers, and utilities operating across European power markets. The combination of rapidly growing renewable generation, increasing market volatility, the rise of short-duration battery storage, and the tightening of imbalance settlement rules has made manual or purely statistical price forecasting insufficient for many participants. Machine learning models now form part of the analytical toolkit at trading desks across the Netherlands and wider Europe — but the gap between a well-designed forecasting system and a poorly built one is wide, and the consequences of overconfidence are commercially significant. This guide provides a grounded, vendor-neutral overview of how AI price forecasting works, where it genuinely adds value, and where its limits lie. It is intended for energy traders, portfolio managers, balancing engineers, and strategy leads at utilities and trading houses who want an honest assessment before committing to a build.
Nothing in this article constitutes financial or trading advice. Electricity price forecasting involves irreducible uncertainty; all models inform decisions, they do not guarantee outcomes.
Why AI energy trading price forecasting is gaining traction now
European electricity markets have changed structurally over the past decade, and the pace of change is accelerating. The proliferation of zero-marginal-cost generation — wind and solar — has introduced a new dynamic to spot markets: prices are increasingly set by the residual demand left over after renewables have been dispatched, rather than by the fuel costs of marginal thermal plant. This residual demand is inherently more volatile and harder to predict using traditional regression approaches calibrated on historical thermal-dominated price behaviour.
On EPEX SPOT, the primary exchange for day-ahead and intraday electricity in the Netherlands and neighbouring countries, the frequency and magnitude of price spikes and negative price episodes has increased. Negative prices — periods in which generators pay to dispatch electricity rather than curtail — now occur regularly across Northern European markets, including the Dutch hub. For a trader with an unhedged generation portfolio or an open balancing position, the ability to anticipate these episodes even partially is commercially material.
At the same time, the growth of intraday trading — driven partly by wind and solar forecast revisions made after gate closure of the day-ahead auction — has created additional market microstructure complexity. Prices in the continuous intraday market on EPEX SPOT can move sharply in the hours and minutes before real-time delivery, reflecting updated generation forecasts, unexpected demand shifts, or cross-border congestion changes. Machine learning electricity spot price prediction models trained on intraday order book data and weather forecast revision signals can help traders size and time intraday positions more effectively than discretionary methods.
How accurate is AI at forecasting day-ahead electricity prices on EPEX SPOT?
This is the most important question to answer honestly, and the answer is nuanced. AI models — particularly ensemble methods and deep learning architectures — can meaningfully outperform naive benchmarks (such as using the previous day's price profile or a simple seasonal mean) and often outperform classical statistical models such as ARIMA or basic regression on standard accuracy metrics. However, several important caveats apply.
First, accuracy varies systematically with market conditions. In periods of moderate renewable output and stable demand, day-ahead price forecasting is a tractable problem and even modest models perform reasonably. In periods of high renewable penetration, rapid demand shifts, or unexpected cross-border constraints, price volatility spikes and all models — including state-of-the-art deep learning ones — see materially wider forecast errors. The episodes that are hardest to forecast are often the most commercially consequential: the extreme price events where the stakes of being wrong are highest.
Second, backtesting results are inherently optimistic. A model evaluated against historical data has, by definition, been trained and tested in conditions it has already seen. Real deployment exposes the model to regime changes — a new interconnector coming online, a policy change affecting renewable subsidy dispatch, or a geopolitical event affecting gas prices and therefore thermal generation marginal costs — that may not be represented in training data. Honest backtesting uses walk-forward validation (training on data up to time T, evaluating on data from T to T+horizon) and tests across multiple market regimes, not just benign periods.
Third, the relevant performance benchmark is not absolute accuracy but improvement over your current method. A ML day-ahead electricity price forecast Netherlands model that reduces your portfolio's forecast error by a meaningful margin relative to your existing approach delivers commercial value, even if it is still imperfect in absolute terms. The right framing is: does this model improve my trading decisions at the margin, net of the cost and complexity of building and maintaining it?
With those caveats stated: yes, well-built AI models, properly validated and honestly deployed, can provide a meaningful edge in day-ahead price forecasting for EPEX SPOT, particularly in capturing non-linear relationships between renewable generation, cross-border flows, and price outcomes that simpler models miss.
The main drivers of electricity spot prices and why they matter for model design
Understanding what drives electricity prices in the Dutch and wider European market is prerequisite to building a good forecasting model. The key drivers are:
- Renewable generation output: Solar and wind set the residual demand level that thermal generators must cover. High renewable output typically suppresses prices; low output during periods of high demand pushes prices up. The relationship is non-linear — as renewables approach the level of total demand, prices can collapse to zero or go negative rapidly.
- Fuel prices: Natural gas prices, via the TTF hub, remain the primary marginal cost driver for Dutch gas-fired generation. Coal and carbon (EU ETS) prices affect the competitive position of hard coal plant relative to gas. A forecasting model that does not incorporate these signals is missing one of the most important drivers of the price level.
- Cross-border flows and interconnector constraints: The Netherlands is well-connected to Belgium, Germany, the UK, Denmark, and Norway. When prices diverge across borders, flows arbitrage the difference until interconnectors are congested. Congestion management outcomes from ENTSO-E capacity allocation affect the available cross-border capacity and therefore the effective supply accessible to the Dutch hub.
- Demand levels and temperature effects: Residential and commercial heating demand has become more weather-sensitive as heat pump adoption grows, and EV charging introduces new demand patterns. A good price forecasting model incorporates demand-side signals, not just supply-side ones.
- Market microstructure and balancing: The balancing market clearing price — the imbalance settlement price — reflects TenneT's real-time procurement actions and feeds back into intraday market behaviour. Participants with open imbalance positions are incentivised to close them in the intraday market or to adjust output.
- Hydro reservoir levels: Norwegian and Swedish hydropower is a significant swing factor for Northern European prices. Periods of low reservoir levels reduce the cheap hydro available for export, tightening supply in the Netherlands and Germany and pushing prices up.
Model architectures for electricity price forecasting
Several machine learning architectures are used in practice for electricity price forecasting. The right choice depends on your forecasting horizon, the richness of your feature set, and the degree of interpretability your trading or risk management team requires.
Gradient-boosted ensemble models
Gradient-boosted tree models — XGBoost, LightGBM, CatBoost — remain among the strongest performers for structured time-series forecasting tasks where the feature set is well-defined. They handle mixed feature types well (continuous weather variables, categorical calendar features, lagged price values), are relatively resistant to overfitting with appropriate regularisation, and produce feature-importance scores that help trading teams understand what is driving the model's predictions. For AI algorithmic energy trading Europe, interpretability is not a luxury: risk managers and compliance teams need to understand model behaviour before approving its use in trading decisions.
Deep learning for intraday price modelling
Recurrent neural networks — particularly LSTM and GRU architectures — and transformer-based sequence models are better suited to capturing complex temporal dependencies in intraday price data, where the market is a continuous sequence of events rather than a once-daily auction. A deep learning intraday power price model can, in principle, learn the interaction between order book dynamics, weather forecast revisions, and real-time balancing signals that evolve continuously through the trading day. The trade-off is training complexity, greater data requirements, and reduced interpretability relative to ensemble methods.
Hybrid and ensemble approaches
In practice, the most robust commercial forecasting systems tend to combine multiple model types: a physics-informed or fundamental model that captures the structural drivers of price (fuel costs, renewable dispatch, cross-border flows) is combined with a machine learning layer that captures the residual patterns the fundamental model misses. The ensemble output is then calibrated against recent market data. This approach benefits from the interpretability and structural grounding of fundamental modelling while exploiting the pattern-recognition capability of machine learning.
Probabilistic forecasting
Producing a single point forecast — a price estimate for each delivery period — is less useful for trading and risk management than producing a probabilistic forecast: a distribution of possible price outcomes, or at minimum a prediction interval. Quantile regression, conformal prediction, and Bayesian neural network approaches all enable probabilistic outputs. For battery storage optimisation, for example, knowing the probability of a negative price episode in the next two hours is more actionable than knowing the expected price alone, because the value of a storage dispatch decision depends on the full distribution of outcomes, not just the central estimate.
AI imbalance settlement price forecasting
AI imbalance settlement price forecasting is a distinct and commercially important sub-problem. The imbalance settlement price (in the Netherlands, set by TenneT as the Balancing Responsible Party mechanism) reflects the cost of TenneT's real-time balancing actions and is applied to participants whose metered output or consumption deviates from their programme. A participant who can anticipate whether the imbalance price will be high or low — and in which direction — can make more informed decisions about whether to close an open position in the intraday market or accept the settlement price.
Forecasting the imbalance settlement price is harder than forecasting day-ahead prices because it depends on real-time system events that are inherently less predictable. However, machine learning models trained on historical balancing activation data, real-time generation forecast errors, and system balance signals can provide useful probabilistic guidance. The key inputs are the current system balance (long or short), the direction and magnitude of recent balancing activations, and the residual renewable forecast for the next few hours. These signals are partially observable in real time via ENTSO-E transparency platform data and TenneT's own publications.
It is important to be realistic about what such a model can achieve. The imbalance price is set by actions taken under conditions of genuine uncertainty by a system operator whose exact decision criteria are not fully public. A probabilistic forecast that guides the direction of a position adjustment is useful; a model presented as providing high-confidence point predictions of the settlement price is overstating its capability and should be treated with scepticism.
AI renewable curtailment prediction
AI renewable curtailment prediction has become increasingly relevant as grid congestion in the Netherlands — particularly on the distribution and sub-transmission networks in areas of high solar density such as Zeeland, Noord-Holland, and parts of the province of Utrecht — has led TenneT and regional DSOs to curtail generation more frequently. For a solar or wind generator with a merchant position, curtailment events directly affect revenue. For a trader with a position in a zone affected by congestion, curtailment changes the effective supply and therefore the price.
Machine learning models can learn to identify the conditions under which curtailment becomes likely: combinations of high renewable forecast output, low demand, limited interconnector headroom, and local grid constraints that exceed thermal limits. These are non-linear interactions that rule-based systems handle poorly. A well-trained curtailment prediction model provides a probabilistic signal that can inform hedging decisions, generation scheduling, and battery charging strategies in areas where curtailment risk is material.

The honest caveat here is that curtailment decisions involve operational judgement calls by grid operators and are not purely determined by observable inputs. A model trained on historical curtailment events may not capture novel congestion situations or new regulatory instruments applied to grid management. Treating curtailment prediction as a signal to incorporate in a broader decision framework — not as a binary curtailment/no-curtailment oracle — is the appropriate use.
AI battery storage trading optimisation
AI battery storage trading optimisation is one of the highest-value applications of machine learning in the energy sector today. Battery storage systems — whether utility-scale or co-located with generation — can earn revenue across multiple stacking markets: day-ahead arbitrage, intraday arbitrage, frequency containment reserve (FCR), automatic frequency restoration reserve (aFRR), and imbalance settlement. The challenge is that the value in each market is uncertain and the battery's state of charge at any given moment constrains its available capacity for future actions.
Reinforcement learning and model predictive control (MPC) approaches, informed by price forecasting models across each of the stacking markets, can optimise dispatch decisions sequentially: deciding at each time step whether to charge, discharge, or hold, given the current state of charge, the current price forecast, and the uncertainty around that forecast. The optimisation is non-trivial because actions taken now constrain future optionality — a battery that is fully charged during a negative price episode cannot charge further to capture subsequent price rises.
Crux Digits builds machine learning models that inform battery dispatch optimisation, combining price forecasting signals across day-ahead and intraday markets with probabilistic curtailment and imbalance price inputs. The architecture is always transparent about what is a model output (uncertain) versus what is a physical constraint (certain), and dispatch rules are designed with risk limits that prevent the optimisation from taking positions whose downside exceeds the operator's risk tolerance.
Data requirements for AI energy price forecasting
The quality of any AI price forecasting system is fundamentally limited by the quality and completeness of its input data. The following are the core data requirements for a well-built Dutch electricity price forecasting system:
- Historical EPEX SPOT day-ahead auction prices: Hourly clearing prices for the Dutch (NL) hub, ideally covering at least four to five years to include multiple market regime cycles, including the 2021-22 gas price spike period and the subsequent normalisation.
- Intraday transaction data: If building an intraday model, continuous intraday transaction data from EPEX SPOT at 15-minute resolution, including traded volume by delivery period.
- Renewable generation forecasts and actuals: Wind and solar generation forecasts — as issued before gate closure of the day-ahead auction — and actual metered output. The difference between these (forecast error) is a key driver of intraday price moves.
- Fuel price data: TTF natural gas futures prices (front month and prompt), ARA coal, and EU ETS carbon allowance prices (EUA front month).
- Weather data: Temperature, wind speed and direction, solar irradiance, and cloud cover, both forecast and actual, at spatial resolution appropriate for the generation assets that matter most for Dutch prices — including offshore wind zones and major solar regions.
- Cross-border flow and interconnector capacity data: Net Transfer Capacities (NTCs) or Flow-Based Market Coupling parameters for the Dutch borders, available via ENTSO-E transparency platform.
- ENTSO-E transparency data: Total load actual and forecast, generation by source (actual and forecast), cross-border physical flows. All publicly available at ENTSO-E Transparency Platform — a foundational data source for any European electricity price model.
- TenneT balancing data: Imbalance settlement prices, activated balancing energy volumes, and system balance signals, published by TenneT for the Dutch control area.
Our data engineering practice designs and maintains the pipelines that collect, clean, and align these data sources into a consistent training and inference environment. Data pipeline reliability is not a secondary concern — a price forecasting model that receives stale or missing features silently degrades in ways that may not be immediately visible to the trading team.
Practical checklist before deploying an AI price forecasting model
- Define the decision it serves: Is this model informing day-ahead bid quantities, intraday position adjustments, battery dispatch, imbalance management, or all of the above? Each use case has different horizon, accuracy, and latency requirements.
- Establish a performance baseline: What does your current forecasting method achieve in terms of mean absolute error (MAE), root mean squared error (RMSE), or economic value (position P&L improvement)? The AI model must beat this baseline convincingly to justify the build.
- Use walk-forward validation: Never evaluate a price forecasting model on in-sample data or simple train/test splits. Use walk-forward (expanding window or rolling window) validation across multiple market regimes, including volatile periods.
- Build probabilistic outputs: Point forecasts are less actionable than prediction intervals or full predictive distributions for trading and risk management applications. Design for probabilistic output from the start.
- Incorporate risk limits: The model should produce a forecast and a confidence signal; your trading system should apply risk limits (maximum position size, stop-loss thresholds, maximum model-age before human review) that cap the downside if the model is wrong.
- Plan for retraining and monitoring: Markets change. The gas price spike of 2021-22 invalidated models trained on the preceding decade of relatively low-volatility European gas prices. Build monitoring and retraining cadence into the operational design from day one.
- Assess EU AI Act applicability: AI systems used in energy market trading operations may attract regulatory scrutiny depending on the scale of the participant and how the model output is used. Document the model's purpose, its validation record, and the human oversight mechanisms in place.
Honest limits: what AI price forecasting cannot do
Price forecasting is one of the hardest problems in applied machine learning, precisely because electricity markets are competitive: if a forecasting edge were perfectly reliable, it would be arbitraged away by market participants acting on it. Several structural limits apply to any AI forecasting system in this domain.
Extreme events — the negative price spikes that occur during periods of very high renewable output and low demand, and the price spikes that occur during cold snaps or supply shocks — are inherently difficult to forecast with high confidence because they are driven by combinations of factors that occur rarely in training data. A model calibrated on normal market conditions will underestimate the probability and magnitude of tail events. Separate stress-testing frameworks, informed by scenario analysis rather than purely statistical extrapolation, are needed to manage tail risk.
Geopolitical and policy discontinuities cannot be modelled away. The 2022 supply disruption and its effect on European gas prices was not predictable from historical electricity price data. Any model that claims to forecast prices across geopolitical regime changes is overstating its capability. Fundamental scenario analysis — what happens to Dutch day-ahead prices if gas supply from a major source is reduced materially — requires human judgement and scenario construction, not machine learning.
Model decay is real and faster in markets than in many other domains. As more participants use similar ML approaches, the information content of the signals those approaches exploit may reduce. Monitoring model performance in production — not just at deployment — and having a process for diagnosing and addressing degradation is a core operational requirement, not an afterthought.
How Crux Digits approaches energy price forecasting engagements
Crux Digits is a vendor-neutral AI consultancy based in Utrecht, working with energy traders, utilities, and balancing responsible parties across the Netherlands and the wider EU. We build electricity price-forecasting models for Dutch traders and utilities across day-ahead and intraday markets, grounded in robust data engineering, honest model evaluation, and production-grade deployment practices.
Our engagement model is designed to be transparent about uncertainty at every stage. We begin with a data readiness assessment — understanding what data you have, what is missing, and what data acquisition work is needed before model building can begin. We then build a validated baseline model and benchmark it rigorously against your existing forecasting method using walk-forward evaluation across multiple market regimes. Only when the baseline shows genuine improvement over your current approach do we proceed to a full production build.
We cover the full stack: the machine learning modelling layer, the data engineering infrastructure that feeds real-time and historical data into the model, and the AI implementation work that integrates forecasting outputs into your trading systems, risk management platform, or battery dispatch controller. We do not impose a cloud platform or a model vendor — the architecture is chosen to fit your operational environment and your team's capability to own and maintain it after handover.
Our case studies illustrate the kinds of energy and utility problems we have worked on, and our pricing page sets out how engagements are typically structured. If you have a specific forecasting challenge — day-ahead price uncertainty, intraday positioning, imbalance management, or battery optimisation — get in touch for a free initial conversation. We will give you an honest assessment of what is achievable with your data and in your market context, without overpromising.
For public reference data, the ENTSO-E Transparency Platform is the primary public source for European electricity generation, load, and cross-border flow data, and is an essential input to any credible price forecasting model for the Dutch and wider European market.
Frequently asked questions
How accurate is AI at forecasting day-ahead electricity prices on EPEX SPOT?
AI models — particularly gradient-boosted ensembles and deep learning architectures — can meaningfully outperform naive benchmarks and classical statistical models on standard accuracy metrics for day-ahead EPEX SPOT price forecasting. However, accuracy varies with market conditions: normal periods are more tractable than episodes of high renewable penetration, supply shocks, or geopolitical disruption. Backtesting results are inherently optimistic, so honest evaluation requires walk-forward validation across multiple market regimes. The right question is not whether the model is perfect, but whether it improves your trading decisions at the margin relative to your current approach. Price forecasting involves irreducible uncertainty; this is not financial advice.
What data sources are needed to build a reliable electricity price forecasting model for the Dutch market?
A credible Dutch electricity price forecasting model requires: historical EPEX SPOT day-ahead auction prices (ideally four to five years, covering multiple market regimes); TTF gas futures prices, coal, and EU ETS carbon prices; wind and solar generation forecasts and actuals; weather data (temperature, wind, irradiance) at appropriate spatial resolution; cross-border flow and interconnector capacity data from ENTSO-E; TenneT balancing market data; and calendar features. ENTSO-E Transparency Platform data is the foundational public source. The pipeline that collects, cleans, and aligns these sources reliably is as important as the model architecture itself.
How does AI battery storage trading optimisation work and what are the main value streams?
Battery storage systems can earn revenue across multiple stacked markets: day-ahead price arbitrage, intraday arbitrage, frequency containment reserve (FCR), automatic frequency restoration reserve (aFRR), and imbalance settlement. AI optimisation — typically reinforcement learning or model predictive control informed by price forecasting models — decides at each time step whether to charge, discharge, or hold, given the current state of charge, the price forecast, and uncertainty around that forecast. The optimisation is complex because current actions constrain future optionality. Crux Digits builds battery dispatch optimisation systems with explicit risk limits that prevent the model from taking positions whose downside exceeds the operator's risk tolerance.
What is imbalance settlement price forecasting and why does it matter for Dutch energy market participants?
The imbalance settlement price is set by TenneT for the Dutch control area and applied to participants whose metered generation or consumption deviates from their nominated programme. A participant who can anticipate whether the imbalance price will be high or low — and its direction — can make better decisions about whether to close an open position in the intraday market or accept the settlement price. Machine learning models trained on historical balancing activation data, system balance signals, and real-time renewable forecast errors can provide probabilistic guidance. However, the imbalance price depends on operational judgements by TenneT that are not fully observable, so honest use treats this as a directional signal, not a high-confidence point forecast.
How does Crux Digits approach electricity price forecasting projects for traders and utilities?
We start with a data readiness assessment: understanding what historical price, generation, weather, and fuel data you have, identifying gaps, and designing the data pipeline before touching model architecture. We then build a validated baseline model and benchmark it against your current forecasting method using walk-forward evaluation across multiple market regimes — so you have an objective measure of improvement before committing to a full production build. We cover the full stack: machine learning modelling, data engineering, and AI implementation into your trading or risk systems. We are vendor-neutral. Get in touch via our contact page for a free initial conversation about your specific forecasting challenge.