TN-010 — Layered Economic Loss Function for Electricity Endurance Risk

Abstract

CFO Illustration

A Finnish industrial firm with €2B annual revenue allocates a €20M VAR risk budget for electricity price exposure — approximately 1% of revenue, consistent with standard practice. Under the LELF framework, the same firm's CEL₉₀ for a 48-hour compound winter event is approximately €400M — 20× the VAR budget, representing exposure to 20% of annual revenue in a single event. The VAR budget is correctly sized for price risk. It does not model the risk that CEL quantifies. The firm is optimising against 1% of its actual electricity endurance exposure.

Illustrative Full-Stack Example

OL3 unplanned outage + cold period (T < −15°C) + wind drought + SE1 at capacity: compound probability P ≈ 0.5% per winter. Duration P90 ≈ 36 hours (from TN-009 tail distribution). CEL₉₀ at 36 hours: €0.5–2.0 billion (L4–L5 threshold, sector-weighted). Expected annual loss contribution: P × CEL₉₀ = 0.005 × €1.25B = €6.25M/year. This figure does not appear in any current financial risk model for Finnish electricity exposure. The remainder of this note develops the framework that produces it.

Standard financial risk models treat electricity price as a continuous variable subject to Value at Risk analysis. This is appropriate for commodity price exposure but fails categorically for physical outage risk. An electricity outage is not a price event — it is a state transition. Damage does not scale linearly with duration; it accumulates in discrete layers as successive process tolerances are breached. This note develops a Layered Economic Loss Function (LELF) that maps outage duration to sector-specific damage thresholds, proposes a Conditional Endurance Loss (CEL) metric as a CVaR analogue for electricity endurance, and shows how WEM's SP_cluster component can serve as a probabilistic input to CEL estimation. The framework is designed to make electricity endurance risk legible to financial sector actors whose standard tools systematically underestimate it.

The Categorical Failure of VAR for Outage Risk

Value at Risk (VAR) and its successor Conditional Value at Risk (CVaR) are built on a continuity assumption: small changes in the underlying variable produce proportionally small changes in portfolio value. For electricity price exposure — the exposure a manufacturer faces when spot prices spike — this assumption holds approximately. If electricity costs rise 50%, operating margins compress by a calculable amount.

State transition, not price change. A grid under stress does not gradually reduce supply — it maintains nominal voltage until it cannot, then drops abruptly. The economic damage function is not smooth; it has hard thresholds at the moment supply fails. A firm with uninterruptible processes faces zero marginal cost from the first hour of high prices and catastrophic cost from the first second of lost supply. These are not on the same curve.

Duration dependence is nonlinear. Two hours without power is an operational disruption. Twenty hours is a process catastrophe for any continuous manufacturing operation. The damage does not scale proportionally — it accumulates through qualitatively different phases as successive tolerance thresholds are breached. A paper mill that shuts down cleanly at hour two faces a different cost structure than one that runs its process into an uncontrolled stop at hour six.

Quantitative contrast (illustrative, not calibrated): VAR₉₅ for electricity price exposure (Finnish industrial firm): Input: historical spot price distribution Output: €2–5M at 95th percentile over one year → Captures: price spikes, demand response cost → Misses: physical outage, state transition, recovery cost CEL₉₀ for electricity endurance exposure (same firm): Input: P90 outage duration × LELF threshold × sector weight Output: €50–500M at L3–L4 threshold crossing → Captures: process damage, recovery cost, dependency cascade → Misses: nothing structural — this is the correct risk measure Ratio: CEL₉₀ / VAR₉₅ ≈ 10–100x This ratio is the financial sector's blind spot. The VAR number is not wrong — it correctly measures price risk. It simply does not measure the same thing as CEL. A firm optimising against VAR has zero protection against the risk CEL quantifies.

Timing heterogeneity is invisible to price models. A stress hour at 03:00 on a Sunday in May has approximately zero economic consequence for most commercial actors. The same stress hour at 14:00 on a Tuesday in January, during a cold spell, with industrial loads at maximum and hospital systems under peak demand, carries orders-of-magnitude higher systemic cost. Standard energy risk models assign identical weight to both.

The Layered Economic Loss Function (LELF)

The LELF maps outage duration to cumulative economic damage through discrete threshold layers. Each layer represents a qualitatively distinct damage category that becomes active when a duration threshold is crossed.

LELF(d, t, s) = Σᵢ Lᵢ(s) · 𝟙[d ≥ θᵢ(s)] · f(t, s) where: d = outage duration (hours) t = time of occurrence (hour of day, day of week, season) s = sector index (industrial, commercial, residential, critical infrastructure) Lᵢ(s) = layer-specific loss function for sector s θᵢ(s) = sector-specific duration threshold activating layer i (not a single shared value — each sector has its own tolerance profile) f(t,s) = sector-specific timing multiplier [0.1 – 3.0] (industry peaks weekdays 07–21; residential peaks evenings and weekends) Key correction from v0.1: θᵢ was incorrectly specified as sector-invariant. Threshold heterogeneity is the central feature of the LELF framework.

Layer definitions

The layer thresholds are sector-specific. The table below shows the kernel layer structure; §02.1 details subsystem threshold profiles for each infrastructure category.

Finnish empirical reference: VTT and Energiateollisuus ry cost-of-interruption studies provide sector-specific loss rates in €/kWh unserved. These are linear averages which systematically understate L3–L5 costs as they are derived from short-duration survey data. The LELF framework uses these as L1–L2 baselines and applies threshold multipliers for L3–L5 based on process engineering tolerances and the subsystem profiles in §02.1.

Subsystem Response Profiles

The following profiles specify buffer capacity, threshold dynamics, and recovery asymmetry for six critical infrastructure subsystems. These feed into the LELF layer calculation as sector-specific θᵢ(s) values and inform the dependency overlay described in §02.2.

District heating

District heating has one of the largest buffer capacities of any electricity-dependent system, but the buffer size varies by three orders of magnitude depending on system scale. Large urban systems (Helsinki, Vantaa) operate massive hot-water accumulator tanks — Helen's Salmisaari facility holds 10,000 m³, providing approximately 48–72 hours of heating at normal winter demand before supply temperature begins falling. Smaller municipal systems typically operate with 6–12 hours of thermal mass. Rural and building-scale systems have 2–4 hours of radiator and pipe thermal storage.

Recovery asymmetry: once district heating pipe temperature drops below operational threshold, restart requires pressurisation, venting, and sequential reactivation across the distribution network — typically 4–12 hours after power is restored. L3 threshold is therefore: Helsinki/Vantaa ~48h; mid-size cities ~6–12h; rural/small systems ~2–4h.

Water supply

Water towers and pressure vessels provide 2–6 hours of buffer at normal consumption. The failure mode is not purely a pressure event — it is a contamination event. When network pressure drops below a threshold, external pressure can drive contaminants into the distribution network. This activates a post-restoration protocol: network flushing, water sampling, and laboratory confirmation before the network is declared safe. This protocol typically requires 12–48 hours after power restoration regardless of outage duration.

Recovery asymmetry is severe: a 6-hour outage may produce 48–72 hours of post-restoration unavailability. The LELF L3 threshold for water supply is therefore not symmetric — it should be expressed as a function of both outage duration and post-restoration recovery time. Pumping stations in smaller municipalities have limited generator capacity and fuel reserves, creating L4 exposure at 6–12 hours.

Wastewater management

Wastewater pumping stations are fully electricity-dependent with limited buffer capacity. Collection network storage — the volume held in pipes and lift station wet wells — typically provides 2–6 hours of buffer before overflow conditions develop, depending on inflow rates and network topology. Unlike water supply, where pressure loss is the primary failure mode, wastewater failure produces an active environmental hazard: untreated sewage discharge to waterways, streets, or basements. This makes wastewater one of the fastest-escalating LELF categories.

Finnish regulatory requirement (Water Services Act) mandates controlled overflow to designated discharge points before uncontrolled overflow occurs. This requires active management — operators must be reachable and able to respond — which in turn depends on telecommunications remaining functional. The cascade: telecommunications degradation at ~3 hours impairs wastewater operator response precisely when buffer exhaustion is approaching.

Recovery asymmetry is severe and environmentally extended. After power restoration, network decontamination, pump inspection, and environmental sampling are required before normal operation resumes. Regulatory reporting and remediation obligations may extend the effective recovery period by days to weeks. Finnish environmental law (Ympäristönsuojelulaki) triggers mandatory notification and liability assessment from confirmed overflow events.

Biological treatment processes (activated sludge) at wastewater treatment plants do not survive extended shutdowns — microbial communities die within 12–24 hours of aeration failure, requiring 1–2 weeks of biological recovery before the plant reaches normal treatment efficiency. This creates a post-restoration discharge quality problem independent of the overflow events during the outage itself.

Empirical recovery rule: if outage duration is d hours, total wastewater system impairment (outage + recovery) is approximately 3–5 × d hours. The LELF loss function for wastewater takes the form:

LELF_wastewater(d,t) = L_env(d) + L_recovery(d) + L_contam · 𝟙[d ≥ θ_contam] where: L_env(d) = environmental damage cost (overflow volume × treatment cost) L_recovery(d) ≈ k_ww × d^1.5 (superlinear — biological process restart) L_contam = fixed cost if wastewater reaches drinking water sources θ_contam ≈ 12–24h (drinking water contamination threshold) Note: L_contam activates a discrete step change in total loss — this is the mechanism behind the L4 threshold severity in wastewater.

L3 threshold: 2–4 hours (overflow risk). L4 threshold: 6–12 hours (confirmed environmental discharge, remediation liability). Smaller municipalities with limited generator capacity face L3 exposure significantly earlier than urban systems.

Healthcare

Healthcare has among the most robust backup power of any sector — major hospitals operate diesel generators sized for 24–72 hours at full load, with priority fuel resupply agreements. However, the failure profile is not binary and the recovery asymmetry is significant. The sector also has the highest unit cost of failure of any LELF category: an interrupted intensive care procedure or delayed emergency response has no economic analogue.

L1 threshold (0–4h): Elective procedures paused, generators running, critical care unaffected. L2 threshold (4–12h): Smaller health centres approach generator fuel limits; patient transfer decisions begin. L3 threshold (12–48h): Larger hospitals on fuel consumption alert; non-critical wards may lose power sequentially as fuel is rationed to ICU and operating theatres. L4 threshold (48h+): Only the largest facilities with dedicated fuel reserves remain fully operational; widespread patient evacuation required for mid-tier hospitals.

Recovery asymmetry: elective surgery backlog, equipment recertification, staff fatigue from extended emergency protocols, and patient record system restoration typically add 6–24 hours of reduced-capacity operation after power restoration. The primary dependency is fuel logistics — healthcare endurance is ultimately bounded by diesel resupply chains, not generator capacity.

Fuel distribution — Core System Variable

Fuel distribution is the hidden cascade amplifier in the LELF framework. Most petrol stations have no backup generation — pumps, payment terminals, and monitoring systems operate on UPS for 0–2 hours, then stop. This creates a supply constraint that progressively limits the effective duration of all diesel-dependent backup systems across every other LELF sector.

The cascade timeline: at L1 (0–2h), fuel stations begin failing. By L2 (2–6h), approximately 95% of retail fuel stations are non-operational. Backup generators across hospitals, data centres, telecommunications base stations, and water pumping stations begin drawing down their stored diesel reserves. By L3 (6–12h), only facilities with dedicated fuel contracts and pre-positioned reserves (major hospitals, large data centres, critical telecoms nodes) retain meaningful fuel security. All other backup-dependent systems begin entering forced shutdown as their stored diesel is exhausted.

This creates a structural revision to all other subsystem θ(s) values: the nominal backup duration (e.g., hospital 72h, data centre 48h) applies only if fuel resupply logistics function. Under an extended outage with simultaneous road network disruption (weather, traffic management failure), effective backup duration collapses to stored-diesel capacity only — typically 12–24 hours for most facilities. The fuel distribution subsystem is therefore not a LELF category in its own right but a time-dependent modifier R_fuel(d) applied to all backup-dependent subsystems:

R_resupply(d) — resupply availability scalar (sigmoidal collapse): R_resupply(d) = 1 / (1 + e^(k·(d − dᶜ))) where: dᶜ ≈ 10–14h (critical point: fuel network percolation threshold) k ≈ 0.4 (collapse steepness — empirically, most stations fail within a 4–6h window around dᶜ, not gradually) Approximate values: d = 4h: R_resupply ≈ 0.88 (most stations still operational) d = 10h: R_resupply ≈ 0.50 (critical point — rapid collapse) d = 14h: R_resupply ≈ 0.12 (near-complete failure) d = 24h: R_resupply ≈ 0.02 (stored reserves only) Note: The linear approximation (v0.3) understated cascade severity. The sigmoidal form reflects the percolation physics of fuel network failure — a connected network becomes disconnected abruptly at a critical threshold, not gradually. R_fuel_effective(s,d) — effective backup duration modifier: R_fuel_effective = R_resupply(d) × stored_factor(s) where stored_factor(s) = ratio of stored diesel to total nominal capacity Effective backup duration = nominal(s) × R_fuel_effective(s,d) × R(s,d) Note: R_resupply is the supply-chain term; R_fuel_effective combines resupply availability with each facility's stored diesel buffer.

Cold chain (food and pharmaceutical)

Retail cold chain has minimal buffer: supermarket refrigeration operates on UPS for 0–4 hours, after which perishables begin degrading. Large cold storage warehouses retain temperature for 12–24 hours through insulation. Pharmaceutical cold chain (insulin, vaccines) follows supermarket timelines for retail pharmacies but hospital pharmaceutical stores typically have 24–48 hours of managed storage. Economic loss scales steeply: L2 (4–6h) produces significant fresh produce loss; L3 (12–24h) produces near-total warehouse inventory loss with mandatory disposal under food safety law. Recovery requires regulatory inspection and restocking — typically 2–5 days of reduced availability. Finnish food retail concentration (S-group, K-group) means that central warehouse losses cascade directly to retail availability.

Payment systems

Bank data centres maintain backup power comparable to large data centres (48–72h). However, retail payment infrastructure — card terminals, ATMs, point-of-sale systems — has minimal backup. UPS provides 0–2 hours; thereafter, only cash transactions are possible. ATM networks typically have 4–8 hours of UPS backup but cannot be restocked without functioning logistics. At L2 (2–6h), cashless payment fails across most retail and service locations. This creates social friction that compounds all other crisis responses: fuel cannot be purchased for generators, emergency supplies cannot be bought, logistics cannot be coordinated through normal commercial channels. Payment system failure is not a direct economic loss in the LELF sense — it is a friction multiplier that degrades the efficiency of every other crisis response by 20–40%.

SCADA and industrial control systems

SCADA and remote control systems are not an independent LELF sector — they are a cascade accelerator. Power grid SCADA, district heating remote control, water utility monitoring systems, and traffic management operate on 4–12 hours of backup power but are often disabled earlier by telecommunications failure. When SCADA systems lose visibility, operators cannot monitor or adjust connected equipment even if the physical infrastructure (pump, valve, substation) remains functional. Recovery requires manual site visits — a process that is extremely slow under simultaneous road network degradation.

Practical consequence: SCADA failure shortens the effective threshold for all dependent subsystems by 20–40% — a water pumping station rated for 6-hour manual operation under outage conditions effectively becomes a 4-hour system if operators cannot remotely diagnose fault conditions. This coefficient is already incorporated into the dependency overlay in §02.2.

Airports

Helsinki-Vantaa Airport has robust backup power (48–72h). However, approach lighting, radar, and instrument landing systems are sensitive to power quality — any voltage irregularity or transition to generator power triggers safety protocols that temporarily halt landings and departures. An outage exceeding 2 hours suspends all flight operations. Air freight (pharmaceuticals, critical components, mail) and passenger evacuation capacity are unavailable during this period. Recovery requires system recertification — typically 4–12 hours after power restoration. Regional airports have significantly weaker backup systems and may face extended closure.

Metro and trams

Helsinki Metro operates on a dedicated 110 kV feed with backup power for pumping stations and safety systems only — not for train traction. An outage exceeding 15 minutes halts the entire metro network. Helsinki, Tampere, and Turku trams follow the same logic. Electric bus charging infrastructure fails immediately; battery-powered buses continue for 2–4 hours on stored charge before the urban bus network reverts to diesel buses — subject to fuel availability under the R_fuel(d) constraint described above. Combined, urban public transport capacity collapses to approximately 30–50% within 4 hours and to diesel-only within 6 hours.

Telecommunications (4G/5G)

Finnish regulatory requirement mandates 3-hour battery backup at base stations. This can be extended to approximately 20 hours with mobile generator units, but generator deployment requires logistics coordination and fuel. The failure profile is non-binary: voice calls and SMS function until battery exhaustion; data services degrade earlier as backhaul links lose power. The critical dependency is that telecommunications degradation impairs remote monitoring and control of other infrastructure systems — district heating, water pumping, hospital coordination — creating a cascade multiplier on other LELF layers.

L2 threshold: ~3 hours (statutory battery exhaustion). L3 threshold: ~6–8 hours (data backhaul degradation, mobile generator deployment window closing). Timing multiplier f(t,s) is particularly high for telecommunications because control system dependencies are constant across diurnal cycles.

Data centres

Data centres operate with UPS systems providing 10–15 seconds of seamless transition to diesel generator. Well-operated facilities can sustain full load indefinitely given fuel supply. The critical vulnerability is fuel logistics: a 48-hour outage in a major weather event may exhaust diesel reserves and disrupt resupply if road access is compromised. L1 threshold is effectively zero — UPS handles the transition. L2 threshold is fuel-supply dependent, typically 48–72 hours for well-equipped facilities. Smaller or older facilities may face L2 at 12–24 hours.

Rail transport

Electrified rail has essentially zero buffer capacity — trains stop immediately on power loss. The recovery profile is asymmetric and slow: safety systems (switches, signals, level crossings) require sequential reset and manual inspection before resuming operations. A 2-hour outage may produce 4–6 hours of operational disruption during recovery. Rail's primary LELF contribution is indirect: it impairs emergency logistics, fuel resupply, and personnel mobility during extended events, amplifying damage in other sectors. L1 threshold: immediate (minutes). Recovery multiplier: 2–3x outage duration.

Traffic management and emergency access

Traffic signal UPS systems provide 15–60 minutes of buffer. After exhaustion, dark signals significantly increase accident risk and create congestion that impairs emergency vehicle access. The cascade effect: impaired emergency access slows response to all other infrastructure failures. L1 threshold: 15–60 minutes. The timing multiplier f(t,s) is highest during peak traffic hours — a 14:00 weekday outage carries 3–5x higher cascade cost than a 03:00 weekend outage for this subsystem.

Buildings (non-district-heating)

Buildings with electric heating (heat pumps, resistance heating) lose heating immediately. Thermal mass provides 12–48 hours of temperature retention depending on insulation quality, external temperature, and building type. Modern well-insulated buildings retain heat significantly longer than older stock. Below −15°C external temperature, poorly insulated buildings may face freeze damage to pipes and pumping equipment within 24–48 hours. Recovery asymmetry: freeze damage to pipework requires repair before heating can be restored, creating extended post-restoration downtime. L3 threshold: 24h (poorly insulated, −15°C); 48h (modern insulated, −10°C).

Dependency Overlay

The subsystem profiles in §02.1 interact through dependency relationships that amplify LELF damage beyond the sum of individual sector losses. The primary cascade chains are:

Layer	Generic threshold	Damage category activated	Threshold range across sectors
L1	0–2 h	Operational disruption. UPS and buffer systems active. Most processes recover cleanly. Cost: lost production hours, restart costs.	0–15 min (rail) to 2 h (commercial)
L2	2–6 h	Process intolerance breach. Continuous processes cannot hold state; controlled shutdown required. Cost: batch loss, restart damage, cleaning cycles.	30 min (paper mill) to 6 h (data centre with generator)
L3	6–48 h	Thermal and infrastructure envelope breach. District heating buffer depletes; cold chain integrity at risk. Telecommunications base stations approach battery exhaustion. Hospitals on fuel consumption alert.	3 h (telecom statutory) to 48 h (large urban district heating with accumulator)
L4	12–72 h	Critical infrastructure degradation. Generator fuel reserves depleted for smaller facilities. Water system contamination risk activates. Freeze damage risk to building systems begins.	6 h (rural water pumping) to 72 h (major hospital complex)
L5	>48 h	Societal function impairment. Cascading failures across interdependent systems. Cost estimation transitions from sectoral to macroeconomic.	Systemic — all sectors simultaneously

Electricity → Telecommunications → Infrastructure control. Telecommunications degradation (L2, ~3h) impairs remote monitoring and control of district heating substations, water pumping stations, hospital systems, and rail infrastructure. This accelerates the threshold crossing for all dependent systems — effectively reducing their θᵢ(s) values by 20–40% under telecommunications failure conditions.

Electricity → Fuel distribution → All backup systems. This is the dominant cascade in extended outages. Fuel station failure at 2–6 hours initiates progressive collapse of backup-dependent systems across all sectors. The fuel cascade converts nominal backup durations (hospital 72h, data centre 48h, telecoms 20h) into actual stored-reserve durations (12–24h for most facilities without dedicated fuel contracts). Under compound stress with simultaneous road network disruption, effective backup duration across the system collapses to approximately 12 hours regardless of nominal generator ratings. R_fuel(d) is the most important time-dependent modifier in the LELF framework — it should be applied to all backup-dependent subsystem calculations.

Electricity → Rail → Emergency logistics. Rail disruption impairs fuel resupply for generators at hospitals, data centres, and telecommunications base stations. This converts fuel-dependent L2 tolerances into L3 exposures if the outage duration approaches generator fuel reserves.

Electricity → Traffic management → Emergency response. Dark traffic signals slow emergency vehicle access, increasing response time to all L3+ events across all sectors. This is a timing multiplier on cascade damage rather than a direct damage source.

The dependency overlay does not require separate LELF curves for each cascade chain. It is incorporated as a threshold reduction factor applied to dependent subsystems when a triggering subsystem has crossed its own threshold.

Conditional Endurance Loss (CEL) — A CVaR Analogue

Financial risk management moved from VAR to CVaR because VAR answers the wrong question: it tells you the maximum loss at a given confidence level but says nothing about the distribution of losses beyond that threshold. CVaR — the expected loss given that the VAR threshold is exceeded — is more informative for fat-tailed distributions.

The electricity endurance analogue follows the same logic. The relevant question is not "what is the probability of a stress hour?" (this is SP in WEM). It is: "given that a stress event occurs, what is the expected economic damage as a function of the duration distribution of that event?"

CEL(w) = Σₛ wₛ · ∫ LELF(d, t̄, s) · p(d | SP_cluster, EPP) dd where: w = sector weight vector (industrial share, commercial share, etc.) t̄ = expected timing given seasonal and diurnal profile p(d | SP_cluster, EPP) = conditional duration distribution given current SP_cluster and EPP state Key insight: SP_cluster T₁₆₈ (longest stress episode, WEM §11) is a direct empirical estimator of p(d | current system state).

CEL band — three-point risk characterisation: CEL_P50 = LELF( d_P50, t̄, s ) × sector_weights [expected scenario] CEL_P90 = LELF( d_P90, t̄, s ) × sector_weights [stress scenario] CEL_P99 = LELF( d_P99, t̄, s ) × sector_weights [tail scenario] where d_Pxx = Pxx percentile of outage duration given compound trigger (derived from TN-009 tail distribution and historical SP_cluster data) Illustrative values for compound scenario (OL3 + cold + wind): d_P50 ≈ 6h → CEL_P50 ≈ €20–80M (L2 threshold) d_P90 ≈ 36h → CEL_P90 ≈ €500M–2B (L4 threshold) d_P99 ≈ 72h → CEL_P99 ≈ €2–10B (L5 threshold, societal) Financial equivalents: CEL_P50 ≈ Expected Loss (EL) — provision against CEL_P90 ≈ CVaR 90% — stress capital CEL_P99 ≈ Tail scenario — catastrophe reinsurance territory

Linkage to EENS and VoLL literature

The CEL framework is compatible with — and extends — the standard power systems reliability metric Expected Energy Not Served (EENS). The relationship is:

CEL = Σₛ αₛ · EENSₛ · VoLL_s(d) where: EENSₛ = Expected Energy Not Served to sector s (MWh) VoLL_s(d) = duration-dependent Value of Lost Load for sector s derived from LELF threshold structure αₛ = sector economic weight Key difference from standard VoLL: Standard VoLL: constant €/MWh (linear — misses threshold effects) LELF-derived VoLL_s(d): step function that jumps at θᵢ(s) VoLL_industry(2h) ≈ €1,500–3,000/MWh (L1: operational disruption) VoLL_industry(12h) ≈ €8,000–15,000/MWh (L3: process destruction) VoLL_water(24h) ≈ €20,000+/MWh (L4: contamination event) This formulation makes CEL directly comparable to EENS-based reliability assessments used by Fingrid, ENTSO-E, and regulators — while correcting the linear VoLL assumption that causes standard models to underestimate L3–L5 damage by 2–3×.

SP_cluster already exists in WEM §11 as T₁₆₈ — the longest single stress episode in the 168-hour window. WEM v2.7.2 also incorporates a persistence premium P(T₁₆₈) directly into the EPP calculation (+0.05 for T₁₆₈ ≥ 6h, +0.12 for T₁₆₈ ≥ 24h), which provides a real-time endurance signal that feeds naturally into CEL estimation. It currently reports the longest stress episode duration and episode count. What is missing is the mapping from SP_cluster values to the LELF threshold layers — and the sector weight vector that translates system stress into economic damage estimates.

Operational CEL (CELₒₚ) — expected loss given current stress state: CELₒₚ = LELF( E[d | SP_cluster], t̄, s ) × sector_weights where E[d | SP_cluster] = expected duration given current cluster length → answers: "what is the expected damage if stress continues at current rate?" Tail CEL (CEL₉₀) — stress-scenario loss: CEL₉₀ = LELF( d₉₀, t̄, s ) × sector_weights where d₉₀ = P90 duration from historical SP_cluster distribution → answers: "what is the damage in a severe but plausible event?" Relationship to financial risk metrics: CELₒₚ ≈ Expected Loss (EL) CEL₉₀ ≈ Conditional Value at Risk (CVaR) at 90th percentile A pension fund or bank already has internal EL and CVaR estimates for credit and market risk. CELₒₚ and CEL₉₀ are the direct analogues for electricity endurance exposure — expressed in the same units (€M) and comparable across risk categories.

The Financial Sector Blind Spot

Finnish pension funds, banks, and insurance companies hold significant exposure to electricity endurance risk through their portfolios — directly via energy infrastructure investments, and indirectly via industrial and commercial lending books. This exposure is currently assessed through two instruments, both of which are structurally blind to physical outage risk.

Trigger	Affected subsystems	Threshold reduction	Mechanism
Telecom outage >3h	SCADA, hospital patient systems, payment systems, water monitoring	−20–40% of nominal θ	Remote monitoring loss → manual response only
Fuel distribution outage >4h	All diesel-dependent backup systems	R_resupply(d) applied — see §02.1 fuel profile	Resupply impossible; stored diesel only
Fuel + road network degradation	Mobility-dependent logistics (ambulance, fuel trucks, repair crews)	Effective backup collapses to 12–24h across system	Physical access impossible
Wastewater overflow >6h	Drinking water sources in affected catchment	Drinking water θ reduced by 50%	Contamination risk activates emergency protocol
Payment system failure >6h	All commercial crisis response	20–40% friction increase on all logistics	Fuel, supplies, repair parts cannot be purchased

Spot price exposure models track electricity cost as a function of market price. They correctly identify the risk that high prices compress industrial margins. They do not model the risk that supply fails entirely, because supply failure is not a price event — it is a physical event that removes the market signal before it can be observed.

Climate risk stress tests (EBA 2025, NGFS scenarios) focus on transition risk — stranded asset exposure, carbon price trajectories, green investment requirements. Physical risk in these frameworks means weather damage to assets, not grid endurance failure. A prolonged cold-still period that stresses the Finnish electricity system does not appear in standard NGFS scenarios because it is a compound infrastructure event, not a climate variable.

The consequence: a Finnish pension fund's exposure to a 48-hour winter grid stress event is effectively zero in its risk models, because the event has no representation in any standard financial risk framework. The physical exposure is real; the modelled exposure is zero. This is the measurement gap in SM-009 applied to financial risk management.

Structural Finding

The financial sector's exposure to electricity endurance risk is not measured because the risk does not fit the instruments available. VAR assumes continuity; outage risk is categorical. Climate stress tests address transition risk; endurance failure is a compound infrastructure event. The CEL framework proposed here is the missing instrument — not because it is sophisticated, but because it asks the right question: given current grid endurance state (EPP, SP_cluster), what is the expected economic damage if the stress persists beyond sector-specific tolerance thresholds?

Calibration Requirements

Duration distribution from Fingrid data 2010–2025. The historical frequency of stress episodes by duration bracket (1–4h, 4–12h, 12–48h, >48h) is calculable from Fingrid open data DS 124/192 using the same SP methodology as WEM. This provides the empirical base for p(d | system state).

LELF threshold values from Finnish cost-of-interruption research. VTT Technical Research Centre and Energiateollisuus ry have published interruption cost estimates for Finnish industry. These require extension from linear averages to threshold-based functions — specifically, the L3–L5 multipliers require engineering-based estimates of process tolerance rather than survey averages which undersample long-duration events.

Sector weight vector from Statistics Finland. Industrial structure data provides the relative economic weight of continuous manufacturing, district heating, cold chain logistics, and critical infrastructure in Finnish GDP. This allows CEL to be expressed as an aggregate economic figure rather than a sector-specific engineering estimate.

Timing correction factor f(t). Diurnal and seasonal demand profiles from Fingrid DS 124 provide the empirical basis for the timing multiplier. A stress event during peak winter industrial demand carries a timing multiplier of approximately 2.5–3.0 relative to a summer night baseline.

Proposed WEM Extension: CEL Display

CEL can be integrated into WEM §11 as a fourth component alongside SP_cluster, FS(p), and MD_proxy. The display would show:

CEL indicator: Input: SP_cluster T₁₆₈ + EPP W168 + seasonal timing factor Output: estimated economic damage range (€M) if current stress pattern persists to threshold Example (illustrative, not calibrated): SP_cluster T₁₆₈ = 3h, EPP = 0.20: CEL ~ €5–20M (L1–L2 range) SP_cluster T₁₆₈ = 12h, EPP = 0.45: CEL ~ €50–200M (L2–L3 range) SP_cluster T₁₆₈ = 36h, EPP = 0.75: CEL ~ €500M–2B (L4–L5 range) Note: These are illustrative order-of-magnitude estimates. Calibration requires the empirical inputs described in §05.

The CEL display would make the WEM instrument legible to financial actors who currently have no entry point into electricity endurance risk. An EPP value of 0.75 is technically precise but communicates nothing to a risk officer at a pension fund. "Estimated conditional economic loss: €500M–2B if current stress pattern persists to 36-hour threshold" communicates directly.

Open Questions

Recovery dynamics

Dominant term beyond L3. Total system loss has two components: direct outage damage (LELF) and recovery cost. Beyond the L3 threshold, recovery cost becomes the dominant term — which explains why 72 hours costs approximately the same as 48 hours in direct damage but significantly more in total loss. This is the mechanism behind the nonlinearity that standard models miss.

Total Loss(d) = LELF(d) + L_recovery(d) Dominance crossover: d < 12h: LELF dominates (~80% of total loss) d = 24h: approximately equal d > 48h: L_recovery dominates (~60–80% of total loss) This is why: 72h total ≈ 48h total (direct LELF plateaus, recovery still growing) but 48h total >> 24h total (LELF jumps, recovery kicks in)

Sector-specific β values for the recovery cost term: A critical additional component is recovery cost — the damage incurred after power is restored but before the system returns to normal operation. Recovery costs are nonlinear and in several sectors exceed direct outage costs for events beyond L3 threshold.

Total Loss = LELF(d, t, s) + L_recovery(d, s) where L_recovery(d, s) ≈ k(s) × d^β(s) Empirical β values by sector: Water supply: β ≈ 1.5 (contamination protocol — cost accelerates with duration) Continuous industry: β ≈ 1.3 (process restart damage + cleaning cycles) District heating: β ≈ 1.2 (network repressurisation, sequential restart) Telecommunications: β ≈ 1.1 (relatively linear — generator deployment logistics) Rail transport: β ≈ 1.4 (safety reset cascades, inspection requirements) β > 1.0 means recovery cost grows faster than linearly with duration. β = 1.5 for water supply means a 20-hour outage costs not 2x a 10-hour outage in recovery — it costs approximately 2.8x, because contamination protocol scope grows with duration. This term is particularly significant because it explains why L4–L5 events are so much more damaging than linear extrapolation from L1–L2 data would suggest. Survey-based cost-of-interruption studies systematically understate L4–L5 costs because they capture LELF(d) but not L_recovery(d).

Reserve capacity adjustment. The LELF as specified does not account for backup generation (hospital generators, industrial UPS systems, district heating emergency boilers). A reserve capacity discount factor R(s) should be incorporated: LELF_adjusted(d,s) = LELF(d,s) · (1 − R(s,d)) where R(s,d) = P(backup generation operational at duration d, sector s) This is time-dependent: a hospital generator rated for 72h has R(hospital,24h)≈0.95 but R(hospital,96h)≈0.20 as fuel reserves are exhausted.. Estimating R(s) requires survey data on backup generation penetration by sector — partially available from HVK strategic reserve assessments.

Interdependency multipliers. L4–L5 damage is amplified by infrastructure interdependencies: telecommunications failure impairs emergency response, which amplifies district heating failure, which amplifies hospital stress. These cascade effects are structurally excluded from sector-by-sector LELF estimates. Modelling them requires a network dependency graph that is beyond the scope of this working draft.

Insurance market data. Finnish business interruption insurance claims from major winter stress events would provide direct empirical validation of L1–L2 estimates and partial validation of L3 estimates. This data exists within the insurance sector but is not publicly available. Finanssiala ry would be the natural counterpart for accessing aggregated claims data.

CEL vs. EENS. The power systems literature uses Expected Energy Not Served (EENS) as its standard reliability metric. CEL is economically more informative than EENS because it weights unserved energy by sector-specific damage rather than treating all MWh equally. The relationship between the two metrics requires explicit articulation — CEL should be expressed as a function of EENS with sector-specific loss-of-load cost coefficients.

Scenario Analysis: 24h, 48h, and 72h Compound Events

The following scenarios apply the LELF framework to a compound winter event: OL3 unplanned outage, T = −18°C (five-day cold spell), wind <400 MW, NVE ~32%, SE1→FI at capacity. Not a full national blackout — approximately 40% of load affected through rolling outages, giving 8–16 hours effective outage per customer in the 24h scenario and 24–48 hours in the 72h scenario. Finnish winter load approximately 14,000 MW; affected load ~5,500 MW.

24-hour event (L2–L3 threshold)

48-hour event (L3–L5 threshold)

R_fuel cascade contribution: approximately €500M–1B of the 48h total would be avoided if fuel resupply logistics remained functional (generators holding nominal duration). This is the single largest avoidable cost component.

72-hour event (L4–L5 regime)

Nonlinearity: 72h ≠ 3 × 24h

The marginal cost from 48h to 72h is dominated by recovery and permanent losses, not direct outage damage — which is why the two figures are nearly identical (€2.4B vs €2.5B) despite the 50% longer duration. The step change is entirely at the 24h→48h boundary, where threshold crossings, R_resupply collapse, and cascade activation combine. The 10× jump between 24h and 48h — not 2× — is the central empirical finding of this scenario analysis. Three mechanisms drive it: threshold crossings (θᵢ) that activate new damage categories sequentially; R_resupply(d) collapse that invalidates all backup duration assumptions; and recovery cost β > 1 that makes system restoration superlinearly expensive with duration. The 72h event is not meaningfully more expensive than 48h in direct costs — it is more expensive in recovery costs and systemic disruption, but the major step change occurs at the 48h boundary.

Sector	LELF level	Loss range	Driver
Continuous industry (paper, chemical, metal)	L2	€100–180M	~66 GWh disrupted × €1,500–3,000/MWh production loss
Retail and services	L1–L2	€30–60M	Sales loss + operational shutdown
Cold chain (food + pharma)	L2→L3	€20–50M	Partial spoilage — threshold not fully crossed
Telecommunications and ICT	L2	€10–25M	Network degradation from ~3h; generator logistics still functional
Water and wastewater	L2	€10–30M	Pressure loss; no widespread contamination yet
Transport, payments, logistics friction	L1–L2	€20–40M	Payment systems, traffic management, logistics delay
Total 24h CEL	L2–L3	€190–385M	Central estimate ~€250M

Sector	Loss range	Driver
Industry	€400–550M	Process damage + extended restart; batch losses
ICT and data centres	€50–150M	Smaller operators fail; telecom cascade; SLA penalties
Retail and payments	€200–350M	POS down 2–6h; logistics friction; lost sales
Cold chain	€150–300M	Supermarkets: critical at 6–12h; warehouses: critical at 24h+
Water and wastewater	€250–550M	Recovery dominates: β=1.5; environmental discharge; biological reset
Healthcare	€150–300M	Operational cost + patient transfers + indirect effects
Transport and logistics	€150–300M	Rail stop; port slowdown; fuel cascade
Households (heating, freeze damage)	€200–600M	Electric/hybrid heating loss; pipe freeze; insurance claims
Public sector / emergency response	€50–150M	Crisis operations, civil protection activation
Total 48h CEL	€1.6–3.2B	Central estimate ~€2.4B

Sector	Loss range	Regime change
Industry	€400–800M	Permanent production loss for some facilities; extended restart
Retail and services	€150–300M	Near-complete shutdown; demand + payment + logistics all failed
Cold chain	€150–300M	Near-total warehouse inventory loss; mandatory disposal
Telecommunications	€50–120M	Fallback collapses; SCADA affected
Water and wastewater	€200–500M	Contamination + biological process reset (1–2 weeks recovery)
Healthcare	€100–250M	Evacuations; critical overload; fuel exhaustion at smaller facilities
Logistics and fuel	€100–250M	R_fuel cascade fully realised; mobility stops
Recovery costs (all sectors)	€500M–1.5B	β > 1 across all sectors; this term dominates 72h vs 24h difference
Total 72h CEL	€1.65–4.0B	Central estimate ~€2.5B

Duration	LELF level	CEL (central)	Ratio to 24h
24h	L2–L3	~€250M	1×
48h	L3–L5	~€2.4B	~10×
72h	L4–L5	~€2.5B	~10×

Comparison with traditional models: Standard €/MWh interruption cost models applied to the same 48h event produce approximately €500M–1B — an underestimate of 2–3× relative to the LELF result. The gap arises because traditional models capture L1–L2 direct costs but miss L3–L5 threshold crossings, recovery asymmetry, and cascade effects. The LELF result is not "pessimistic" — it correctly accounts for the physical mechanisms that make outages expensive.

DC Load Impact and Curtailment Optimisation

Adding 2,000 MW of flat DC load to the compound 72h scenario raises total CEL from approximately €2.5B to €5B — a doubling driven by three mechanisms: increased load shedding across the rest of the economy; DC diesel generators competing with hospitals, water systems, and telecommunications for the same fuel pool; and accelerated threshold crossings as all subsystem endurance timelines shift left by 20–40%.

The generator switch problem

The intuitive response — switch DCs to backup generators, freeing grid capacity — is counterproductive under compound stress. DCs switched to generators consume approximately 450 m³/hour of diesel at 2,000 MW load. This competes directly with hospitals, water pumping stations, and telecommunications base stations for the same limited fuel supply. R_resupply(d) deteriorates faster, collapsing from 24 hours to 12–18 hours of effective system backup. The net effect: grid load is partially relieved, but the diesel crisis is accelerated. DC-to-generator switching is always worse than maintaining DC on grid under compound stress conditions.

Curtailment optimisation results

The optimal strategy is partial load reduction (50–80% sleep mode) executed at 10–14 hours into the event — early enough to prevent cascade progression, late enough that DC direct losses (SLA, workload migration) are bounded by time remaining. The optimisation surface is non-convex with a saddle point near t = 12h.

Non-intuitive findings

Strategy	Timing	DC losses	System CEL	Total CEL	Net saving
No action	—	€0	€5.0B	€5.0B	baseline
DC sleep 50%	t=12h	€1.0B	€3.7B	€4.7B	+€0.3B
DC sleep 80%	t=12h	€1.6B	€3.1B	€4.7B	+€0.3B
DC offline	t=12h	€2.0B	€2.7B	€4.7B	+€0.3B
DC→generator	t=0h	€0	€5.5B	€5.5B	−€0.5B (worse)
DC sleep 50%	t=0h	€3.0B	€3.2B	€6.2B	−€1.2B (worse)
DC offline	t=24h	€1.6B	€3.3B	€4.9B	+€0.1B

Finding 1: DC→generator switching is always worse than keeping DCs on grid under compound stress — and the reason is marginal utility, not simulation artefact. During compound stress, diesel fuel is a scarce resource with a clear social priority ordering:

Marginal social value of diesel, descending priority: 1. Healthcare (hospitals, emergency services) 2. Water and wastewater (contamination prevention) 3. Telecommunications (coordination backbone) 4. Logistics (fuel resupply, emergency response) 5. Data centres (commercial, deferrable workload) DC-to-generator switching moves DC from position 5 (deferrable, grid-powered) to competing with positions 1–4 for the same limited diesel pool. The marginal social value of diesel in DC use is lower than its marginal value in hospital, water, or telecoms use by a factor of approximately 5–10× (based on VoLL ordering in the LELF framework). This is not a simulation result — it is a direct consequence of the social priority ordering that should govern resource allocation under emergency conditions. The policy recommendation (DC sleep protocol, no generator switching) follows from this ordering, not from model parameters.

Finding 1 (restated): DC→generator switching is always worse than keeping DCs on grid under compound stress. The grid load relief does not compensate for the diesel pool competition it creates.

Finding 2: Early (t=0h) full sleep or offline is also suboptimal — DC direct losses exceed cascade savings when action is taken before threshold crossings have begun.

Finding 3: The optimal window is approximately t=10–14h — after L2 thresholds have crossed (justifying action) but before L3–L4 cascades are fully established. Net system saving is approximately €300M against no-action baseline.

Finding 4 (the policy implication): Pre-emptive action triggered by CAT=1 (before the compound event reaches t=12h) outperforms reactive action. CAT condition (C) — NVE <35% AND EPP* >0.50 AND OL3 unavailable — is the natural trigger for DC demand response protocols. A mandatory "CAT-1 sleep protocol" (50% load reduction within 2 hours of CAT trigger) is a coordination instrument requiring grid connection agreement terms, not a technical constraint. It is currently absent from Finnish data centre regulatory framework.

DC Curtailment Policy Rule (proposed): If CAT = 1: DC operators reduce to 50% load within t+2h DC operators do NOT switch to backup generators (sleep mode: reduce workload, maintain minimal cooling) If CAT = 1 AND EPP* > 0.75 (BP-like): DC operators reduce to 20% load within t+1h Fingrid may mandate further reduction under emergency powers Expected system benefit: €200–400M per compound event avoided vs. no protocol: €0 (no instrument exists)

Linkage to TN-009: Expected System Loss

TN-009 provides compound event probabilities. TN-010 provides conditional loss functions. The linkage is the expected system loss calculation — the figure that financial actors can directly compare against other portfolio risks:

Expected System Loss (ESL) = Σᵢ P(scenario i) × CEL(scenario i) For the principal compound scenarios from TN-009: Scenario P/winter CEL_P90 ESL/year ──────────────────────────────────────────────────────────────── OL3 alone (normal conditions) 5.3% €20–80M €1.1–4.2M OL3 + cold 1.8% €80–300M €1.4–5.4M OL3 + cold + wind drought 0.5% €500M–2B €2.5–10M Two-unit + cold + wind 0.1% €2–10B €2–10M ──────────────────────────────────────────────────────────────── Total ESL using P90 (approximate, conservative) €7–30M/year Note: Using mean CEL (not P90) raises this estimate to €20–80M/year, because the fat tail of the duration distribution contributes disproportionately to the expectation. P90-based ESL understates true expected loss when the distribution is heavy-tailed (as compound winter events are). The range €7–80M/year brackets the likely true expected annual cost of electricity endurance risk in Finland. This figure currently appears in no financial risk model, insurance reserve calculation, or regulatory stress test.

Monte Carlo confidence intervals (n=100,000)

The ESL calculation uses point estimates for P(compound) and CEL. A Monte Carlo simulation with 100,000 draws — Beta-distributed P(compound) with 95% mass in [0.1–2.0%/winter], lognormal CEL (P50=€250M, P90=€2.4B), and P-CEL correlation ρ=0.6 — produces:

ESL distribution (M€/year): P10: 0.0 M€/year (most years: no compound event) P50: 0.8 M€/year (median year: small) P75: 4.1 M€/year P90: 17.5 M€/year P95: 39.8 M€/year P99: 178.0 M€/year Mean: 11.6 M€/year Std: 118 M€/year 90% confidence interval: €0–40M/year Tail: top 1% of scenarios produces ~45× the mean annual loss.

P90 (€17.5M/year) approximately equals a typical €20M VAR risk budget — electricity endurance risk at the 90th percentile is already comparable in magnitude to priced electricity price risk. P99 (€178M/year) is 9× the VAR budget. Standard deviation (€118M) exceeds the mean (€12M) by 10×: the distribution is dominated by rare extreme events, not typical years. Capital provisions based on expected value (€12M) are structurally inadequate for fat-tailed risk — the correct reference for reserves and stress capital is P99 (€178M).

CAT → Action Mapping

The Compound Alert Trigger (TN-009 §03.1) becomes operationally meaningful only when mapped to specific response actions. The following table provides a starting framework; thresholds require calibration against institutional mandates:

This mapping is illustrative. The thresholds, responsible actors, and specific actions require formal mandate assignment — the measurement gap identified in SM-009 §09. The framework provides the instrument; institutional design provides the mandate.

CAT	CEL₉₀ range	EPP*	Response action	Time-to-action	Responsible actor
0	< €50M	< 0.50	Standard monitoring	—	WEM operator
1 (slope)	€50–200M	0.50–0.65	Readiness: fuel pre-positioning, standby personnel, DC sleep-protocol alert, contingency review	Within 2h	Fingrid, HVK
1 (level)	€200M–1B	0.65–0.75	Active: DC load reduction to 50%, demand response activation, reserve capacity alert, public communication	Within 1h	Fingrid, TEM
1 (compound)	> €1B	> 0.75	Emergency: DC load to 20%, load control, cross-border coordination, civil authority activation, fuel resupply priority order activated	Within 30min	Fingrid, VNK, pelastustoimi