WP-004 Calibration Report · CR-1 · Two Cases · RP-1 and RP-2

Recovery Capacity Diagnostic Calibration: European Energy and Healthcare Systems

Pre-registered blind reconstructions across two independent domains — results, limitations, and the pre-condition for RP-3

Purpose of This Report

This report documents the first two calibration exercises conducted against the WP-004 Recovery Capacity diagnostic framework. Both cases apply three proxy variables — Variation, Redundancy, Recovery Time — to independent domains using only publicly available data from defined pre-event periods. The purpose is not to explain historical events but to determine whether the framework's gradient hypothesis (dΩ/dt as the primary diagnostic signal) was detectable from pre-event data. Results are recorded here prior to RP-3 (false positive test) to establish a locked baseline against which subsequent tests can be evaluated.

Cases: European gas system 2018–2021 · European hospital systems 2015–2019 · Blind cutoffs: 31 Dec 2021 · 31 Dec 2019 · Status: two positive replications

§ 01

Calibration Protocol

Both calibration cases follow the same protocol established in WP-004 Appendix A. The protocol is reproduced here in condensed form to establish the terms under which results should be interpreted.

Blind Reconstruction Protocol — WP-004 CR

Temporal cutoffA hard cutoff date is established before reconstruction begins. No data, analysis, or knowledge of events after the cutoff date is used in the reconstruction.

Data classPublicly available primary or secondary statistics only. No internal documents, private datasets, or retrospective analyses published after the cutoff.

Target variableSign of dΩ/dt only — the direction of recovery capacity change. No quantitative prediction of failure timing, severity, or probability.

Proxy selectionThree proxies mapping to Variation, Redundancy, and Recovery Time. Selected for data availability and domain-appropriateness before reviewing trend data.

Verdict criteriaPositive result: all three proxies show consistent negative gradient. Signal criteria: ≥3 of 5 WP-004 early warning signals present. These criteria are pre-registered, not applied post-hoc.

This report locks the results of RP-1 and RP-2 prior to RP-3. Once RP-3 conditions are specified, they cannot be modified in response to RP-1 and RP-2 results.

§ 02

Case Summaries

RP-1 · Case 1

European Natural Gas System

Observation period: 2018–2021. Blind cutoff: 31 December 2021. Exposure event (excluded): 2022 supply disruption. Proxy variables: supply source HHI (Variation), winter storage minimum fill rate (Redundancy), TTF post-shock recovery duration (Recovery Time). Data sources: Eurostat nrg_ti_gas, GIE AGSI+, EEX/ICE TTF.

Result: dΩ/dt negative · 4/5 signals present · Danger zone at cutoff

RP-2 · Case 2

European Hospital Systems

Observation period: 2015–2019. Blind cutoff: 31 December 2019. Exposure event (excluded): COVID-19 pandemic stress 2020–2021. Six countries: DE, FR, IT, ES, NL, FI. Proxy variables: care pathway diversity (Variation), hospital beds per capita and occupancy rate (Redundancy), seasonal backlog recovery duration (Recovery Time). Data sources: OECD Health Statistics, Eurostat Healthcare Capacity, WHO GHO.

Result: dΩ/dt negative in 6/6 countries · ≥4/5 signals in all countries · Danger zone at cutoff

§ 03

Proxy Gradient Results

3.1 Variation

Both cases show declining variation across the observation period. In the energy case, supply source HHI increased monotonically 2018–2021, with Russian pipeline gas rising from approximately 40% to over 44% of EU imports. In the healthcare case, care pathway diversity declined across all six countries through centralisation of acute care, reduction in primary care relative capacity, and workforce specialisation that reduced triage flexibility.

The mechanism differs between domains — market consolidation in energy, planned efficiency reform in healthcare — but the structural outcome is identical: fewer available pathways for equivalent system function under stress conditions.

3.2 Redundancy

This is the strongest signal across both cases and the most consistently documented in pre-event public data.

In the energy case, EU gas storage winter minimum fill rates declined across three consecutive annual cycles. Summer refill campaigns consistently failed to restore prior-year buffer baselines. By the blind cutoff date, storage headroom for multi-month disruption absorption had reached its lowest observed level.

In the healthcare case, the pattern is more uniform and covers a longer period. OECD data documents consistent bed reduction across the EU throughout the 2010s, with occupancy rates rising to compensate. By 2019, average acute care bed occupancy in the study countries exceeded levels at which clinical guidance identifies systemic strain risk. ICU capacity had not grown proportionally with population or age-adjusted demand.

Country / System	Variation trend	Redundancy trend	Recovery Time trend	dΩ/dt	Signals ≥3
EU Gas System	↓	↓↓	↑ (elongating)	Negative	4/5 ✓
Germany (DE)	↓	↓	↑	Negative	5/5 ✓
France (FR)	↓	↓	↑	Negative	5/5 ✓
Italy (IT)	↓	↓↓	↑	Strong neg.	5/5 ✓
Spain (ES)	↓	↓↓	↑	Strong neg.	5/5 ✓
Netherlands (NL)	→	↓	→	Slight neg.	4/5 ✓
Finland (FI)	↓	↓	↑	Negative	4/5 ✓

3.3 Recovery Time

Recovery time elongation — the progressive slowing of return to baseline after disruption — was the most operationally visible signal in both cases, and the one most extensively documented in pre-event reports that nevertheless did not trigger systemic response.

In the energy case, TTF post-shock recovery duration increased from approximately 12 days in early 2018 to 38 days after the February 2021 cold event. The October 2021 spike had not recovered to pre-shock baseline by the blind cutoff date. In the healthcare case, winter pressure periods produced progressively longer elective care backlogs in each successive year from 2015 to 2019. Annual reports from national health systems documented this pattern consistently; it was attributed to demand growth rather than capacity erosion.

§ 04

Early Warning Signal Assessment

The five WP-004 early warning signals are assessed across both cases. Assessment uses only pre-cutoff evidence.

Signal Name Cross-case evidence Energy Health

S-1

Recovery Delay Drift

Both cases document progressive elongation of system recovery from equivalent disruption events across the observation period. TTF recovery durations; elective care backlog clearance times.

PRESENT

S-2

Buffer Erosion Without Replacement

Storage buffers and hospital bed reserves both declined while being simultaneously used at higher utilisation rates. Reserve capacity reclassified as operational capacity in both domains.

PRESENT

S-3

Suppression of Weak Signals

Pre-event reports in both domains documented deteriorating conditions and framed findings within efficiency or demand-management language. Resilience implications present in data were not reflected in institutional responses.

PRESENT

S-4

Local Optimisation Proliferation

Energy market liberalisation and spot-market efficiency; hospital lean management programmes. Both produced local efficiency gains with aggregate resilience costs that were not assessed system-wide.

PRESENT

S-5

Decision Irreversibility Accumulation

Energy: LNG infrastructure and diversification investments deferred; spot-market lock-in. Health: hospital closures and consolidated care models structurally difficult to reverse on short timescales.

PARTIAL

PRESENT

§ 05

Calibration Verdict

CR-1 Locked Result

Two positive replications across independent domains

Gradient hypothesisConfirmed in both cases. dΩ/dt negative and detectable from pre-event public data in all observed systems.

Signal thresholdMet or exceeded in all 7 system observations (1 energy, 6 healthcare countries). Minimum 4/5 signals present in all cases.

Cross-domain replicationThree-proxy structure produces consistent directional results across two structurally unrelated domains using different data sources.

Proxy operationalisationDomain-specific proxies required in each case. No single universal proxy. The three-variable structure is domain-invariant; its operationalisation is not.

Framework status changeFrom single-case concept to cross-domain replicated diagnostic pattern. Not yet theory. Method candidate.

What Changed After Two Calibrations

Single-domain observation → Cross-domain replication

Concept requiring testing → Method candidate requiring falsification

Exposure events as causes → Exposure events as detectors of prior deficit

RP-3 as optional next step → RP-3 as methodological obligation

§ 06

The Structural Finding Across Both Cases

Both calibration cases share a finding that is not about the specific domains but about the diagnostic structure itself. In both cases, the exposure event — the 2022 gas disruption and the COVID pandemic stress — was widely described at the time as an unexpected shock. In both cases, the pre-event data shows a system whose recovery capacity had been declining for multiple years before the shock occurred.

The exposure event did not create the vulnerability. It revealed a vulnerability that was already structurally present and measurable. The shock was the test, not the cause.

This distinction has methodological implications beyond these two cases. If the pattern is general — if exposure events are detectors of prior recovery capacity deficit rather than causes of it — then the relevant diagnostic question shifts from post-event analysis ("what caused this?") to pre-event observation ("is the gradient negative?"). This is the operational implication of the gradient hypothesis, and it is what makes RP-3 the methodologically decisive next test.

RP-3 requires identifying a system in which the gradient was negative and the signals were present, but the exposure event either did not occur or did not produce systemic failure. If such cases exist and the framework cannot distinguish them from RP-1 and RP-2, the framework produces false positives and its diagnostic utility is limited. If the framework can identify structural differences between RP-1/RP-2 cases and RP-3 cases, it has earned a stronger evidential claim.

§ 07

Cumulative Limitations

The limitations of individual cases documented in Appendix A apply here. Three cumulative limitations specific to the two-case comparison are added.

CL-1

Both cases involve major exposure events. The two calibration cases were selected in part because they involved well-documented system stress events. This introduces selection bias: cases where the framework would produce a negative result (declining gradient, no subsequent failure) are not represented. RP-3 addresses this directly.

CL-2

Both cases are European, same period. Shared geopolitical and economic context across the two domains cannot be excluded as a confounding factor. Replication in non-European contexts and different historical periods is required before geographic generalisability is claimed.

CL-3

Positive replications do not validate the framework. Two cases consistent with the framework's predictions confirm that it does not break on contact with evidence. They do not confirm that the framework is correct, complete, or superior to alternative diagnostic approaches. Validation requires a substantially larger and more diverse case set including cases designed to challenge the framework.

§ 08

RP-3 Pre-Registration

RP-3 is the false positive test: a system in which the WP-004 diagnostic signals were present and the gradient was negative, but systemic failure did not follow. This report establishes the pre-registration conditions for RP-3 before case selection occurs.

RP-3 Pre-Registered Criteria

False Positive Test — Conditions Locked Prior to Case Selection

Case selection requirement: The case must involve a system in which at least three of five WP-004 early warning signals were present during the observation period, and in which the three proxy variables showed negative gradient. The system must not have experienced catastrophic failure during or immediately after the observation period.

Candidate domains: Financial systems that passed stress tests during period of balance sheet deterioration. Infrastructure systems that maintained function through extended pressure periods. Organisations that recovered from declining trajectory without failure event.

Success condition for RP-3: The framework identifies structural differences between RP-3 cases and RP-1/RP-2 cases — differences that would, in principle, have been visible pre-event. If no such differences are identified, the framework is producing false positives at the observed signal thresholds.

Failure condition for RP-3: The framework cannot distinguish RP-3 cases from RP-1/RP-2 cases using pre-event data. This would require revision of signal thresholds, proxy operationalisation, or the gradient hypothesis itself.

Both outcomes are scientifically productive. The failure condition is not a failure of the research programme — it is the programme working as designed.

RP-3 Result — Pre-registered condition met

The RP-3 false positive test has been completed. The Nordic electricity system 2010–2018 was selected as a system under sustained stress that did not fail. Blind cutoff: 31 December 2018. The framework produced sign(dΩ/dt) = stable, 1/5 signals partial, Stable/Concern zone — consistent with the known outcome. No false positive. The pre-registered success condition is met.

WP-004 RP-3 — Nordic Electricity System False Positive Test →

§ 09

Recovery Capacity Diagnostic Calibration: European Energy and Healthcare Systems

Calibration Protocol

Case Summaries

Proxy Gradient Results

3.1 Variation

3.2 Redundancy

3.3 Recovery Time

Early Warning Signal Assessment

Calibration Verdict

The Structural Finding Across Both Cases

Cumulative Limitations

RP-3 Pre-Registration

Related Documents