Finnish health registries are among the most comprehensive in the world. The integration failure documented in WP-016 is not a data absence problem — it is an allocation problem. HDCI is a diagnostic instrument for identifying where integration failure is highest, enabling resource allocation decisions under conditions of fiscal constraint.
Pohjois-Savo is selected as the pilot welfare area for three structural reasons:
| Reason | Justification |
|---|---|
| Registry coverage | Kuopio University Hospital (KYS) serves as the tertiary referral centre for the region. THL and Kela registry coverage is comprehensive and documented. |
| Multi-morbidity profile | Pohjois-Savo has one of Finland's highest age-standardised morbidity indices (Sotkanet ind. 5642). Elevated multi-morbidity makes integration failure structurally more detectable. |
| Institutional readiness | Pohjois-Savo welfare area is operational. KYS has existing research infrastructure and THL collaboration pathways. |
All required data exists within Finnish health registries. Patient-level access requires formal secondary use licensing through Findata (the Health and Social Data Permit Authority).
| Dataset | Controller | HDCI use |
|---|---|---|
| THL Avohilmo | THL | DCI (expected entries), IAI (specialty contacts), PAI (preventive codes), RVI (finding timestamps) |
| THL Terveys-Hilmo | THL | IAI (care episodes), RVI (procedure timestamps), validation: preventable hospitalisations |
| Kela lääkekorvausrekisteri | Kela | RKI (prescription data, prescriber source) |
| Kanta care plan register | Kela / THL | RKI (shared care plan entry verification) |
| Sotkanet ind. 5642 | THL | DCI expected entry baseline — open data, no licence required |
| Step | Description | Estimated duration |
|---|---|---|
| 1. Pre-application consultation | Contact Findata to confirm dataset availability and application requirements | 2–4 weeks |
| 2. Research plan finalisation | Cohort definition, variables, linkage method, data protection protocol | Concurrent with step 1 |
| 3. Submit Findata application | Formal secondary use licence with THL and Kela as data controllers | 1–2 weeks preparation |
| 4. Findata processing | Statutory processing time | 2–3 months |
| 5. Data delivery | Secure research environment access (Kapseli or equivalent) | 1–2 months post-approval |
| Total estimated | 4–6 months |
Current status: Step 1 not yet initiated. This document is preparatory material for the pre-application consultation.
| Criterion | Definition |
|---|---|
| Residence | Pohjois-Savo welfare area municipalities |
| Multi-morbidity | ≥2 chronic conditions from THL chronic disease classification (diabetes, cardiovascular, respiratory, mental health, musculoskeletal, neurological) |
| Age | ≥18 years at index date |
| Observation period | 1 January 2024 – 31 December 2025 (24 months) |
| Criterion | Rationale |
|---|---|
| Palliative care (Z51.5) | Integration needs differ structurally; distinct care coordination protocols apply |
| Active oncology (C00–C97, treatment within 6 months) | Protocolised through oncology services; integration failure manifests differently |
| End-stage renal disease (N18.5–N18.6 with dialysis) | Care concentrated in nephrology; generalisable integration metrics less applicable |
| Long-term care facility (>90 days) | Data capture patterns differ; coordination occurs within facility |
Expected cohort size: 30,000–40,000 patients (based on ~250,000 population, ~15–20% adult multi-morbidity prevalence).
| Hypothesis | Outcome measure | Source |
|---|---|---|
| H1: Higher HDCI_v1 predicts lower preventable hospitalisation rate | Ambulatory care sensitive conditions / 1,000 patient-years (THL definition) | THL Terveys-Hilmo |
| H2: Higher RKI_adjusted predicts higher adverse medication event rate | Adverse medication events / 1,000 patient-years (ICD-10 Y40–Y59, T36–T50) | THL Terveys-Hilmo |
| H3: HDCI_v1 varies across municipalities within Pohjois-Savo | Municipality-level HDCI vs. morbidity index | Calculated HDCI |
| H4 (Topological): RKI–IAI–RVI form stable clusters in state space; the high-RKI/high-IAI/low-RVI cluster predicts elevated harm outcomes not captured by conventional metrics | Cluster membership (k-means / latent class analysis) vs H1 and H2 outcomes AND vs conventional metrics (cost per patient, queue length, morbidity index). H4 predicts: cluster 3 elevated on H1/H2 but not distinguishable from cluster 2 on conventional metrics. | THL Terveys-Hilmo · Kela · Calculated HDCI |
H4 is the primary research contribution of this pilot. If supported, it demonstrates that HDCI reveals a high-risk patient state that conventional metrics systematically miss. If unsupported, it constrains the interpretation of HDCI to a measurement instrument without topological properties — which remains valid and useful.
Statistical methods: H1–H3: multilevel regression, patient-level predictors, outcome measures as dependent variables, controlling for age, sex, morbidity index. H4: k-means clustering (k=3) and latent class analysis in RKI–IAI–RVI space; cluster outcome comparison via ANOVA and logistic regression.
DT-006 pilot results will populate a conceptual dashboard structurally analogous to the Winter Endurance Monitor (WEM). The Health Integration Monitor (HIM) is a design target for post-pilot development.
| WEM component | HIM equivalent |
|---|---|
| EPP — system endurance pressure | HDCI composite — integration pressure index |
| SP — stress persistence fraction | RKI — polypharmacy risk fraction |
| FS(p) — probabilistic firm capacity | IAI — integration achievement fraction |
| NVE hydro_RF — external buffer | Morbidity index — external demand pressure |
| TRR — transmission realisation rate | RVI — response velocity (institutional throughput) |
The following validity threats are identified in advance of data access. Each is addressed in the pilot design. This section is a prerequisite for a credible Findata application.
DCI has a structural endogeneity problem: morbidity drives care contacts, which drive Kanta entries — the same causal structure produces both predictor and outcome. Without correction, DCI risks measuring documentation intensity rather than data capture quality. The pilot adds Avohilmo contact rate as an exogenous instrument:
This separates documentation completeness (Kanta entries per contact) from care frequency (contacts per patient). Negative binomial regression; sensitivity vs Poisson and stratum-mean.
IAI is a structural proxy for integration, not a clinically invariant measure. The 30-day episode window is a heuristic that treats all care episodes identically — false in clinical reality across diabetes, orthopaedics, and psychiatry. IAI measures whether the system produces multi-specialty contact within a defined window: a system-level observation, not a patient-level truth claim. Sensitivity analysis: 15, 30, 90-day windows. Unstable rankings require disease-stratified analysis.
RVI measures institutional response velocity — how quickly the system converts a recorded finding into a recorded action. It does not measure clinical appropriateness. "First finding" is not neutral: it depends on testing frequency and coding practices. RVI is a system-level throughput indicator; this scope must be stated explicitly in all publications.
RKI is the strongest HDCI component. v0.6 introduces an explicit adjusted measure:
RKI_adjusted isolates coordination failure from documentation infrastructure variation. Composite HDCI uses RKI_adjusted. Both values reported. Cross-validated against H2 (adverse medication events).
RKI: 0.20 → 0.25 (strongest component, direct outcome validation). PAI: 0.10 → 0.05 (input-effort indicator only; does not measure prevention effectiveness). All weights remain Bayesian priors subject to outcome-constrained re-estimation.
HDCI does not model "integration" as a unitary latent variable. Integration is multidimensional — information flow, responsibility coordination, and clinical decision consistency are distinct constructs. HDCI is a composite proxy index: a diagnostic signal about where integration-relevant observables are weakest, not a measurement of an underlying integration quantity.
EXOGENOUS
Morbidity · Age/Sex · Avohilmo contact rate · Kanta adoption rate
Morbidity + Age/Sex → Care Contacts ← Avohilmo rate (instrument)
Care Contacts → Kanta entries (DCI) · Multi-spec episodes (IAI)
→ First finding → First action (RVI)
Prescriber multiplicity + Kanta adoption → RKI_raw → RKI_adjusted
Preventive contacts / total → PAI
HDCI composite → H1 Preventable hospitalisations
RKI_adjusted → H2 Adverse medication events
HDCI components → H3 Inter-municipality variation
HDCI does not prescribe interventions. It reveals where the system's own coordination logic produces elevated risk without corresponding visibility in current metrics. This section describes what the RKI–IAI–RVI space shows about system behaviour — not what should be done about it.
The three-dimensional RKI–IAI–RVI space identifies:
HDCI does not tell the system what to do. It tells the system where its own coordination logic is producing the highest unobserved friction.
| Incorrect framing | Correct framing |
|---|---|
| "Group 3 is the target" | Transitions between groups are the signal |
| "Fix Group 3" | Prevent drift into Group 3 |
| Patient classification | System dynamics observation |
The diagnostic value of HDCI lies not in labelling patients but in identifying the transition zones where patients move from coordinated care into high-risk invisibility. Any interventions, if undertaken, address the drift mechanism — not individual patients.
1. RKI → IAI coupling — responsibility diffusion before fragmentation. When RKI rises without IAI decrease, medication responsibility diffuses across prescribers before the care pathway fragments into multiple uncoordinated specialties. The system accumulates polypharmacy risk silently. Minimal feedback: a Kela + Kanta medication consolidation view — not a new workflow, only increased visibility of prescriber multiplicity.
2. IAI → RVI coupling — coordination without decision. When IAI is high but RVI remains low, the system makes contact across multiple specialties, records findings, but does not convert findings into timely action. The system sees everything but does not react. Minimal feedback: organisational-level latency reporting — an escalation log, not automated decision support.
3. RKI × RVI risk zone — silent high-risk drift. When RKI is high and RVI is low simultaneously, responsibility is maximally diffused and response velocity is minimal. No current metric (cost, queue, morbidity) flags this state. Minimal feedback: a review flag — not a clinical decision, only a signal that the coordination profile warrants attention.
If HDCI is presented as "Group 3 requires intervention," it becomes a clinical decision model — which it is not — and loses standing as a measurement framework. Presented as "HDCI reveals structural friction zones that current metrics cannot see," it remains a diagnostic instrument. This distinction determines whether DT-006 remains a research design or becomes an unimplementable policy proposal.
| Periaate | Toteutus |
|---|---|
| Ei yksilötason julkista raportointia | Klusterit raportoidaan väestötasolla — kunnittain, alueittain. Yksittäistä potilasta ei luokitella julkisesti. |
| Ei "huono / hyvä" -arvottamista | Ryhmät kuvataan rakenteellisina tiloina: lineaarinen, hallittu kompleksisuus, hidas sirpaloituminen. |
| Ei suoria toimenpidesuosituksia | HDCI kertoo mitä näkyy, ei mitä pitäisi tehdä. Toimenpiteet ovat hyvinvointialueen ja THL:n päätöksiä. |
| Avoin metodologia | Kaikki laskentakaavat, klusterointimenetelmät ja validointitulokset julkaistaan. Läpinäkyvyys on hyväksyttävyyden tae. |
| Vertaisarvioitu julkaisu ennen operatiivista käyttöä | Metodologia ja ensimmäiset tulokset julkaistaan vertaisarvioidussa kanavassa ennen kuin HDCI:tä tarjotaan käytettäväksi ohjaustyökaluna. |
| Step | Target | Status |
|---|---|---|
| Pre-application consultation with Findata | Q2 2026 | Not initiated |
| Contact Pohjois-Savo welfare authority | Q2 2026 | Not initiated |
| Contact THL / Kela research units | Q2 2026 | Not initiated |
| Finalise research plan and data protection protocol | Q2 2026 | Not initiated |
| Submit Findata licence application | Q3 2026 | Not initiated |
| Data access established | Q4 2026 | Pending licence |
| Preliminary HDCI results for Pohjois-Savo | Q1–Q2 2027 | Pending data access |
| Version | Trigger |
|---|---|
| v0.7 (current) | Preparatory. Pre-licensing. |
| v0.7 | Licence application submitted. Research plan finalised. |
| v0.9 | Licence granted. Data access established. |
| v1.0 | Preliminary results. HDCI calculated for Pohjois-Savo cohort. |