
Three Rules to Find Them All - Clinical Risk Scores in HFpEF
- | By Ultromics
A. Akerman1, W. Hawkes1, N. Al-Roub2, C. Scott3, C. Angell-James2, H. Piotrowska1, P. Leeson4, G. Woodward1, J. Strom2, P. Pellikka3, R. Upton1
¹ Ultromics Ltd, Oxford, UK, ² Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, US, ³ Mayo Clinic, Rochester, US, ⁴ University of Oxford, Oxford, UK
Purpose
Multiple scoring rules and clinical decision aids exist to support the identification of heart failure with preserved ejection fraction (HFpEF), but it remains frequently unrecognized or misdiagnosed. This study aimed to validate multi-site diagnostic performance of clinical scores, both independently and in combination, for identifying HFpEF.
Methods
Independent patients undergoing clinically indicated echocardiograms at Mayo Clinic (1) and Beth Israel Deaconess Medical Center (2) were retrospectively identified. Risk of HFpEF was assessed according to three validated algorithms; the H2FPEF and HFA-PEFF scores, and EchoGo® Heart Failure (Ultromics). The H2FPEF score and HFA-PEFF score are multiparametric clinical models, and EchoGo® Heart Failure is an AI computer vision model using a single echocardiographic video input. The continuous outputs from the H2FPEF score and EchoGo® Heart Failure were combined with the HFA-PEFF categorical score (logistic regression) to provide a unique prediction (”Three Scores”). Discrimination, calibration, classification, and clinical utility were assessed.
Results
Compared with patients without HFpEF (n=886), patients with HFpEF (n=894) were slightly older (73 vs. 68 y), had more comorbidities, and more pronounced cardiac dysfunction. The AI model and H2FPEF continuous score demonstrated high discrimination (AUROC; Figure 1), and similar calibration (Figures 2-4), both of which were improved when combining all three scores. EchoGo® Heart Failure categorised 50.2% of patients as high likelihood of HFpEF, 40.2% as low likelihood of HFpEF, and 9.6% of patients as intermediate (Table 2). The H2FPEF score and HFA-PEFF score categorised 27.8% and 22.4% patients as high likelihood of HFpEF, 11.5% and 23.1% as low likelihood of HFpEF, and 60.8% and 54.5% of patients as intermediate (respectively; Table 2). The Three Scores combined demonstrated high sensitivity (91%) and specificity (82%). At a decision threshold probability of 30%, managing patients based on EchoGo® Heart Failure output resulted in 27% more correct decisions than H2FPEF score, but the combined Three Scores increased correct decisions by a further 12% (Central Figure).
Utilizing all available information from clinical risk scores and decision support aids could increase utility in the management of HFpEF
AI can support traditional methods in the diagnosis of heterogenous and often missed/misdiagnosed cardiomyopathies
Conclusion
The integration of existing clinical scores and AI models may be the most valuable approach to diagnosing heart failure with preserved ejection fraction.