All content on this site is intended for healthcare professionals only. By acknowledging this message and accessing the information on this website you are confirming that you are a Healthcare Professional. If you are a patient or carer, please visit the MPN Advocates Network.

The MPN Hub uses cookies on this website. They help us give you the best online experience. By continuing to use our website without changing your cookie settings, you agree to our use of cookies in accordance with our updated Cookie Policy

Introducing

Now you can personalise
your MPN Hub experience!

Bookmark content to read later

Select your specific areas of interest

View content recommended for you

Find out more
  TRANSLATE

The MPN Hub website uses a third-party service provided by Google that dynamically translates web content. Translations are machine generated, so may not be an exact or complete translation, and the MPN Hub cannot guarantee the accuracy of translated content. The MPN Hub and its employees will not be liable for any direct, indirect, or consequential damages (even if foreseeable) resulting from use of the Google Translate feature. For further support with Google Translate, visit Google Translate Help.

Steering CommitteeAbout UsNewsletterContact
LOADING
You're logged in! Click here any time to manage your account or log out.
LOADING
You're logged in! Click here any time to manage your account or log out.

The MPN Hub is an independent medical education platform, sponsored by AOP Health and GSK, and supported through an educational grant from Bristol Myers Squibb. The funders are allowed no direct influence on our content. The levels of sponsorship listed are reflective of the amount of funding given. View funders.

2023-11-30T11:38:10.000Z

Utilizing machine learning to predict thrombosis in polycythemia vera

Nov 30, 2023
Share:
Learning objective: After reading this, learners will be able to cite new developments in machine learning for the prediction of thrombosis in polycythemia vera.

Bookmark this article

Test your knowledge! Take our quick quiz before and after you read this article to find out if you improved your knowledge. Results help us to improve content and continually provide open-access education.

Thrombosis remains one of the leading causes of morbidity and mortality in patients diagnosed with polycythemia vera (PV). As a result, there is a growing need for precise models to identify patients at the highest risk of thrombotic events. Artificial intelligence (AI) offers a potential solution in the form of sensitive, precise, and dynamic prediction models that can be used for risk prediction during clinical trials. The use of AI in the diagnosis of myeloproliferative neoplasms (MPN) has been previously reported by Royston in our recent MPN Hub Steering Committee meeting.

During the 15th International Congress on MPN, Abu-Zeinah gave a presentation on the use of AI and machine learning to predict thrombotic events in patients diagnosed with PV. Here, we summarize the key points.

The need to improve thrombotic prediction

  • Patients who are both <60 years and >60 years old have double the cardiovascular mortality risk compared with the general population
  • Over 3–5 years, the overall mortality risk for patients with PV is 10%
    • - A third of this risk is due to thrombotic complications
  • For the feasibility of randomized clinical trials that investigate thrombotic risk, around 200 patients are required, together with a model that provides a predictive value of ≥20%
  • A unifying diagnostic model is therefore urgently needed
    • Both the positive predictive value and sensitivity would need to be 75%

A unifying diagnostic model

  • The proposed model should be dynamic and be able to predict short-term risk at any clinical visit
  • The advantages of a machine learning model include:
    • Automated feature selection
      • Handle large data sets and identify new variables and patterns
    • Improved accuracy
    • Personalized and real-time assessment
      • Ability to learn and adapt to changing patient data
      • Individualized predictions

Results from a large PV cohort with longitudinal data

  • A recent study of 470 patients accrued several variables:
    • 8,500 clinic visits;
    • 1.4 million data elements; and
    • >2,100 parameters.
  • Results showed thrombosis was non-linear
    • There was an increased incidence of 4.4% in the first 2 years vs 1% in subsequent years.
    • This was similar for recurrent thrombotic events, whereby incidence in the first 2 years was 9.7% and incidence in subsequent years was 1.8%.

The machine learning approach

  • The proposed prediction model identified several important variables, including:
    • time since diagnosis;
    • age;
    • lactate dehydrogenase levels at last check;
    • hematocrit levels; and
    • blood type.
  • Important variables identified by the model that have historically been underappreciated in PV research included:
    • time since last thrombotic event;
    • body mass index; and
    • creatine levels.
  • The model accrued a total of 292 variables and performed well when assessed. It was able to identify patients who were likely to have thrombosis within 1 year at every clinical instance.
  • Sensitivity was calculated to be 85%, with 98% specificity and 97% positive predictive value.
  • Unfortunately, a model with 300 variables is feasible. As a result over 300,000 models were trained with the top 3–9 variables selected.
  • There was high consistency among the variables selected, which included:
    • age;
    • time since diagnosis;
    • blood type;
    • JAK2 percentage; and
    • body mass index.
  • When analyzed, several of the trained models demonstrated the feasibility of a 1-year 1:1 randomized clinical trial investigating the reduction of thrombosis by 50% in 200 patients, with all models indicating an efficient accrual period of 2 years.

Conclusion

A precise prediction model is essential for patient selection in clinical trials, requiring high sensitivity and positive predictive value. Leveraging machine learning enables individualized predictions and the identification of unknown patterns in large datasets. The proposed model should undergo validation using prospective and external data sets.

  1. Abu-Zeinah G. Using artificial intelligence for predicting thrombosis in PV. Oral abstract #2.1. 15th International Congress on MPN; Nov 2, 2023; New York, US.

Newsletter

Subscribe to get the best content related to MPN delivered to your inbox