Artificial Intelligence in Oncology - Supporting scientific research
Radboudumc
In short, Marianne Jonkers' research entails the following:
'The prognosis of patients with rare cancers is worse than that of patients with common cancers. An important reason is that for rare cancers it is more difficult to identify potential treatments, because only small data sets will typically be available for research. When using such small data sets to identify predictive factors, the risk of overfitting in the models used is high. This limits severely our ability to infer statistically significant patterns in these data, which might have suggested novel treatments. Pooling data from different institutions could alleviate the situation, but is in practice challenging due to regulatory and logistical problems.
Two complementary routes for confronting the challenges of small data sets for rare cancers are proposed. The first is to focus on more powerful techniques for inference that are better able to cope with small sample sizes, without overfitting. The second route is to design and improve machine learning algorithms that circumvent the need for data pooling at one location by `cycling’ around medical institutes with small data sets (federated learning). Data on salivary gland cancer will be analyzed with the proposed methods.
The project is a collaboration between: dr. Marianne Jonker, prof. Ton Coolen, prof. Kit Roes and prof. Carla van Herpen, all at the Radboud university medical center or Radboud university.'
With the BFI methodology one can reconstruct from local inferences in separate data sets (centers) what would have been inferred had the data sets been merged. The methodology turns out to be very accurate and applicable in many situations. We have developed the methodology and software for multiple commonly used models (generalized linear models and time-to-event models), homogeneous and heterogeneous populations, and for prediction and association models. Data have been analyzed and the results are very good. The next step is to incorporate overfitting theory that overcomes overfitting in case of small datasets.
The first aim in this project is to develop and apply the theory of Bayesian Federated Inference (BFI) for parametric models and for models with a time-to-event outcome. We finished this for general parametric models and submitted our paper: Bayesian Federated Inference for Statistical Models (M.A. Jonker, H. Pazira, A.C.C. Coolen). The paper is still under review. The next step is to generalize the theory to the semi- parametric Cox PH model. We aim to finish this before the summer of 2023