MUSK: Stanford AI Model Achieves Breakthrough in Cancer Prediction
Stanford Medicine has developed a groundbreaking artificial intelligence (AI) tool, named MUSK, that integrates medical images and text data to significantly improve cancer prognosis prediction and treatment response assessment. Unlike previous AI models that struggled to incorporate diverse data types, MUSK excels at combining visual information (such as microscopic images, X-rays, CT, and MRI scans) with textual information (including exam notes and physician communications). This advancement marks a significant departure from the current applications of AI in clinical care, offering a more holistic and accurate approach to patient care guidance.
MUSK was trained on an extensive dataset of 50 million medical images and over 1 billion pathology-related texts. The results showed that MUSK outperformed standard methods in predicting prognoses for a wide array of cancer types, identifying patients with lung or gastroesophageal cancers likely to benefit from immunotherapy, and predicting melanoma recurrence. The model’s superior performance stems from its ability to utilize “unpaired multimodal data” – a type of data previously underutilized in AI training due to challenges in creating paired datasets. This means that the AI can learn from significantly larger datasets and fine-tune itself for specific clinical tasks, essentially acting as an "off-the-shelf" tool for doctors.
According to Dr. Ruijiang Li, a senior author of the study published in Nature, MUSK was developed to reflect real-world clinical practice, where doctors leverage multiple data sources for informed decision-making. Unlike existing diagnostic-focused AI tools, MUSK prioritizes prognosis and treatment response. It was tested using data from The Cancer Genome Atlas, encompassing 16 major cancer types. MUSK achieved 75% accuracy in predicting disease-specific survival, compared to 64% for standard methods. Furthermore, MUSK correctly predicted immunotherapy response in non-small cell lung cancer cases 77% of the time, whereas the conventional PD-L1 expression method achieved only 61% accuracy. Similarly, MUSK showed a 12% accuracy improvement over other models in predicting melanoma recurrence.
The success of MUSK underscores the power of integrating multimodal data in AI for medical applications. By leveraging vast, unpaired datasets and combining them with traditional medical data, AI can now provide far more precise and personalized predictions for cancer patients. This innovation promises to transform how clinicians use AI, shifting from mere diagnostics to more nuanced prognostic assessments, ultimately enhancing treatment strategies and patient outcomes. This research also involved contributors from Harvard Medical School and was funded by the National Institutes of Health and the Stanford Institute for Human-Centered Artificial Intelligence.