Colorectal Cancer Survival Predicted by AI Using Clinical and Molecular Features
Dec 17, 2025
Discover how cutting-edge machine learning is revolutionizing survival predictions for colorectal cancer. A team of researchers analyzed data from over 500 patients, combining clinical factors with molecular features like gene expression. The standout adaptive boosting model achieved an impressive accuracy of 89.58%. Key biological markers such as E2F8, WDR77, and hsa-miR-495-3p are highlighted for their crucial role in tumor growth. This innovative approach aims to refine risk assessment and personalize treatment strategies for high-risk patients.
03:23
forum Ask episode
web_stories AI Snips
view_agenda Chapters
auto_awesome Transcript
info_circle Episode notes
insights INSIGHT
Integrating Clinical And Molecular Data Works
Combining clinical and molecular data improves colorectal cancer survival prediction accuracy compared to using either alone.
The study achieved up to 89.58% accuracy with an adaptive boosting model on TCGA patient data.
insights INSIGHT
Large Multi-Modal TCGA Cohort Used
The team analyzed over 500 TCGA colorectal cancer patients combining age, chemo status, stage, gene expression, and microRNAs.
Using multi-modal inputs enabled more precise identification of high-risk patients.
insights INSIGHT
Clinical Predictors Remain Crucial
Clinical predictors such as cancer stage, age, lymph node status, and chemotherapy status were key drivers of survival predictions.
Combining these with molecular features helped better identify high-risk patients.
Get the Snipd Podcast app to discover more snips from this episode
BUFFALO, NY - December 17, 2025 – A new #research paper was #published in Oncotarget (Volume 16) on December 15, 2025, titled “Machine learning-based survival prediction in colorectal cancer combining clinical and biological features.”
In this study, led by Lucas M. Vieira from the University of Brasília and the University of California San Diego, researchers used machine learning to predict survival in patients with colorectal cancer. They built a model by combining biological markers with clinical data. This approach could help improve prognosis and guide treatment strategies for one of the world’s most common and deadly cancers.
The team analyzed data from over 500 patients, using clinical details such as age, chemotherapy status, and cancer stage, along with molecular features like gene expression and microRNAs. Their goal was to improve how clinicians identify high-risk patients and make outcome predictions more precise. Researchers evaluated three different patient data scenarios using different machine learning techniques. The best-performing was an adaptive boosting model, which achieved 89.58% accuracy. This approach showed that integrating clinical and biological data led to significantly better predictions than using either data type alone.
Among the biological markers, the gene E2F8 was consistently influential in all patient groups and is known to play a role in tumor growth. Other important markers included WDR77 and hsa-miR-495-3p, which are also associated with cancer development. Key clinical predictors included cancer stage, patient age, lymph node involvement, and whether chemotherapy was administered.
“The proposed method combines biological and clinical features to predict patient survival, using as input data from patients from the United States, available in the TCGA database.”
Unlike earlier models that relied on either clinical or molecular data alone, this study demonstrates the added value of combining both. Ensemble methods, which merge multiple learning algorithms, provided more stable and consistent results across all patient groups tested.
These research findings could lead to new tools that help clinicians better predict how a patient's disease might progress or respond to treatment. The study also highlights the importance of collecting complete clinical information, such as lifestyle factors, which were missing from the dataset but could enhance future predictions.
Overall, the study demonstrated how machine learning can support more accurate and personalized survival predictions in colorectal cancer. It also points to potential future research on markers like E2F8, which may be useful for monitoring or targeted therapy.
DOI - https://doi.org/10.18632/oncotarget.28783
Correspondence to - Lucas M. Vieira - lvieira@health.ucsd.edu
Abstract video - https://www.youtube.com/watch?v=cy7UL5ZUKuI
Sign up for free Altmetric alerts about this article - https://oncotarget.altmetric.com/details/email_updates?id=10.18632%2Foncotarget.28783
Subscribe for free publication alerts from Oncotarget - https://www.oncotarget.com/subscribe/
Keywords - cancer, colorectal cancer, machine learning, feature selection, non-coding RNAs, genes
To learn more about Oncotarget, please visit https://www.oncotarget.com and connect with us on social media:
Facebook - https://www.facebook.com/Oncotarget/
X - https://twitter.com/oncotarget
Instagram - https://www.instagram.com/oncotargetjrnl/
YouTube - https://www.youtube.com/@OncotargetJournal
LinkedIn - https://www.linkedin.com/company/oncotarget
Pinterest - https://www.pinterest.com/oncotarget/
Reddit - https://www.reddit.com/user/Oncotarget/
Spotify - https://open.spotify.com/show/0gRwT6BqYWJzxzmjPJwtVh
MEDIA@IMPACTJOURNALS.COM