Multi-Omics Data Fusion Engine: Data Fusion and Multi-Modal Analysis Enhancing CRC Prevention in Europe
The ONCODIR project is dedicated to advancing Colorectal Cancer (CRC) prevention through evidence-based decision-making that supports both healthcare professionals, policymakers and citizens. By providing individuals with personalized recommendations and empowering policymakers with actionable insights, ONCODIR aims to drive impactful CRC prevention strategies across Europe.
DATA FUSION TOOL: ADDRESSING THE COMPLEXITY OF CRC
CRC is a complex, multidimensional disease influenced by a range of risk factors including lifestyle, nutrition, comorbidities, environmental influences, and socioeconomic factors. Addressing this complexity requires a comprehensive, multimodal analysis that accounts for diverse datasets and analytical perspectives. By integrating diverse datasets, including retrospective data at the country level and prospective data at the individual level, ONCODIR offers a unified, data-driven approach that not only supports the reduction of individual cancer risks but also enables the development of more effective, evidence-based prevention and intervention strategies. ONCODIR achieved this through the advanced Multi-Omics Data Fusion Engine (Data Fusion Tool), an innovative tool designed to merge and analyse various types of datasets. By providing enriched information of greater quality, the Data Fusion Tool offers three core capabilities-data fusion, subgroup analysis and omics analysis-designed to generate actionable insights for CRC prevention (Figure).
The Data Fusion Tool integrates large-scale, heterogeneous datasets from multiple sources, to prepare them for comprehensive analysis. This includes both retrospective at the country level data, primarily open-sourced, such as CRC incidence, risk factors, and cost-related data, and prospective at the individual level data, collected through the ONCODIR project. This approach promotes the accessibility and reuse of these curated datasets to be analyzed within the Data Fusion Tool itself or forwarded to other ONCODIR’s tools for further exploration. This comprehensive data fusion provides a deep, holistic understanding of CRC’s causes and risk factors. By standardizing and preparing these diverse datasets for analysis, the tool derives insights that support both personalized recommendations and impactful policy decisions.
SUBGROUP ANALYSIS
The Data Fusion Tool not only facilitates data fusion but also supports in-depth statistical and predictive analyses of diverse datasets. By conducting subgroup analysis at the country level across various population sub-groups, such as age and sex, in European countries, the tool aims to identify high-risk populations and uncover patterns, trends, and associations among risk factors and CRC incidence. This analysis is crucial in shaping future prevention strategies, particularly as CRC incidence varies significantly across age groups (e.g., <24 yo, 24-50 yo, >50 yo) and between sexes. Subgroup analysis facilitates the development of targeted, age- and gender-sensitive prevention strategies, tailored to specific population needs. These insights enable healthcare professionals and policymakers to drive informed decisions that improve public health outcomes and reduce the overall incidence of CRC across Europe.
OMICS ANALYSIS
In addition to this epidemiological subgroup analysis, ONCODIR leverages multi-omics data, such as genomics and metabolomics, to deepen the understanding of CRC at the molecular level. By exploring patient-level omics data, through statistical and predictive analysis, the tool uncovers potential biomarkers linked to CRC. Specifically, focuses on microbiome data, exploring the relationship between microbiome composition and CRC development. This multi-omics approach provides a more comprehensive view of the biological processes driving CRC, opening new avenues for personalized prevention strategies and advancing clinical understanding of the disease.
In conclusion, the ONCODIR project is pioneering data-driven CRC prevention, leveraging both country-level and individual-level data to identify high-risk groups, uncover biomarkers, and enhance prevention strategies. With its advanced data fusion and analytical capabilities, ONCODIR provides healthcare professionals and policymakers with the necessary insights to reduce CRC incidence, advancing Europe toward a future with improved public health outcomes and stronger CRC prevention initiatives.
Data Fusion Tool Workflow: Data fusion and analysis of Country and Individual-Level Data to Support CRC Decision-Making
Authors:
Ioanna Sasilioglou, Christoniki Maga-Nteve, Thanassis Mavropoulos
Reference (Books, Online, etc)
- Rajesh et al. (2022), Global, regional, and national burden of colorectal cancer and its risk factors, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019, The Lancet Gastroenterology & Hepatology, Volume 7, Issue 7, 627 – 647
- Wang et al. (2024), Global incidence trends of early-onset colorectal cancer and related exposures in early-life: an ecological analysis based on the GBD 2019, Front Public Health, vol. 12, p. 1367818.
- Hu, Y. Li, W. Zhu, J. Liu, and S. Wei (2023), Global, region and national trends and age-period-cohort effects in colorectal cancer burden from 1990 to 2019, with predictions to 2039, Environmental Science and Pollution Research, vol. 30, no. 35, pp. 83245–83259, 2023.
- Novielli et al. (2024): Explainable artificial intelligence for microbiome data analysis in colorectal cancer biomarker identification, Front Microbiol, vol. 15, p. 1348974.
Keywords: CRC prevention, Data fusion, Subgroup analysis, Omics analysis, Personalized recommendations, Biomarkers, Colorectal cancer, Healthcare professionals, Policymakers, ONCODIR
Header picture rights: Designed by Freepik (www.freepik.com)