Synthetic Generation of Hematological Data Over Federated Computing Frameworks: SCD Use Case
SYNTHEMA (SCD)
1 other identifier
observational
1,500
3 countries
3
Brief Summary
Haematological diseases (HDs) are a large group of disorders resulting from quantitative or qualitative abnormalities of blood cells, lymphoid organs and coagulation factors. Despite most of them (\~74%) are rare, the overall number of HD affected patients worldwide is important, placing a considerable economic burden on healthcare systems and societies. Despite the existence of several collaborative research groups at national and EU level, current clinical approaches are often ineffective, particularly for rarest conditions, due to the relatively low number of patients per disease and the high number of unconnected clinical entities. SYNTHEMA aims to establish a cross-border data hub where to develop and validate innovative AI-based techniques for clinical data anonymisation and synthetic data generation (SDG), to tackle the scarcity and fragmentation of data and widen the basis for GDPR-compliant research in rare hematological disorders (RHD). The project will focus on one representative RHD use case: sickle-cell disease (SCD).
Trial Health
Trial Health Score
Automated assessment based on enrollment pace, timeline, and geographic reach
participants targeted
Target at P75+ for all trials
Started Nov 2022
Longer than P75 for all trials
3 active sites
Health score is calculated from publicly available data and should be used for screening purposes only.
Trial Relationships
Click on a node to explore related trials.
Study Timeline
Key milestones and dates
Study Start
First participant enrolled
November 1, 2022
CompletedFirst Submitted
Initial submission to the registry
January 21, 2025
CompletedFirst Posted
Study publicly available on registry
January 27, 2025
CompletedPrimary Completion
Last participant's last visit for primary outcome
November 30, 2026
ExpectedStudy Completion
Last participant's last visit for all outcomes
November 30, 2026
March 12, 2025
March 1, 2025
4.1 years
January 21, 2025
March 11, 2025
Conditions
Outcome Measures
Primary Outcomes (1)
Generate synthetic multimodal (clinical, omics and imaging) data for rare haematological diseases with a validated clinical result
For Sickle Cell Disease, validation scenarios will test the reliability of synthetic data in regards to genomic variants/disease phenotypes association and MRI feature-based prediction of brain vascular events (SCD).
November 2026
Interventions
O1. Provide novel methods and capabilities to generate synthetic multimodal clinical, omics and imaging data for SCD with a validated clinical result. O2. Develop de-identification, minimisation and anonymisation pipelines, including automatic assessment of privacy levels, at the service of clinical research and care. O3. Consolidate and scale-up the use of FL applications, SMPC and DP solutions for privacy-preserving local algorithm training and global model aggregation. O4. Ensure ethical and GDPR compliance in anonymised and synthetic data-driven research in RHDs. O5. Ensure wide uptake and scalability of the developed methodologies and tools through effective stakeholder engagement, dissemination and open science practices.
Eligibility Criteria
SYNTHEMA will retrospectively collect existing RHD clinical, omics and imaging datasets from all the health data centres of its consortium (VHIR, UMCU, GLSMED LH, UNIPD) for the target SCD clinical use cases.
You may qualify if:
- SCD patients (any genotype).
- older than 1 year old
You may not qualify if:
- younger than 1 year old
- post HSCT patients
Contact the study team to confirm eligibility.
Sponsors & Collaborators
- Hospital Universitari Vall d'Hebron Research Institutelead
- Universidad Politecnica de Madridcollaborator
- Datawizard SRLcollaborator
- University of Southamptoncollaborator
- Humanitas Mirasole SpAcollaborator
- Charite University, Berlin, Germanycollaborator
- Universita degli Studi di Padovacollaborator
- Assistance Publique - Hôpitaux de Pariscollaborator
- Vicomtechcollaborator
- GLSMED Learning Health S.A.collaborator
- UMC Utrechtcollaborator
- Intrasoftcollaborator
Study Sites (3)
Azienda Ospedale Università Padova
Padua, Italy
UMC Utrecht
Utrecht, Netherlands
Vall Hebron Institut de Recerca
Barcelona, Barcelona, 08035, Spain
Related Links
MeSH Terms
Conditions
Interventions
Condition Hierarchy (Ancestors)
Intervention Hierarchy (Ancestors)
Study Design
- Study Type
- observational
- Observational Model
- COHORT
- Time Perspective
- CROSS SECTIONAL
- Sponsor Type
- OTHER
- Responsible Party
- SPONSOR
Study Record Dates
First Submitted
January 21, 2025
First Posted
January 27, 2025
Study Start
November 1, 2022
Primary Completion (Estimated)
November 30, 2026
Study Completion (Estimated)
November 30, 2026
Last Updated
March 12, 2025
Record last verified: 2025-03
Data Sharing
- IPD Sharing
- Will not share