NCT07414966|Not Yet RecruitingDMC

Scalable Clinical Oversight of Large Language Models Via Uncertainty Triangulation

SCOUT

Prospective Evaluation of a Model-Agnostic Meta-Verification Framework (SCOUT) for Scalable Clinical Oversight of Large Language Model Outputs in Coronary Heart Disease Diagnosis: A Multi-Reader, Randomized, Crossover Trial

China National Center for Cardiovascular Diseases ClinicalTrials.gov

Compare

1 other identifier

2025-2702-1

Study Type

interventional

Target

Locations

0 countries

Sites

N/A

Timeline

RegisteredFeb 2026

StartedFeb 2026

CompletionFeb 2026

Brief Summary

This prospective, multi-reader, randomized crossover trial evaluates SCOUT (Scalable Clinical Oversight via Uncertainty Triangulation), a model-agnostic meta-verification framework that selectively defers unreliable large language model (LLM) predictions to clinicians by triangulating three orthogonal uncertainty signals: model heterogeneity, stochastic inconsistency, and reasoning critique. The trial assesses whether SCOUT-assisted review can reduce physician review time compared with standard manual review of AI-generated diagnoses while maintaining non-inferior diagnostic accuracy in coronary heart disease (CHD) subtyping.

Trial Health

At Risk

Trial Health Score

Automated assessment based on enrollment pace, timeline, and geographic reach

Trial has exceeded expected completion date

Enrollment

participants targeted

Target at below P25 for not_applicable

Timeline

Completed

Started Feb 2026

Status

not yet recruiting

Health score is calculated from publicly available data and should be used for screening purposes only.

Trial Relationships

Click on a node to explore related trials.

Study Timeline

Key milestones and dates

9 days study duration

First Submitted

Initial submission to the registry

February 9, 2026

Completed

8 days until next milestone

First Posted

Study publicly available on registry

February 17, 2026

Completed

2 days until next milestone

Study Start

First participant enrolled

February 19, 2026

Completed

9 days until next milestone

Primary Completion

Last participant's last visit for primary outcome

February 28, 2026

Completed

Same day until next milestone

Study Completion

Last participant's last visit for all outcomes

February 28, 2026

Completed

Last Updated

February 17, 2026

Status Verified

February 1, 2026

Enrollment Period

9 days

First QC Date

February 9, 2026

Last Update Submit

February 14, 2026

Conditions

Coronary Heart Disease (CHD)

Keywords

artificial intelligence

Outcome Measures

Primary Outcomes (1)

Mean physician review time per case (minutes)
Mean time spent by each clinician reviewing and rendering a diagnostic decision per case under each arm. Measured in minutes.
Through study completion, an average of 2 hours.

Secondary Outcomes (2)

Diagnostic accuracy (%)
Through study completion, an average of 2 hours.
Computational Return on Investment (ROI)
Through study completion, an average of 2 hours.

Study Arms (2)

Control (Standard Manual Review)

ACTIVE COMPARATOR

Physicians manually review all cases in the control set (n=54) with access to AI predictions and reasoning. No selective deferral.

Diagnostic Test: Standard Manual Review Workflow

Experimental (SCOUT-Assisted Review)

EXPERIMENTAL

Physicians process the intervention set (n=56) through the SCOUT framework. Low-uncertainty cases are auto-accepted; high-uncertainty cases undergo physician review with full audit trail.

Diagnostic Test: SCOUT-Assisted Review Workflow

Interventions

SCOUT-Assisted Review WorkflowDIAGNOSTIC_TEST

SCOUT-Assisted Review (Intervention Arm): Physicians review 56 cases processed through the SCOUT framework. For cases classified as low-uncertainty (D(x)=0), the AI prediction is auto-accepted without physician review. For high-uncertainty cases (D(x)=1), the physician reviews the case with access to the main model's chain-of-thought reasoning and the meta-verification audit results. The main model is DeepSeek-V3.1 with chain-of-thought prompting.

Experimental (SCOUT-Assisted Review)

Standard Manual Review WorkflowDIAGNOSTIC_TEST

Physicians perform a full manual review of 54 cases using raw medical records with access to the AI model's predictions and reasoning, but without SCOUT uncertainty stratification or selective deferral.

Control (Standard Manual Review)

Eligibility Criteria

Age18 Years+

Sexall

Healthy VolunteersNo

Age GroupsAdult (18-64), Older Adult (65+)

You may qualify if:

Board-certified or in-training cardiologists at Fuwai Hospital
Spanning three experience strata: junior residents, senior residents, attending physicians

You may not qualify if:

Clinicians involved in the development or optimization of the SCOUT framework
Clinicians involved in the gold-standard adjudication process

Contact the study team to confirm eligibility.

Sponsors & Collaborators

China National Center for Cardiovascular Diseaseslead

MeSH Terms

Conditions

Coronary Disease

Condition Hierarchy (Ancestors)

Myocardial IschemiaHeart DiseasesCardiovascular DiseasesVascular Diseases

Central Study Contacts

Xiaojin Gao, Dr.

CONTACT

+86 010 88322415 sophie_gao@sina.com

Study Design

Study Type: interventional
Phase: not applicable
Allocation: RANDOMIZED
Masking: NONE
Purpose: DIAGNOSTIC
Intervention Model: CROSSOVER
Sponsor Type: OTHER GOV
Responsible Party: SPONSOR

Study Record Dates

First Submitted

February 9, 2026

First Posted

February 17, 2026

Study Start

February 19, 2026

Primary Completion

February 28, 2026

Study Completion

February 28, 2026

Last Updated

February 17, 2026

Record last verified: 2026-02

Data Sharing

IPD Sharing: Will share
Shared Documents: STUDY PROTOCOL, SAP, ICF, CSR, ANALYTIC CODE
Time Frame: Beginning 1 months after publication of the primary results and available for up to 60 months.
Access Criteria: Data are available from the corresponding author upon reasonable request. Requestors will need to provide a methodologically sound research proposal and sign a data use agreement.

Brief Summary

Trial Health

Trial Health Score

Trial Relationships

Related Scientific Literature

Study Timeline

First Submitted

First Posted

Study Start

Primary Completion

Study Completion

Conditions

Keywords

Outcome Measures

Primary Outcomes (1)

Secondary Outcomes (2)

Study Arms (2)

Control (Standard Manual Review)

Experimental (SCOUT-Assisted Review)

Interventions

Eligibility Criteria

You may qualify if:

You may not qualify if:

Sponsors & Collaborators

MeSH Terms

Conditions

Condition Hierarchy (Ancestors)

Central Study Contacts

Study Design

Study Record Dates

Data Sharing