NCT06627985

Brief Summary

This retrospective clinical trial aims to better explore the potential of large language models in medicine by comparing the effectiveness of MDT consultations conducted by human doctors with those conducted by large language models. The main questions to be addressed are: Does using large language models to conduct anthropomorphic MDT consultations yield better results than using non-anthropomorphic processes? Is there a significant performance gap between MDT consultations conducted by large language models and those conducted by humans? How much greater is the economic benefit of MDT consultations from large language models compared to those conducted by humans? Retrospectively collect MDT consultation records from the past 20 years in northern Sichuan in China, as well as anonymized patient medical records. Group 1: Different large language models are assigned to act as doctors from different departments and as MDT secretaries to summarize consultations. Group 2: The large language model directly outputs diagnostic and treatment recommendations for patients. Compare the outputs of groups 1 and 2 with human performance retrospectively, score them, and select the best model from each department for a re-evaluation through anthropomorphic MDT consultations, once again comparing them to human results.

Trial Health

43
At Risk

Trial Health Score

Automated assessment based on enrollment pace, timeline, and geographic reach

Trial has exceeded expected completion date
Enrollment
300

participants targeted

Target at P75+ for all trials

Timeline
Completed

Started Oct 2024

Shorter than P25 for all trials

Geographic Reach
1 country

1 active site

Status
not yet recruiting

Health score is calculated from publicly available data and should be used for screening purposes only.

Trial Relationships

Click on a node to explore related trials.

Study Timeline

Key milestones and dates

First Submitted

Initial submission to the registry

October 1, 2024

Completed
Same day until next milestone

Study Start

First participant enrolled

October 1, 2024

Completed
3 days until next milestone

First Posted

Study publicly available on registry

October 4, 2024

Completed
28 days until next milestone

Primary Completion

Last participant's last visit for primary outcome

November 1, 2024

Completed
Same day until next milestone

Study Completion

Last participant's last visit for all outcomes

November 1, 2024

Completed
Last Updated

October 4, 2024

Status Verified

October 1, 2024

Enrollment Period

1 month

First QC Date

October 1, 2024

Last Update Submit

October 3, 2024

Conditions

Outcome Measures

Primary Outcomes (8)

  • Consultation Cost ($)

    From Multi-Disciplinary Treatment Process to Multi-Disciplinary Treatment Process until all json fields are output, the time taken by human doctors to record the time using His system generally does not exceed 12 hours.

  • Consultation Time (min)

    From Multi-Disciplinary Treatment Process to Multi-Disciplinary Treatment Process until all json fields are output, the time taken by human doctors to record the time using His system generally does not exceed 12 hours.

  • Comprehensiveness of the Multi-Disciplinary Treatment Results (Percentage Scale)

    From Multi-Disciplinary Treatment Process to Multi-Disciplinary Treatment Process until all json fields are output, the time taken by human doctors to record the time using His system generally does not exceed 12 hours.

  • Clarity of Multi-Disciplinary Treatment Results (Percentage Scale)

    From Multi-Disciplinary Treatment Process to Multi-Disciplinary Treatment Process until all json fields are output, the time taken by human doctors to record the time using His system generally does not exceed 12 hours.

  • Correctness of Multi-Disciplinary Treatment Results (Percentage Scale)

    From Multi-Disciplinary Treatment Process to Multi-Disciplinary Treatment Process until all json fields are output, the time taken by human doctors to record the time using His system generally does not exceed 12 hours.

  • Cross-Professional Team Collaboration Practice Assessment (CPAT)

    From Multi-Disciplinary Treatment Process to Multi-Disciplinary Treatment Process until all json fields are output, the time taken by human doctors to record the time using His system generally does not exceed 12 hours.

  • Rating Scale for Summarization

    From Multi-Disciplinary Treatment Process to Multi-Disciplinary Treatment Process until all json fields are output, the time taken by human doctors to record the time using His system generally does not exceed 12 hours.

  • Flesch-Kincaid Readability Test

    From Multi-Disciplinary Treatment Process to Multi-Disciplinary Treatment Process until all json fields are output, the time taken by human doctors to record the time using His system generally does not exceed 12 hours.

Secondary Outcomes (1)

  • Ethical Compliance (Boolean)

    From Multi-Disciplinary Treatment Process to Multi-Disciplinary Treatment Process until all json fields are output, the time taken by human doctors to record the time using His system generally does not exceed 12 hours.

Study Arms (4)

Anthropomorphized Process Large Language Model Multidisciplinary Treatment Group

Using a locally deployed MedicalGPT, the commercially available online GPT-4o, Claude-3.5 Sonnet, GPT-4o mini, and Claude 3 Haiku, will each sequentially play the role of physicians from different departments involved in the Multi-Disciplinary Treatment Process. They will then sequentially take on the role of a summarizer to compile their recommendations into a final suggestion or treatment plan.

Diagnostic Test: GPT-4oDiagnostic Test: GPT-4o miniDiagnostic Test: MedicalGPTDiagnostic Test: Claude-3.5 SonnetDiagnostic Test: Claude 3 Haiku

Non-anthropomorphized Process Large Language Model Multidisciplinary Treatment Group

Using a locally deployed MedicalGPT, the commercial online GPT-4o, Claude-3.5 Sonnet, GPT-4o mini, and Claude 3 Haiku to output multidisciplinary consultation results in a single instance, without separately assuming roles for each department and then compiling the results.

Diagnostic Test: GPT-4oDiagnostic Test: GPT-4o miniDiagnostic Test: MedicalGPTDiagnostic Test: Claude-3.5 SonnetDiagnostic Test: Claude 3 Haiku

Real Doctors Multi-Disciplinary Treatment Group

In traditional multidisciplinary treatments, the results are documented in the consultation records of the patients involved, including the recommendations from doctors of various departments who participated in the consultation and the final summary by the secretary.

Diagnostic Test: Real Doctors

Best Large Language Model Multidisciplinary Treatment Group

After scoring the results of the Anthropomorphized Process Large Language Model Multidisciplinary Treatment Group against the outcomes of the Real Doctors' Multi-Disciplinary Treatment Group on a department-by-department basis, the best substitute models and the best summary models for each department were selected. These top models are set to assume roles in a Multi-Disciplinary Treatment consultation.

Diagnostic Test: GPT-4oDiagnostic Test: GPT-4o miniDiagnostic Test: MedicalGPTDiagnostic Test: Claude-3.5 SonnetDiagnostic Test: Claude 3 Haiku

Interventions

GPT-4oDIAGNOSTIC_TEST

Input all patient medical records, including text, examination reports, and imaging data, into GPT-4o. Use pre-tested prompts to establish department roles, enabling it to provide diagnostic and treatment recommendations pertinent to the respective department.

Anthropomorphized Process Large Language Model Multidisciplinary Treatment GroupBest Large Language Model Multidisciplinary Treatment GroupNon-anthropomorphized Process Large Language Model Multidisciplinary Treatment Group
GPT-4o miniDIAGNOSTIC_TEST

Input all patient medical records, including text, examination reports, and imaging data, into GPT-4o mini. Use pre-tested prompts to establish department roles, enabling it to provide diagnostic and treatment recommendations pertinent to the respective department.

Anthropomorphized Process Large Language Model Multidisciplinary Treatment GroupBest Large Language Model Multidisciplinary Treatment GroupNon-anthropomorphized Process Large Language Model Multidisciplinary Treatment Group
MedicalGPTDIAGNOSTIC_TEST

Input all patient medical records, including text, examination reports, and imaging data, into MedicalGPT. Use pre-tested prompts to establish department roles, enabling it to provide diagnostic and treatment recommendations pertinent to the respective department.

Anthropomorphized Process Large Language Model Multidisciplinary Treatment GroupBest Large Language Model Multidisciplinary Treatment GroupNon-anthropomorphized Process Large Language Model Multidisciplinary Treatment Group
Claude-3.5 SonnetDIAGNOSTIC_TEST

Input all patient medical records, including text, examination reports, and imaging data, into Claude-3.5 Sonnet. Use pre-tested prompts to establish department roles, enabling it to provide diagnostic and treatment recommendations pertinent to the respective department.

Anthropomorphized Process Large Language Model Multidisciplinary Treatment GroupBest Large Language Model Multidisciplinary Treatment GroupNon-anthropomorphized Process Large Language Model Multidisciplinary Treatment Group
Claude 3 HaikuDIAGNOSTIC_TEST

Input all patient medical records, including text, examination reports, and imaging data, into Claude 3 Haiku. Use pre-tested prompts to establish department roles, enabling it to provide diagnostic and treatment recommendations pertinent to the respective department.

Anthropomorphized Process Large Language Model Multidisciplinary Treatment GroupBest Large Language Model Multidisciplinary Treatment GroupNon-anthropomorphized Process Large Language Model Multidisciplinary Treatment Group
Real DoctorsDIAGNOSTIC_TEST

Retrospectively collect the diagnostic and treatment recommendations from the corresponding departments involved in the multidisciplinary treatment of past patients, as well as the overall recommendations.

Real Doctors Multi-Disciplinary Treatment Group

Eligibility Criteria

Sexall
Healthy VolunteersNo
Age GroupsChild (0-17), Adult (18-64), Older Adult (65+)
Sampling MethodNon-Probability Sample
Study Population

From hospital

You may qualify if:

  • \. The medical records include interdisciplinary consultation notes, with recommendations from specialists of various departments and a well-documented final summary.
  • \. The medical records contain data from at least one year prior to and one year following the consultation (including intact reports and imaging records).
  • \. The patient\'s discharge conditions improved due to the multidisciplinary treatment plan after the consultation.

You may not qualify if:

  • \. The medical records do not include multidisciplinary consultation notes, or the recommendations from various departmental physicians and the final summary notes are incomplete or inadequate.
  • \. The medical records lack data from 1 year before and after the consultation, or miss necessary reports and imaging data, resulting in incomplete documentation.
  • \. The patient\'s condition at discharge has not improved following the multidisciplinary treatment plan, or the condition has worsened.

Contact the study team to confirm eligibility.

Sponsors & Collaborators

Study Sites (1)

The Affiliated Hospital of North Sichuan Medical College

Nanchong, Sichuan, 637000, China

Location

Related Publications (1)

  • Schroder C, Medves J, Paterson M, Byrnes V, Chapman C, O'Riordan A, Pichora D, Kelly C. Development and pilot testing of the collaborative practice assessment tool. J Interprof Care. 2011 May;25(3):189-95. doi: 10.3109/13561820.2010.532620. Epub 2010 Dec 23.

MeSH Terms

Conditions

NeoplasmsRespiratory InsufficiencyHeart DiseasesInfectionsPneumoniaDisease

Condition Hierarchy (Ancestors)

Respiration DisordersRespiratory Tract DiseasesCardiovascular DiseasesRespiratory Tract InfectionsLung DiseasesPathologic ProcessesPathological Conditions, Signs and Symptoms

Central Study Contacts

Zining Luo, Doctor

CONTACT

Study Design

Study Type
observational
Observational Model
OTHER
Time Perspective
RETROSPECTIVE
Sponsor Type
OTHER
Responsible Party
PRINCIPAL INVESTIGATOR
PI Title
Principal Investigator

Study Record Dates

First Submitted

October 1, 2024

First Posted

October 4, 2024

Study Start

October 1, 2024

Primary Completion

November 1, 2024

Study Completion

November 1, 2024

Last Updated

October 4, 2024

Record last verified: 2024-10

Data Sharing

IPD Sharing
Will share
Shared Documents
STUDY PROTOCOL, SAP, ICF, CSR, ANALYTIC CODE

Locations