Evaluating AI and Human Expert Decisions in Colorectal Cancer

NCT ID: NCT07045207

Last Updated: 2025-07-01

Study Results

Results pending

The study team has not published outcome measurements, participant flow, or safety data for this trial yet. Check back later for updates.

Basic Information

Get a concise snapshot of the trial, including recruitment status, study phase, enrollment targets, and key timeline milestones.

Recruitment Status

NOT_YET_RECRUITING

Total Enrollment

1500 participants

Study Classification

OBSERVATIONAL

Study Start Date

2025-07-01

Study Completion Date

2026-06-01

Brief Summary

Review the sponsor-provided synopsis that highlights what the study is about and why it is being conducted.

The goal of this observational study is to evaluate the decision-making consistency between large language models (LLMs) and expert multidisciplinary teams (MDTs) in adult patients diagnosed with colorectal cancer who underwent MDT consultation between January 2023 and December 2024.

The main questions it aims to answer are:

How consistent are the treatment decisions generated by LLMs compared to actual MDT decisions? Do different LLMs (e.g., ChatGPT, DeepSeek) show varying levels of agreement with expert recommendations? What clinical factors contribute to differences between AI-generated and human expert decisions? Researchers will compare the AI-generated treatment recommendations with real-world MDT decisions using anonymized patient records to see if LLMs can reliably support clinical decision-making in oncology.

Participants will:

Have their de-identified clinical data (e.g., imaging, pathology, MDT notes) processed through several LLMs Not be contacted or receive any interventions, as this is a retrospective study using existing clinical records only.

Detailed Description

Dive into the extended narrative that explains the scientific background, objectives, and procedures in greater depth.

This is a retrospective, non-interventional observational study aiming to evaluate the consistency between treatment decisions made by large language models (LLMs) and multidisciplinary team (MDT) experts in the management of colorectal cancer (CRC).

Colorectal cancer is a highly heterogeneous malignancy requiring personalized treatment strategies, often developed through MDT discussions that integrate input from surgery, oncology, radiology, pathology, and other specialties. While MDTs improve treatment planning and outcomes, they are time- and resource-intensive, and subject to variability in expert judgment. With the rise of artificial intelligence, especially LLMs such as ChatGPT and DeepSeek, there is growing interest in their potential role in assisting or standardizing clinical decision-making.

In this study, researchers will retrospectively analyze de-identified clinical records of approximately 1,500 patients with histologically confirmed colorectal cancer who underwent MDT consultation at a tertiary cancer center between January 2023 and December 2024. Key clinical data-including demographic information, imaging reports (CT, MRI), endoscopy results, pathology findings, and MDT recommendations-will be extracted and anonymized.

These de-identified records will be input into several LLMs (ChatGPT, DeepSeek, Baichuan, and Qwen) running on secure offline servers. The models will be asked to generate treatment recommendations, which will be categorized into predefined decision codes (e.g., surgery, systemic therapy, chemoradiotherapy, further diagnostics). Each case will be input three times to assess the consistency of the model output.

The primary outcome is the agreement between AI-generated recommendations and original MDT decisions, quantified using Cohen's Kappa. Secondary analyses include comparison among LLMs using chi-squared tests, evaluation of output consistency via Fleiss' Kappa, and identification of clinical factors associated with discordant decisions.

This study does not involve any direct patient contact, intervention, or new clinical procedures. All data are historical and anonymized in accordance with ethical and legal requirements. The results are expected to inform the potential value, limitations, and appropriate use of AI in supporting multidisciplinary decision-making in oncology.

Conditions

See the medical conditions and disease areas that this research is targeting or investigating.

Colorectal Cancer

Study Design

Understand how the trial is structured, including allocation methods, masking strategies, primary purpose, and other design elements.

Observational Model Type

CASE_ONLY

Study Time Perspective

RETROSPECTIVE

Interventions

Learn about the drugs, procedures, or behavioral strategies being tested and how they are applied within this trial.

LLM-MDT

Leveraging large language models (LLMs) to Generate Multidisciplinary Team (MDT) Treatment Recommendations

Intervention Type OTHER

Eligibility Criteria

Check the participation requirements, including inclusion and exclusion rules, age limits, and whether healthy volunteers are accepted.

Inclusion Criteria

* Patients with a histologically confirmed diagnosis of colorectal cancer
* Patients who received multidisciplinary team (MDT) consultation at Peking University Cancer Hospital between January 1, 2023 and December 31, 2024
* Availability of complete clinical records, including(MDT consultation notes, CT or MRI imaging reports, Pathology reports, Outpatient or inpatient medical summaries)

Exclusion Criteria

* Incomplete or missing medical records related to MDT decision-making
* MDT consultations conducted for non-oncologic purposes (e.g., hernia evaluation, stoma planning)
* Missing critical clinical data such as imaging or pathology reports
* Duplicate or conflicting records that prevent reliable data analysis
Eligible Sex

ALL

Accepts Healthy Volunteers

No

Sponsors

Meet the organizations funding or collaborating on the study and learn about their roles.

Peking University Cancer Hospital & Institute

OTHER

Sponsor Role lead

Responsible Party

Identify the individual or organization who holds primary responsibility for the study information submitted to regulators.

Responsibility Role SPONSOR

Locations

Explore where the study is taking place and check the recruitment status at each participating site.

Peking University Cancer Hospital

Beijing, , China

Site Status

Countries

Review the countries where the study has at least one active or historical site.

China

Central Contacts

Reach out to these primary contacts for questions about participation or study logistics.

Yongjiu Chen, PhD

Role: CONTACT

+86 18813041827

Facility Contacts

Find local site contact details for specific facilities participating in the trial.

Yongjiu Chen, PhD

Role: primary

+86 18813041827

Other Identifiers

Review additional registry numbers or institutional identifiers associated with this trial.

PekingUCHI2025YJZ57

Identifier Type: -

Identifier Source: org_study_id

More Related Trials

Additional clinical trials that may be relevant based on similarity analysis.