Evaluating AI and Human Expert Decisions in Colorectal Cancer

Study Results

Results pending

The study team has not published outcome measurements, participant flow, or safety data for this trial yet. Check back later for updates.

Basic Information

Get a concise snapshot of the trial, including recruitment status, study phase, enrollment targets, and key timeline milestones.

Recruitment Status

NOT_YET_RECRUITING

Total Enrollment

1500 participants

Study Classification

OBSERVATIONAL

Study Start Date

2025-07-01

Study Completion Date

2026-06-01

Brief Summary

Review the sponsor-provided synopsis that highlights what the study is about and why it is being conducted.

The goal of this observational study is to evaluate the decision-making consistency between large language models (LLMs) and expert multidisciplinary teams (MDTs) in adult patients diagnosed with colorectal cancer who underwent MDT consultation between January 2023 and December 2024.

The main questions it aims to answer are:

How consistent are the treatment decisions generated by LLMs compared to actual MDT decisions? Do different LLMs (e.g., ChatGPT, DeepSeek) show varying levels of agreement with expert recommendations? What clinical factors contribute to differences between AI-generated and human expert decisions? Researchers will compare the AI-generated treatment recommendations with real-world MDT decisions using anonymized patient records to see if LLMs can reliably support clinical decision-making in oncology.

Participants will:

Have their de-identified clinical data (e.g., imaging, pathology, MDT notes) processed through several LLMs Not be contacted or receive any interventions, as this is a retrospective study using existing clinical records only.

Related Clinical Trials

Explore similar clinical trials based on study characteristics and research focus.

Young-onset Colorectal Cancer Screening Based on Artificial Intelligence

NCT06342622

Colorectal Cancer

COMPLETED

Early Screening of Colorectal Cancer Based on Plasma Multi-omics Combining With Artificial Intelligence

NCT05587452

Colorectal Adenoma Colorectal Cancer

UNKNOWN

Comparison of the Diagnostic Performance of Different Artificial Intelligence Assisted Endocytoscopy for Colorectal Lesions

NCT06982872

Endocytoscopy

RECRUITING

AI-assisted Integrated Care to Promote Colonoscopy Uptake

NCT07261059

Colorectal Neoplasms Colonoscopy

NOT_YET_RECRUITING NA

AI Prediction Model and Risk Stratification for Lung Metastasis in Colorectal Cancer

NCT05816902

Colorectal Cancer Lung Metastases

COMPLETED

Detailed Description

Dive into the extended narrative that explains the scientific background, objectives, and procedures in greater depth.

This is a retrospective, non-interventional observational study aiming to evaluate the consistency between treatment decisions made by large language models (LLMs) and multidisciplinary team (MDT) experts in the management of colorectal cancer (CRC).

Colorectal cancer is a highly heterogeneous malignancy requiring personalized treatment strategies, often developed through MDT discussions that integrate input from surgery, oncology, radiology, pathology, and other specialties. While MDTs improve treatment planning and outcomes, they are time- and resource-intensive, and subject to variability in expert judgment. With the rise of artificial intelligence, especially LLMs such as ChatGPT and DeepSeek, there is growing interest in their potential role in assisting or standardizing clinical decision-making.

In this study, researchers will retrospectively analyze de-identified clinical records of approximately 1,500 patients with histologically confirmed colorectal cancer who underwent MDT consultation at a tertiary cancer center between January 2023 and December 2024. Key clinical data-including demographic information, imaging reports (CT, MRI), endoscopy results, pathology findings, and MDT recommendations-will be extracted and anonymized.

These de-identified records will be input into several LLMs (ChatGPT, DeepSeek, Baichuan, and Qwen) running on secure offline servers. The models will be asked to generate treatment recommendations, which will be categorized into predefined decision codes (e.g., surgery, systemic therapy, chemoradiotherapy, further diagnostics). Each case will be input three times to assess the consistency of the model output.

The primary outcome is the agreement between AI-generated recommendations and original MDT decisions, quantified using Cohen's Kappa. Secondary analyses include comparison among LLMs using chi-squared tests, evaluation of output consistency via Fleiss' Kappa, and identification of clinical factors associated with discordant decisions.

This study does not involve any direct patient contact, intervention, or new clinical procedures. All data are historical and anonymized in accordance with ethical and legal requirements. The results are expected to inform the potential value, limitations, and appropriate use of AI in supporting multidisciplinary decision-making in oncology.

Conditions

See the medical conditions and disease areas that this research is targeting or investigating.

Colorectal Cancer

Study Design

Understand how the trial is structured, including allocation methods, masking strategies, primary purpose, and other design elements.

Observational Model Type

CASE_ONLY

Study Time Perspective

RETROSPECTIVE

Interventions

Learn about the drugs, procedures, or behavioral strategies being tested and how they are applied within this trial.

LLM-MDT

Leveraging large language models (LLMs) to Generate Multidisciplinary Team (MDT) Treatment Recommendations

Intervention Type OTHER

Eligibility Criteria

Check the participation requirements, including inclusion and exclusion rules, age limits, and whether healthy volunteers are accepted.

Inclusion Criteria

* Patients with a histologically confirmed diagnosis of colorectal cancer
* Patients who received multidisciplinary team (MDT) consultation at Peking University Cancer Hospital between January 1, 2023 and December 31, 2024
* Availability of complete clinical records, including(MDT consultation notes, CT or MRI imaging reports, Pathology reports, Outpatient or inpatient medical summaries)

Exclusion Criteria

* Incomplete or missing medical records related to MDT decision-making
* MDT consultations conducted for non-oncologic purposes (e.g., hernia evaluation, stoma planning)
* Missing critical clinical data such as imaging or pathology reports
* Duplicate or conflicting records that prevent reliable data analysis

Eligible Sex

ALL

Accepts Healthy Volunteers

No