Evaluating the Effectiveness and Acceptability of a GPT-4o and RAG-Based Voice Chatbot for Depression Screening Using PHQ-9
NCT ID: NCT06801925
Last Updated: 2025-01-30
Study Results
The study team has not published outcome measurements, participant flow, or safety data for this trial yet. Check back later for updates.
Basic Information
Get a concise snapshot of the trial, including recruitment status, study phase, enrollment targets, and key timeline milestones.
ENROLLING_BY_INVITATION
100 participants
OBSERVATIONAL
2025-02-01
2025-05-31
Brief Summary
Review the sponsor-provided synopsis that highlights what the study is about and why it is being conducted.
The voice-based chatbot integrates GPT-4o, with RAG to enhance its ability to provide informed and contextualized responses during interactions. GPT-4o serves as the conversational engine, capable of generating human-like, empathetic, and contextually appropriate dialogue. RAG, on the other hand, enables the chatbot to retrieve and incorporate external, up-to-date knowledge from a curated database or knowledge repository, ensuring the accuracy and reliability of its responses.
Related Clinical Trials
Explore similar clinical trials based on study characteristics and research focus.
AI-Powered Mental Health Screening in University Students
NCT07092085
Tracking Depression Symptoms With a Health Chatbot
NCT03990389
Interactive Voice-Based Administration of the PHQ-9
NCT04609267
Developing E-health Services (DES): The Feasibility and Acceptability of Video-conferencing for Adults With Depression
NCT03288506
Cognitive and Mood Assessment Data in Major Depressive Disorder Using Digital Wearable Technology
NCT03067506
Detailed Description
Dive into the extended narrative that explains the scientific background, objectives, and procedures in greater depth.
Participants will fill in the PHQ-9 for self-testing before interacting with the chatbot (the results will not be disclosed to the public and will only be used for accuracy comparisons), and the results of their self-tests will be compared with the results given by the chatbot in terms of accuracy.
The chatbot interaction comprises three phases:
1. Warm-up conversations for rapport-building and general support.
* The chatbot initiates casual, empathetic dialogues to build rapport with users, helping them feel comfortable and at ease before transitioning to the PHQ-9 screening.
* Users can ask general questions related to mental health, and the chatbot provides informed and supportive responses.
2. Administration of the PHQ-9 questionnaire for depression screening.
* The chatbot introduces the PHQ-9 questionnaire, explaining its purpose and how the results will help assess the user's mental health.
* Through voice interaction, users respond to the nine PHQ-9 questions, and the chatbot records their responses. The chatbot can clarify questions or provide additional context if users have difficulty understanding specific items.
3. Analysis of results and delivery of tailored recommendations.
* After the user completes the PHQ-9, the chatbot analyzes the responses, calculates the total score, and categorizes the results into severity levels (e.g., mild, moderate).
* Based on the score, the chatbot provides personalized recommendations, such as self-help strategies for mild symptoms or suggesting professional mental health services for more severe cases.
Participants will interact with the chatbot and then participate in a 1-hour semi-structured interview to provide feedback on their experience. The study focuses on evaluating the acceptability and feasibility of using such LLM-based chatbots in mental health screening and identifying potential improvements and risks.
Study Objectives Primary Objectives
1. To evaluate the acceptability, feasibility, and accuracy of a GPT-4o and RAG-based voice chatbot (HopeBot) for depression screening using PHQ-9.
Hypothesis: Participants showed high acceptance of HopeBot (higher than 65%) and high willingness to use such LLM-based chatbot for mental health screening in the future (higher than 65%), indicating recognition of the credibility of LLM as a supportive tool in mental health screening (higher than 65%). Participants use of the HopeBot for depression screening matched their self-test PHQ-9 results by 100%
2. To analyze the chatbot's effectiveness in identifying depressive symptoms and delivering actionable recommendations.
Hypothesis: HopeBot can help users take the PHQ-9 test in a friendly way, help users categorize the answers accurately, and give accurate test results, the advice they provide is based on the official PHQ-9 guidelines, and more than 70% of the users say that their responses are very effective and helpful.
Secondary Objectives
1. To assess the feasibility and performance of integrating RAG with LLM in creating a voice-interactive chatbot for mental health.
Hypothesis: Over 65% of participants recognized that responses using RAG were more helpful and effective.
2. To explore the strengths, limitations, and risks of deploying LLMs in the mental health domain.
Hypothesis: More than 65% of users say that HopeBot is very convenient, more accessible, and cost-free to provide non-judgmental advice. However, 50% still expressed concerns about its privacy and data security.
Conditions
See the medical conditions and disease areas that this research is targeting or investigating.
Study Design
Understand how the trial is structured, including allocation methods, masking strategies, primary purpose, and other design elements.
OTHER
CROSS_SECTIONAL
Interventions
Learn about the drugs, procedures, or behavioral strategies being tested and how they are applied within this trial.
GPT-4o and RAG Voice Chatbot for PHQ-9 Screening
This study involves the use of a voice-based chatbot powered by GPT-4o and Retrieval-Augmented Generation (RAG) to conduct depression screening using the Patient Health Questionnaire-9 (PHQ-9).
The chatbot aims to evaluate the feasibility and acceptability of using AI-powered conversational tools for mental health screening.
Participants interact with the chatbot in a single session, answering PHQ-9 questions and receiving responses generated using GPT-4o and RAG technologies.
Eligibility Criteria
Check the participation requirements, including inclusion and exclusion rules, age limits, and whether healthy volunteers are accepted.
Inclusion Criteria
* Fluent in English.
* Access to a device capable of voice interaction and stable internet connection.
* Willing to participate in chatbot interaction and a follow-up interview.
Exclusion Criteria
* Participants undergoing active treatment for depression with a psychiatrist.
* Discomfort with voice-based technology or inability to provide informed consent.
18 Years
65 Years
ALL
Yes
Sponsors
Meet the organizations funding or collaborating on the study and learn about their roles.
University College, London
OTHER
Responsible Party
Identify the individual or organization who holds primary responsibility for the study information submitted to regulators.
Principal Investigators
Learn about the lead researchers overseeing the trial and their institutional affiliations.
Kezhi Li
Role: STUDY_DIRECTOR
University College, London
Locations
Explore where the study is taking place and check the recruitment status at each participating site.
UCL Institute of Health Informatics
London, , United Kingdom
Countries
Review the countries where the study has at least one active or historical site.
Other Identifiers
Review additional registry numbers or institutional identifiers associated with this trial.
26133.001
Identifier Type: OTHER
Identifier Source: secondary_id
26133.001
Identifier Type: -
Identifier Source: org_study_id
More Related Trials
Additional clinical trials that may be relevant based on similarity analysis.