Data-driven Identification for Substance Misuse

NCT ID: NCT03833804

Last Updated: 2025-10-24

Study Results

Results available

Outcome measurements, participant flow, baseline characteristics, and adverse events have been published for this study.

View full results

Basic Information

Get a concise snapshot of the trial, including recruitment status, study phase, enrollment targets, and key timeline milestones.

Recruitment Status

COMPLETED

Clinical Phase

NA

Total Enrollment

64996 participants

Study Classification

INTERVENTIONAL

Study Start Date

2022-09-19

Study Completion Date

2024-09-19

Brief Summary

Review the sponsor-provided synopsis that highlights what the study is about and why it is being conducted.

The investigators propose to develop an open-source, publicly available machine learning model that health systems could download and apply to their electronic health record data marts to screen for substance misuse in their patients. The investigators hypothesize that the natural language processing algorithm can provide a standardized and interoperable approach for an automated daily screen on all hospitalized patients and provide better implementation fidelity for screening, brief intervention, and referral to treatment.

Detailed Description

Dive into the extended narrative that explains the scientific background, objectives, and procedures in greater depth.

In 2016, nearly 30% hospital discharges in the United States (US) had a major diagnostic category for a substance-use related condition. Substance misuse ranks second among principal diagnoses for unplanned 7-day hospital readmission rates. Despite the availability of Screening, Brief Intervention, and Referral to Treatment (SBIRT) interventions, substance misuse is not part of the admission routine and only a minority of patients are screened for substance misuse in the hospital setting. This is particularly problematic, since among hospitalized inpatients, the prevalence of substance misuse is estimated to be as high as 25%, greater than either the general population or outpatient setting. Practical screening methods tailored for the hospital setting are needed.

In the advent of Meaningful Use in the electronic health record (EHR), efficiency for alcohol detection may be improved by leveraging data collected during usual care. Documentation of substance use is common and occurs in over 96% of provider admission notes, but their free text format renders them difficult to mine and analyze. Natural Language Processing (NLP) and machine learning are subfields of artificial intelligence (AI) that provide a solution to analyze text data in the EHR to identify substance misuse. Modern NLP has fused with machine learning, another sub-field of artificial intelligence focused on learning from data. In particular, the most powerful NLP methods rely on supervised learning, a type of machine learning that takes advantage of current reference standards to make predictions about unseen cases

In the earlier version of an NLP and machine learning tool, the investigators successfully used data from clinical notes collected in the first 24 hours of hospital admission to reach a sensitivity and specificity above 70% for identifying alcohol misuse. With nearly 36 million hospital admissions in 2016, a substance misuse classifier has potential to impact millions.

In this study, the aim is to prospectively implement a substance misuse classifier to examine its effectiveness against current practice of all hospitalized adult patients at a tertiary health system. The health system has a mature screening system to examine substance misuse classifier performance against current practice of questionnaire screening.

The hypothesis is that the substance misuse classifier may provide a standardized, interoperable, and accurate approach to screen hospitalized patients. Successful implementation of the classifier in hospitalized patients is a step towards an automated and comprehensive universal screening system for substance misuse.

Conditions

See the medical conditions and disease areas that this research is targeting or investigating.

Substance Use Substance Abuse Substance-Related Disorders

Study Design

Understand how the trial is structured, including allocation methods, masking strategies, primary purpose, and other design elements.

Allocation Method

NA

Intervention Model

SEQUENTIAL

Quasi-experimental design as an interrupted time series
Primary Study Purpose

SCREENING

Blinding Strategy

NONE

No masking as the manual screen is already part of usual care and the automated screen will become usual care in the post-period of the pre-post design.

Study Groups

Review each arm or cohort in the study, along with the interventions and objectives associated with them.

SMART-AI: NLP (natural language processing) pre-screen

Automated processing of clinical notes collected during routine care in first 24 hours of hospital admission to identify individuals at-risk for substance misuse to receive standard-of-care full screening and assessment, brief intervention, or referral to treatment (SBIRT) intervention.

Group Type EXPERIMENTAL

Processing of clinical notes in the EHR data collected during routine care

Intervention Type OTHER

Clinical notes collected in the first day of hospital admission during usual care as input to natural language processing and machine learning algorithm.

Usual Care

Data collected before the intervention began

Group Type NO_INTERVENTION

No interventions assigned to this group

Interventions

Learn about the drugs, procedures, or behavioral strategies being tested and how they are applied within this trial.

Processing of clinical notes in the EHR data collected during routine care

Clinical notes collected in the first day of hospital admission during usual care as input to natural language processing and machine learning algorithm.

Intervention Type OTHER

Eligibility Criteria

Check the participation requirements, including inclusion and exclusion rules, age limits, and whether healthy volunteers are accepted.

Inclusion Criteria

* Ages 18 years old to 89 years old
* Inpatient status during hospitalization
* Length of stay greater than 24 hours

Exclusion Criteria

* Cannot participate in the usual care SBIRT intervention
* Death or obtunded during first 24 hours of admission
* Discharged against medical advice
* Transferred from another acute care hospital
* Transferred to another acute care hospital
Minimum Eligible Age

18 Years

Maximum Eligible Age

89 Years

Eligible Sex

ALL

Accepts Healthy Volunteers

No

Sponsors

Meet the organizations funding or collaborating on the study and learn about their roles.

Rush University Medical Center

OTHER

Sponsor Role collaborator

National Institute on Drug Abuse (NIDA)

NIH

Sponsor Role collaborator

University of Wisconsin, Madison

OTHER

Sponsor Role lead

Responsible Party

Identify the individual or organization who holds primary responsibility for the study information submitted to regulators.

Responsibility Role SPONSOR

Locations

Explore where the study is taking place and check the recruitment status at each participating site.

Rush University Medical Center

Chicago, Illinois, United States

Site Status

Countries

Review the countries where the study has at least one active or historical site.

United States

References

Explore related publications, articles, or registry entries linked to this study.

Afshar M, Phillips A, Karnik N, Mueller J, To D, Gonzalez R, Price R, Cooper R, Joyce C, Dligach D. Natural language processing and machine learning to identify alcohol misuse from the electronic health record in trauma patients: development and internal validation. J Am Med Inform Assoc. 2019 Mar 1;26(3):254-261. doi: 10.1093/jamia/ocy166.

Reference Type RESULT
PMID: 30602031 (View on PubMed)

Joyce C, Markossian TW, Nikolaides J, Ramsey E, Thompson HM, Rojas JC, Sharma B, Dligach D, Oguss MK, Cooper RS, Afshar M. The Evaluation of a Clinical Decision Support Tool Using Natural Language Processing to Screen Hospitalized Adults for Unhealthy Substance Use: Protocol for a Quasi-Experimental Design. JMIR Res Protoc. 2022 Dec 19;11(12):e42971. doi: 10.2196/42971.

Reference Type DERIVED
PMID: 36534461 (View on PubMed)

Provided Documents

Download supplemental materials such as informed consent forms, study protocols, or participant manuals.

Document Type: Study Protocol and Statistical Analysis Plan

View Document

Related Links

Access external resources that provide additional context or updates about the study.

https://github.com/

login page but full code not finalized for publishing

Other Identifiers

Review additional registry numbers or institutional identifiers associated with this trial.

A534285

Identifier Type: OTHER

Identifier Source: secondary_id

SMPH/MEDICINE

Identifier Type: OTHER

Identifier Source: secondary_id

1R01DA051464

Identifier Type: NIH

Identifier Source: secondary_id

View Link

2022-0983

Identifier Type: -

Identifier Source: org_study_id

More Related Trials

Additional clinical trials that may be relevant based on similarity analysis.

A Chatbot to Support Substance Use Recovery
NCT06845878 RECRUITING PHASE1/PHASE2