Flexible Representation of Speech

NCT ID: NCT05209386

Last Updated: 2025-09-08

Study Results

Results available

Outcome measurements, participant flow, baseline characteristics, and adverse events have been published for this study.

View full results

Basic Information

Get a concise snapshot of the trial, including recruitment status, study phase, enrollment targets, and key timeline milestones.

Recruitment Status

COMPLETED

Clinical Phase

NA

Total Enrollment

48 participants

Study Classification

INTERVENTIONAL

Study Start Date

2022-05-02

Study Completion Date

2025-04-10

Brief Summary

Review the sponsor-provided synopsis that highlights what the study is about and why it is being conducted.

The overarching goal of this exploratory research is to understand the dynamic and flexible nature of speech processing in the human supratemporal plane. The temporal lobe has long been established as a region of interest in the speech perception and processing literature because it contains the auditory cortex. More recently, research has localized the supratemporal plane as an area that exhibits response specificity to acoustic properties of complex auditory signals like speech. The supratemporal plane, comprised of Heschl's gyrus, the planum polare, and the planum temporale, is capable of the rapid spectrotemporal analysis required to map acoustic information to linguistic representation. Neural activity in this area, however, is rarely studied directly because it is difficult to access with non-invasive measures like scalp electroencephalography (EEG). Capitalizing on the unique opportunity to access these areas via routine clinical stereoelectroencephalography (sEEG) in a patient population, this study seeks to understand how cortical responses reflect the diagnosticity of two acoustic-phonetic dimensions of interest and how responses rapidly and flexibly adapt to changes in listening demands. Examining how neural response to voice onset time (VOT) and fundamental frequency (F0) modulates as a function of perceptual weight carried in signaling phoneme categories, and identifying how changes in listening context shift perceptual weight, will provide invaluable data that indicates how speech processing flexibly adapts to short-term acoustic patterns.

Detailed Description

Dive into the extended narrative that explains the scientific background, objectives, and procedures in greater depth.

The purpose of this study is to understand the dynamic, flexible nature of speech processing as a function of perceptual weight applied to acoustic-phonetic dimensions within varying listening contexts and demands.

The specific aims of this study are as follows:

1. To establish the neural response to two acoustic-phonetic dimensions as a function of the perceptual weight they carry when signaling phoneme identity.

Aim 1 will specifically evaluate responses to voice onset time (VOT) and fundamental frequency (F0). Data collected will provide a baseline response for participants.
2. To identify how experimental manipulation of listening context impacts perceptual weighting strategies of VOT and F0.

Aim 2 will evaluate modulation of neural response to the introduction of noise and the introduction of an "accent."

A secondary aim of this study is to use "control" electrodes, which are those placed in clinically necessary regions of the brain but outside of the region of interest for this study (supratemporal plane), to determine if additional regions of the brain are implicated in adaptive plasticity of speech processing.

Speech is the primary means by which we convey our needs, wants, and thoughts to others and the ability to process speech is crucial to our everyday functioning, as well as our ability to establish and maintain relationships. Impairments in speech processing have an undeniable negative impact on individuals and society. While habilitative and rehabilitative strategies exist that can improve auditory processing and quality of life, understanding the exact neural mechanism underlying the human brain's ability to process speech would contribute to a more well-defined means by which to target deficits. This study seeks to understand the regions of the brain involved in speech processing, how those regions analyze specific acoustic-phonetic dimensions, and how the system adapts to successfully process speech in different listening contexts.

Modern electrophysiological techniques have revolutionized research into activity in the human brain, allowing investigators to identify specific regions or patterns of activity associated with various behaviors and sensory experiences. sEEG recordings, which involve intracerebral measurements of neural activity using depth electrodes, are capable of providing unique access to regions of the brain that are otherwise inaccessible with less invasive measurements. Capitalizing on PI Abel's work with pediatric patients undergoing sEEG recording for localization of seizure foci or language mapping, this study will allow researchers to directly study activity in regions that have already been implicated in the literature as crucial to spectrotemporal analysis of complex acoustic signals like speech. These regions within the supratemporal plane (STP) include Heschl's gyrus, the planum polare, and the planum temporale, all of which are uniquely targeted via sEEG.

Existing literature in speech processing has indicated that the mapping of physical input (acoustic signal) to linguistic representation (identification of phonemes or words) is not a static process, but rather highly dependent on listening context. The auditory processing system regularly adapts to changes in signal quality, adverse listening conditions, and short-term deviations from expected and learned regularities in native language input by applying varying importance, or perceptual weight, to specific acoustic-phonetic parameters. This indicates the existence of adaptive plasticity in speech processing, yet existing neurophysiological models do not account for this flexibility in cortical response. Data from pilot EEG and sEEG studies demonstrated that high gamma activity in the STP and behavioral responses were graded by the perceptual weight given to two acoustic-phonetic dimensions, voice onset time (VOT) and fundamental frequency (F0).

The proposed study will contribute to existing knowledge by helping to establish a more detailed model of on-line cortical response and adaptation to changing acoustic signals. It is unique in its accounting for the role that perceptual weight of acoustic-phonetic dimensions play in signaling phonemes and making category-based judgments.

Conditions

See the medical conditions and disease areas that this research is targeting or investigating.

Epilepsy

Study Design

Understand how the trial is structured, including allocation methods, masking strategies, primary purpose, and other design elements.

Allocation Method

NA

Intervention Model

SINGLE_GROUP

Primary Study Purpose

BASIC_SCIENCE

Blinding Strategy

NONE

Study Groups

Review each arm or cohort in the study, along with the interventions and objectives associated with them.

Patient Participants

This single-group study will recruit patients through the PI's clinical practice who are undergoing invasive neurophysiological monitoring (sEEG) with clinically necessary placement of electrodes in the supratemporal plane.

All participants will complete the same behavioral response paradigms.

Group Type EXPERIMENTAL

Dimension-Based Statistical Learning

Intervention Type BEHAVIORAL

Each participant will complete self-paced blocks of stimuli that will first establish a baseline for neural activity and behavioral responses with clear speech, and will then record responses for experimentally manipulated blocks to introduce 1) speech-in-noise and 2) a Canonical-Reverse block to model an "accent." Auditory stimuli will be adjusted to a comfortable level for each participant as determined by a calibration process completed by the participant. Each block involves listening to sound via earphones and making a categorical decision between initial consonants (/b/ or /p/) by tapping a button to indicate the word heard by the participant.

Interventions

Learn about the drugs, procedures, or behavioral strategies being tested and how they are applied within this trial.

Dimension-Based Statistical Learning

Each participant will complete self-paced blocks of stimuli that will first establish a baseline for neural activity and behavioral responses with clear speech, and will then record responses for experimentally manipulated blocks to introduce 1) speech-in-noise and 2) a Canonical-Reverse block to model an "accent." Auditory stimuli will be adjusted to a comfortable level for each participant as determined by a calibration process completed by the participant. Each block involves listening to sound via earphones and making a categorical decision between initial consonants (/b/ or /p/) by tapping a button to indicate the word heard by the participant.

Intervention Type BEHAVIORAL

Eligibility Criteria

Check the participation requirements, including inclusion and exclusion rules, age limits, and whether healthy volunteers are accepted.

Inclusion Criteria

* Individuals 15-25 years old
* Undergoing sEEG placement in the supratemporal plane for clinically necessary localization of epileptic foci or language mapping
* Fluent English speakers
* Cognition and speech-language skills within normal limits (as determined by evaluation prior to surgery)
* Normal or correct-to-normal visual acuity
* Normal hearing acuity in each ear (as determined by audiometric assessment)
* No history of autism or ADHD

Exclusion Criteria

* Individuals with intellectual disabilities
* Abnormal epileptiform activity in the supratemporal plane
* Lack of fluent English comprehension/production
* Severe language or auditory-specific cognitive dysfunction
* History of autism or ADHD
Minimum Eligible Age

15 Years

Maximum Eligible Age

25 Years

Eligible Sex

ALL

Accepts Healthy Volunteers

No

Sponsors

Meet the organizations funding or collaborating on the study and learn about their roles.

National Institutes of Health (NIH)

NIH

Sponsor Role collaborator

Carnegie Mellon University

OTHER

Sponsor Role collaborator

National Institute on Deafness and Other Communication Disorders (NIDCD)

NIH

Sponsor Role collaborator

University of Pittsburgh

OTHER

Sponsor Role lead

Responsible Party

Identify the individual or organization who holds primary responsibility for the study information submitted to regulators.

Taylor Abel

Assistant Professor of Neurological Surgery; Pediatric Neurosurgeon

Responsibility Role PRINCIPAL_INVESTIGATOR

Principal Investigators

Learn about the lead researchers overseeing the trial and their institutional affiliations.

Taylor J Abel, MD

Role: PRINCIPAL_INVESTIGATOR

University of Pittsburgh

Locations

Explore where the study is taking place and check the recruitment status at each participating site.

UPMC Children's Hospital of Pittsburgh

Pittsburgh, Pennsylvania, United States

Site Status

Countries

Review the countries where the study has at least one active or historical site.

United States

Provided Documents

Download supplemental materials such as informed consent forms, study protocols, or participant manuals.

Document Type: Study Protocol

View Document

Document Type: Statistical Analysis Plan

View Document

Document Type: Informed Consent Form

View Document

Other Identifiers

Review additional registry numbers or institutional identifiers associated with this trial.

1R21DC019217-01A1

Identifier Type: NIH

Identifier Source: secondary_id

View Link

STUDY21090011

Identifier Type: -

Identifier Source: org_study_id

More Related Trials

Additional clinical trials that may be relevant based on similarity analysis.

Brain Mapping of Voice Control
NCT00066911 COMPLETED
Neural Mechanisms for Stopping Ongoing Speech Production
NCT05876910 ENROLLING_BY_INVITATION NA
The Role of Cerebellum in Speech
NCT03972202 ACTIVE_NOT_RECRUITING NA