Multimodal Analysis of Structural Voice Disorders Based on Speech and Stroboscopic Laryngoscope Video

NCT ID: NCT05348031

Last Updated: 2022-04-27

Study Results

Results pending

The study team has not published outcome measurements, participant flow, or safety data for this trial yet. Check back later for updates.

Basic Information

Get a concise snapshot of the trial, including recruitment status, study phase, enrollment targets, and key timeline milestones.

Recruitment Status

NOT_YET_RECRUITING

Total Enrollment

1 participants

Study Classification

OBSERVATIONAL

Study Start Date

2022-05-06

Study Completion Date

2027-02-20

Brief Summary

Review the sponsor-provided synopsis that highlights what the study is about and why it is being conducted.

This study intends to collect clinical data such as strobary laryngoscope images and vowel audio data of patients with structural voice disorders and healthy individuals, and to establish a multimodal voice disorder diagnosis system model by using deep learning algorithms. Multi-classification of diseases that cause voice disorders can be applied to patients with voice disorders but undiagnosed in clinical practice, thereby assisting clinicians in diagnosing diseases and reducing misdiagnosis and missed diagnosis. In addition, some patients with voice disorders can be managed remotely through the audio diagnosis model, and better follow-up and treatment suggestions can be given to them. Remote voice therapy can alleviate the current situation of the shortage of speech therapists in remote areas of our country, and increase the number of patients who need voice therapy. opportunity. Remote voice therapy is more cost-effective, more flexible in time, and more cost-effective.

Detailed Description

Dive into the extended narrative that explains the scientific background, objectives, and procedures in greater depth.

1. Detection and Classification of Acoustic Lesions Based on Speech Deep Learning
2. Detection and Classification of Acoustic Lesions Based on Deep Learning of Images
3. Detection and Classification of Acoustic Lesions Based on Deep Learning Based on Multimodality

Conditions

See the medical conditions and disease areas that this research is targeting or investigating.

Voice Disorders

Study Design

Understand how the trial is structured, including allocation methods, masking strategies, primary purpose, and other design elements.

Observational Model Type

CASE_ONLY

Study Time Perspective

RETROSPECTIVE

Eligibility Criteria

Check the participation requirements, including inclusion and exclusion rules, age limits, and whether healthy volunteers are accepted.

Inclusion Criteria

Laryngeal cancer, laryngeal precancerous lesions, benign laryngeal lesions with voice disorders, healthy people without throat diseases

Exclusion Criteria

1. A history of laryngeal surgery
2. Patients with voice disorders caused by various causes except laryngeal cancer, laryngeal precancerous lesions, and benign laryngeal lesions
3. The audio quality is not clear, the stroboscopic laryngoscope does not clearly display the anatomical area related to the glottis, and it is underexposed and blocked;
Minimum Eligible Age

20 Years

Maximum Eligible Age

80 Years

Eligible Sex

ALL

Accepts Healthy Volunteers

Yes

Sponsors

Meet the organizations funding or collaborating on the study and learn about their roles.

Duke Kunshan University

OTHER

Sponsor Role collaborator

Sun Yat-Sen Memorial Hospital of Sun Yat-Sen University

OTHER

Sponsor Role lead

Responsible Party

Identify the individual or organization who holds primary responsibility for the study information submitted to regulators.

Responsibility Role SPONSOR

Principal Investigators

Learn about the lead researchers overseeing the trial and their institutional affiliations.

YueXin Cai

Role: STUDY_CHAIR

Sun Yat-Sen Memorial Hospital of Sun Yat-Sen University

Central Contacts

Reach out to these primary contacts for questions about participation or study logistics.

YueXin Cai

Role: CONTACT

13825063663

Wenting Deng

Role: CONTACT

15017556968

References

Explore related publications, articles, or registry entries linked to this study.

Martínez, David, Lleida Eduardo, Ortega Alfonso,Miguel Antonio, Villalba Jesús. Voice pathology detection on the Saarbrücken voice database with calibration and fusion of scores using multifocal toolkit. Advances in Speech and Language Technologies for Iberian Languages. Springer, Berlin, Heidelberg, 2012. 99-109

Reference Type BACKGROUND

Hegde S, Shetty S, Rai S, Dodderi T. A Survey on Machine Learning Approaches for Automatic Detection of Voice Disorders. J Voice. 2019 Nov;33(6):947.e11-947.e33. doi: 10.1016/j.jvoice.2018.07.014. Epub 2018 Oct 11.

Reference Type BACKGROUND
PMID: 30316551 (View on PubMed)

Al-Nasheri A, Muhammad G, Alsulaiman M, Ali Z. Investigation of Voice Pathology Detection and Classification on Different Frequency Regions Using Correlation Functions. J Voice. 2017 Jan;31(1):3-15. doi: 10.1016/j.jvoice.2016.01.014. Epub 2016 Mar 15.

Reference Type BACKGROUND
PMID: 26992554 (View on PubMed)

.Chuang, ZY,YuXT,Chen JY, Hsu YT,Xu ZZ,Wang CT,Lin FC,Fang SH. DNN-based approach to detect and classify pathological voice. In Proceedings of the 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, USA, 10-13 December 2018

Reference Type BACKGROUND

Fang SH, Tsao Y, Hsiao MJ, Chen JY, Lai YH, Lin FC, Wang CT. Detection of Pathological Voice Using Cepstrum Vectors: A Deep Learning Approach. J Voice. 2019 Sep;33(5):634-641. doi: 10.1016/j.jvoice.2018.02.003. Epub 2018 Mar 19.

Reference Type BACKGROUND
PMID: 29567049 (View on PubMed)

Bethani Gty As H , Suwandi, Anggraini C D . Classification System Vocal Cords Disease Using Digital Image Processing.The 2019 IEEE International Conference on industry 4.0,Artifical Intelligence,and Communications Technology.2019.129-132

Reference Type BACKGROUND

Unger J, Lohscheller J, Reiter M, Eder K, Betz CS, Schuster M. A noninvasive procedure for early-stage discrimination of malignant and precancerous vocal fold lesions based on laryngeal dynamics analysis. Cancer Res. 2015 Jan 1;75(1):31-9. doi: 10.1158/0008-5472.CAN-14-1458. Epub 2014 Nov 4.

Reference Type BACKGROUND
PMID: 25371410 (View on PubMed)

Xiong H, Lin P, Yu JG, Ye J, Xiao L, Tao Y, Jiang Z, Lin W, Liu M, Xu J, Hu W, Lu Y, Liu H, Li Y, Zheng Y, Yang H. Computer-aided diagnosis of laryngeal cancer via deep learning based on laryngoscopic images. EBioMedicine. 2019 Oct;48:92-99. doi: 10.1016/j.ebiom.2019.08.075. Epub 2019 Oct 5.

Reference Type BACKGROUND
PMID: 31594753 (View on PubMed)

Kim H, Jeon J, Han YJ, Joo Y, Lee J, Lee S, Im S. Convolutional Neural Network Classifies Pathological Voice Change in Laryngeal Cancer with High Accuracy. J Clin Med. 2020 Oct 25;9(11):3415. doi: 10.3390/jcm9113415.

Reference Type BACKGROUND
PMID: 33113785 (View on PubMed)

Godino-Llorente JI, Gomez-Vilda P. Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors. IEEE Trans Biomed Eng. 2004 Feb;51(2):380-4. doi: 10.1109/TBME.2003.820386.

Reference Type BACKGROUND
PMID: 14765711 (View on PubMed)

Ren J, Jing X, Wang J, Ren X, Xu Y, Yang Q, Ma L, Sun Y, Xu W, Yang N, Zou J, Zheng Y, Chen M, Gan W, Xiang T, An J, Liu R, Lv C, Lin K, Zheng X, Lou F, Rao Y, Yang H, Liu K, Liu G, Lu T, Zheng X, Zhao Y. Automatic Recognition of Laryngoscopic Images Using a Deep-Learning Technique. Laryngoscope. 2020 Nov;130(11):E686-E693. doi: 10.1002/lary.28539. Epub 2020 Feb 18.

Reference Type BACKGROUND
PMID: 32068890 (View on PubMed)

Bainbridge KE, Roy N, Losonczy KG, Hoffman HJ, Cohen SM. Voice disorders and associated risk markers among young adults in the United States. Laryngoscope. 2017 Sep;127(9):2093-2099. doi: 10.1002/lary.26465. Epub 2016 Dec 23.

Reference Type RESULT
PMID: 28008619 (View on PubMed)

Other Identifiers

Review additional registry numbers or institutional identifiers associated with this trial.

SYSEC-KY-KS-2022-040

Identifier Type: -

Identifier Source: org_study_id

More Related Trials

Additional clinical trials that may be relevant based on similarity analysis.

Behavioral Abnormalities in Dysphonic Children
NCT05264961 ACTIVE_NOT_RECRUITING NA
Neural Prediction to Enhance Language
NCT05962359 RECRUITING NA
Voice Telerehabilitation
NCT05747859 COMPLETED