China Journal of Oral and Maxillofacial Surgery ›› 2022, Vol. 20 ›› Issue (2): 151-157. doi: 10.19438/j.cjoms.2022.02.009

• Original Articles •

Preliminary establishment of a Mandarin speech database of 369 Chinese oral cancer patients

XIAO Yu-dong, GUO Kai-xin, YANG Le, DENG Wei, ZENG Bin, ZHANG Si-en, LAO Xiao-mei, LIAO Gui-qing, LIANG Yu-jie   

  1. Department of Oral and Maxillofacial Surgery, Guanghua School of Stomatology, Hospital of Stomatology, Sun Yat-sen University; Guangdong Provincial Key Laboratory of Stomatology, Guangzhou 510055, Guangdong Province, China
  • Received: 2021-07-30 Revised: 2021-11-26 Online: 2022-03-20 Published: 2022-03-20

Abstract: PURPOSE: To establish a Mandarin speech database of Chinese oral cancer patients by collecting speech samples before and after surgery, so as to provide a data platform for the clinical diagnosis, treatment and rehabilitation of these patients. METHODS: Patients were recruited from the Department of Oral and Maxillofacial Surgery, Hospital of Stomatology, Sun Yat-sen University. A variety of speech corpora were used to assess the participants and collect speech samples, which then underwent uniform pre-processing, segmentation, annotation and anonymization. Speech samples from relatively healthy controls were also collected. RESULTS: A total of 481 individual speakers were enrolled from July 2017 to April 2021. The mean age at first assessment was 46.98±16.34 years. The participants comprised 274 males (57.0%) and 207 females (43.0%). Of them, 369 were patients with oral cancer (76.7%), 79 were healthy subjects (16.4%) and 33 were subjects with non-malignant lesions (6.9%). Among the oral cancer patients, 258 had primary lesions located on the tongue and floor of the mouth, and 202 were recruited before surgery; their T classifications were T1 in 35 cases (17.3%), T2 in 68 cases (33.7%), T3 in 41 cases (20.3%) and T4 in 58 cases (28.7%). The median follow-up (assessment) time was 219 d after surgery (IQR: 87.5 to 587 d), with a total of 1 100 appointments. A total of 73 008 isolated audio samples were obtained from 461 corpus stimuli, which consisted of vowels (6, 1.30%), diadochokinesis syllables (7, 1.51%), single words (238, 51.63%), phrases (169, 36.66%) and sentences (38, 8.24%). CONCLUSIONS: This study established the first speech database whose samples come predominantly from oral cancer patients, providing important data support for in-depth study of clinical biomarkers and for the development of individualized speech rehabilitation.

Key words: Database, Speech, Oral cancer, Mandarin Chinese
