Modelling of a Speech-to-Text Recognition System for Air Traffic Control and NATO Air Command

Grant Zietsman; Reza Malekian

熱門：朱丽彬黃光男王美玲王善边曾瓊瑤崔雪娟

首頁

臺灣期刊 學校系所學協會民間出版

大陸/海外期刊 政府機關學校系所學協會民間出版

DOI註冊服務


篇名	Modelling of a Speech-to-Text Recognition System for Air Traffic Control and NATO Air Command
並列篇名	Modelling of a Speech-to-Text Recognition System for Air Traffic Control and NATO Air Command
作者	Grant Zietsman、Reza Malekian
英文摘要	Accent invariance in speech recognition is a chal- lenging problem especially in the are of aviation. In this paper a speech recognition system is developed to transcribe accented speech between pilots and air traffic controllers. The system allows handling of accents in continuous speech by modelling phonemes using Hidden Markov Models (HMMs) with Gaussian mixture model (GMM) probability density functions for each state. These phonemes are used to build word models of the NATO phonetic alphabet as well as the numerals 0 to 9 with transcriptions obtained from the Carnegie Mellon University (CMU) pronouncing dictionary. Mel-Frequency Cepstral Co-efficients (MFCC) with delta and delta-delta coefficients are used for the feature extraction process. Amplitude normalisation and covariance scaling is implemented to improve recognition accuracy. A word error rate (WER) of 2% for seen speakers and 22% for unseen speakers is obtained.
起訖頁	1527-1539
關鍵詞	Automatic Speech Recognition (ASR)、Hidden Markov Model (HMM)、Gaussian Mixture Model (GMM)、Mel-Frequency Cepstral Coefficients (MFCC)、Covariance scaling
刊名	網際網路技術學刊
期數	202212 (23:7期)
出版單位	台灣學術網路管理委員會
DOI	10.53106/160792642022122307008 複製DOI
QR Code
該期刊上一篇	Developing a Multifunctional Heating Pad Based on Fuzzy-Edge Computations and IoMT Approach
該期刊下一篇	Rafflesia Optimization Algorithm Applied in the Logistics Distribution Centers Location Problem

教師服務合作出版期刊徵稿聯絡高教高教FB	讀者服務圖書目錄教育期刊訂購服務活動訊息	數位服務高等教育知識庫國際資料庫收錄投審稿系統 DOI註冊	線上購買高點網路書店元照網路書店博客來網路書店	教育資源教育網站國際教育網站	關於高教高教簡介出版授權合作單位
知識達	知識達	知識達	知識達	知識達	知識達