ASIA unversity:Item 310904400/8868
English  |  正體中文  |  简体中文  |  Items with full text/Total items : 94286/110023 (86%)
Visitors : 21654573      Online Users : 896
RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version


    Please use this identifier to cite or link to this item: http://asiair.asia.edu.tw/ir/handle/310904400/8868


    Title: On the effectiveness of speech enhancement to a proposed speech recognition process that applied to noisy isolated-word recognition
    Authors: Liu, Lih-Cherng;Lu, Ching-T.A.;Tsai, H.O.-Hsuan
    Contributors: Department of Information Communication
    Keywords: Contour followers;Control theory;Cybernetics;Face recognition;Learning algorithms;Learning systems;Robot learning;Signal encoding;Speech analysis;Speech enhancement;Viterbi algorithm;Critical-band-wavelet-packet transform;Distance measures;DTW;EM algorithm;End points;Feature vectors;Isolated words;Left-to-right finite-state machine;Noisy speech recognitions;Optimal models;Pre-processing;Recognition accuracies;Recognition processes;Speech models;Word recognitions
    Date: 2008
    Issue Date: 2010-04-08 12:30:38 (UTC+0)
    Publisher: Asia University
    Abstract: A speech recognition process is proposed and applied to noisy speech recognition. The end points of isolated utterance can be determined accurately enough in the training and recognition process. A function for the distance measure between two speech blocks is derived. The feature vector is then defined. The speech model for each word is represented as a left-to-right finite-state machine. Viterbi algorithm and EM algorithm is applied to obtain the optimal model for each isolated word. When an utterance is to be recognized, the Viterbi algorithm is applied to search for the model which gives the smallest accumulated distance along the optimum path. The performance of the proposed recognition process is evaluated by comparing to the method using DTW. The comparison is in three aspects: namely, the recognition accuracy, the noise-resistance capability, and the effectiveness of speech enhancement pre-processing. Experiments showed that the proposed speech recognition process is superior to DTW in these three aspects. © 2008 IEEE.
    Relation: Proceedings of the 7th International Conference on Machine Learning and Cybernetics, ICMLC 6 :3310-3314
    Appears in Collections:[Department of Information Communication] Proceedings

    Files in This Item:

    File Description SizeFormat
    0KbUnknown531View/Open
    221.doc29KbMicrosoft Word328View/Open


    All items in ASIAIR are protected by copyright, with all rights reserved.


    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - Feedback