Course Details

Subject {L-T-P / C} : EE6145 : Digital Speech Processing {3-0-0 / 3}
Subject Nature : Theory
Coordinator : Prof. Prasanna Kumar Sahu

Syllabus

Fundamentals of Speech: Parameters of Speech: (Pitch frequency, Cepstral Domain, Pitch Period Measurement using Cepstral Domain etc.,) Spectral Parameters of Speech: (Mel Frequency Cepstral Coefficients, Perceptual Linear Prediction, Wavelet transform Analysis of Speech) Linear Prediction of Speech: Speech Quantization and Coding: Speech Processing Applications: Speech Synthesis: (A Text to Speech System, Synthesizer Technologies, Speech Synthesis using other methods, Emotion Recognition from Speech, Watermarking for Authentication of a Speech/Music Signal etc.)

Course Objectives

  1. To provide students with the knowledge of basic characteristics of speech signal in relation to production and hearing of speech by humans
  2. To describe basic algorithms of speech analysis common to many applications
  3. To give an overview of applications (recognition, synthesis, coding) and to inform about practical aspects of speech algorithms implementation.

Course Outcomes

The students will get familiar with basic characteristics of speech signal in relation to production and hearing of speech by humans. They will understand basic algorithms of speech analysis common to many applications. They will be given an overview of applications (recognition, synthesis, coding) and be informed about practical aspects of speech algorithms implementation. The students will be able to design a simple system for speech processing (speech activity detector, recognizer of limited number of isolated words), including its implementation into application programs

Essential Reading

  1. Gold Ben, Nelson Morgan, and Dan Ellis, Speech and Audio signal processing: processing and perception of speech and music, John Wiley & Sons , 2nd Edition, August 2011
  2. S.D Apte, Speech and Audio Processing, Wiley India , Edition, 2015

Supplementary Reading

  1. Rabiner Lawrence R., and Biing-Hwang Juang, Fundamentals of Speech Recognition, Prentice Hall International , 1993
  2. Benesty Jacob, M. Mohan Sondhi, and Yiteng Huang, Handbook of speech processing, Springer , 2007