E9 261 (JAN) 3:1 Speech Information Processing



Speech Information Processing
January-April, 2021

Announcements:
February 18, 2021: First lecture will be held online on February 26, 2021 (Friday) at 4PM.
February 18, 2021: If you are attending the course (credit or audit), please fill up this form (on or before March 7, 2021) to join the class email list. Once you fill up the form, you will be added to the Microsoft Teams for this course
March 1, 2021: Today's class will be held during 4:45pm-5:45pm. March 2, 2021: Assignment1 is due on March 15, 2021.

Instructor:
Prasanta Kumar Ghosh
Office: EE C 330
Phone: +91 (80) 2293 2694
prasantg AT iisc.ac.in

Teaching Assistant(s):


Class meetings:
5:00pm to 6:00pm every Monday, Wednesday and Friday (Venue: Online)


Course Content:
  • Speech communication and overview
  • Time varying signals/sys
  • Spectrograms and applications
  • Speech parameterization/representation
  • AM-FM, sinusoidal models for speech
  • Linear Prediction, AR and ARMA modeling of speech.
  • Sequence modeling of speech
  • Speech applications - Automatic speech recognition, Text-to-speech synthesis


Prerequisites:
Digital Signal Processing, Probability and Random Processes


Textbooks:
    • Fundamentals of speech recognition, Rabiner and Juang, Prentice Hall, 1993.
    • Automatic Speech Recognition, A Deep Learning Approach, Authors: Yu, Dong, Deng, Li, Springer, 2014.
    • Discrete-Time Speech Signal Processing: Principles and Practice, Thomas F. Quatieri, Prentice Hall, 2001.
    • Digital Processing of Speech Signals, Lawrence R. Rabiner, Pearson Education, 2008.
    • "Automatic Speech Recognition - A deep learning approach" - Dong Yu, Li Deng.


Web Links:
The Edinburgh Speech Tools Library
Speech Signal Processing Toolkit (SPTK)
Hidden Markov Model Toolkit (HTK)
ICSI Speech Group Tools
VOICEBOX: Speech Processing Toolbox for MATLAB
Praat: doing phonetics by computer
Audacity
SoX - Sound eXchange
HMM-based Speech Synthesis System (HTS)
International Phonetic Association (IPA)
Type IPA phonetic symbols
CMU dictionary
Co-articulation and phonology by Ohala
Assisted Listening Using a Headset
Headphone-Based Spatial Sound
Pitch Perception
Head-Related Transfer Functions and Virtual Auditory Display
Signal reconstruction from STFT magnitude: a state of the art
On the usefulness of STFT phase spectrum in human listening tests
Experimental comparison between stationary and nonstationary formulations of linear prediction applied to voiced speech analysis
A modified autocorrelation method of linear prediction for pitch-synchronous analysis of voiced speech
Linear prediction: A tutorial review
Energy separation in signal modulations with application to speech analysis
Nonlinear Speech Modeling and Applications


Grading:
  • Assignments including recording (20 points) - Average of all assignments will be considered. Assignments will include associated recordings. Cheating or violating academic integrity (see below) will result in failing in the course. Turning in identical homework sets counts as cheating.
  • Midterm exam (10 points) - 2 midterm exams. Missed exams earn 0 points. No make-up exams. An average of the midterm scores will be considered.
  • Final exam (20 points)
  • Surprise Test (30 points) - 12-15 in-class surprise tests (each 3 points) will be conducted and the best ten will be considered.
  • Project (20 points) - Quality/Quantity of work (10 points), Report (5 points), Presentation (5 points).


Topics covered:
Date
Topics
Remarks
Feb 26
Course logistics, Information in speech, speech chain, speech research - science and technology
Mar 1
Phonemes, allophones, diphones, morphemes, lexicon, consonant cluster.
IPA, ARPABET, Grapheme-to-Phoneme conversion
Mar 3
Summary of phonetics and phonology, manner and place of articulation
Mar 5
Intonation, stress, co-articulation, Assimilation, Elision, speech production models, formants, Human auditory system, auditory modeling, Cochlear signal processing.










Your Voice/Program files to upload:
Click to upload (max 10Mb)





Your feedback on this course (any time):
Click here





Academic Honesty:
As students of IISc, we expect you to adhere to the highest standards of academic honesty and integrity.
Please read the IISc academic integrity.