© 2000 Todd Neller.
A.I.M.A. t
ext
figures
© 1995 Prentice Hall.
Used by
permission.
Speech Recognition: acoustic
model
•
Question #1: What speech sounds did the
speaker utter? P(signal|words)
–
Human speech has 40-50 sounds called phones
–
characterized by features in acoustic signal (e.g.
frequency, amplitude, duration, etc.)
–
application of machine learning