Unformatted text preview:

Spoken Language Processing Julia Hirschberg CS 4706 01 13 2019 1 Speech Processing How do you produce sounds that other people interpret as language How does your hearer decode what you are trying to convey In a conversation how do you know when it s your turn to talk Once you decide what you want to say how do you decide how to say it 01 13 2019 How do you decide where to pause in the sentence How do you decide what words to emphasize How do you decide what intonational contour to use How do you convey your own feelings and emotions 2 Applications for Speech Technologies Speech synthesis TTS AT T IBM Jeopardy 2 1416 SitePal Speech recognition ASR Nuance Sphinx HTK Speech to Speech Translation TRANSTAC DARPA Speech Search Google Voice Search Homeland Security Deception Detection Dialect and Language ID and Speaker ID trust Spoken Dialogue Systems Over the phone services Android or Siri Tutoring systems KTH s Ville Amtrak Julie or here 01 13 2019 3 What will we do in this course Learn about fundamental aspects of speech signals and how to analyze them Acoustic prosodic information pitch energy Phonemes and phones sounds of a language Intonation pitch intensity timing Study two basic speech technologies TTS and ASR and their application in Spoken Dialogue Systems SDS Build your own SDS using the Festival and Sphinx Toolkits in a domain of your choice 01 13 2019 4 Sample Domains from Previous Years Pizza orders Movie recommendations Spoken interface to games Colossal Cave Adventure Froggie Voice controlled audio book reader Sports event ticket search Voice Yelp Spoken cookie recipes 01 13 2019 5 Student Clinic Appointment Systems Querying Dow Jones Information via Yahoo Finance NFL player stats iPhone task manager Music browsing 01 13 2019 6 Course information Course syllabus and readings Jurafsky Martin second edition Articles in syllabus Speech tools and speech lab Courseworks discussion course files gradebook TAs Erica Cooper and Rivka Levitan 01 13 2019 7 Projects Build and demo a Spoken Dialogue System Teams of 2 or 3 Organize your own or Advertise for team members on Courseworks discussion category Find a team or Send mail to prof or TAs early 4 Deadlines 01 13 2019 Project Description TTS component using Festival tools ASR component using HTK tools Beta test does your system work SDS demos during final exam period 8 Honor policy on syllabus Late policy on syllabus 5 free late days per project for the semester My office hours M 4 15 6 15 CEPSR 705 Rivka Levitan TBD CEPSR 7LW1 Speech Lab rlevitan cs columbia edu Erica Cooper TBD CEPSR 7LW1 Speech Lab ecooper cs columbia edu 01 13 2019 9 Questions 01 13 2019 10


View Full Document

Columbia CS 4706 - intro

Loading Unlocking...
Login

Join to view intro and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view intro and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?