DOC PREVIEW
MIT 6 893 - An Oxygenated  Presentation Manager

This preview shows page 1-2-3-23-24-25-26-46-47-48 out of 48 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 48 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 48 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 48 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 48 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 48 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 48 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 48 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 48 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 48 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 48 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 48 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

An Oxygenated Presentation ManagerGoals & OverviewApplication ScenarioAn Oxygen ApplicationCamera watching laser point on screenCamera watching laser point on screen (cont)Microphone listening to speakerSpeaker controlling presentation via ipaqOutput to projector, handheld, archiveProcessing – controlling sessionMulti-Modal InputInitial Experience With Presentation ManagerA Better Solution: Multiple ContextsExtending the Galaxy SystemPowerPoint PresentationSystem StructureSlide 17Slide 18Add Recognizer for T9Add Recognizer for GraffitiOther Input ModesReplacing the RecognizersConclusionCritique of Presentation ManagerVision / Gesture RecognitionCritique of Vision / Gesture RecognitionSpeech RecognitionMore useful aspect – Output modalitiesRecord and correlate notes with presentationCORE: Communication Oriented Routing EnvironmentAssumptionsSlide 32Communications-Oriented ProgramsCOREMessage FlowCORE Language ElementsNode handler = (nickname, specifier)Node Statement HandlerLinksLink Statement HandlerMessagesMessage HandlingRulesRule Statement HandlerRules – A link is a ruleRules – Access Control ListsSlide 47Reversibility1L C SLarry Rudolph & Shalini AgarwalAn Oxygenated Presentation ManagerLarry RudolphOxygen Workshop, January, 20022L C SLarry Rudolph & Shalini AgarwalGoals & Overview•Integrate Many Oxygen Technologies•Application Driven–Use an application that we understand–Personally use often–Would help if were more human-centric–Portable (as opposed to E-21)•Develop Architectural Infrastructure–Exposes new requirements•Critique of Presentation Manager–What is wrong with it–What needs improvement3L C SLarry Rudolph & Shalini AgarwalApplication Scenario4L C SLarry Rudolph & Shalini AgarwalAn Oxygen ApplicationComponents–Input*Vision*Speech*Touch–Processing*Changing configuration–Output*Projector*Handheld*ArchiveEquipment –Today, it is too hard  –Linux laptop; windows laptop; camera; microphone; network; projector; power blocks–Tomorrow, much easier –a couple of H21’s5L C SLarry Rudolph & Shalini AgarwalCamera watching laser point on screen•Camera Challenges*Inexpensive ones have wrong focal length*Alignment issues•Use edge of screen, display pattern, figure out from what is known to be visible•We ended up displaying a pattern of concentric circles*Relative size of laser point depends on distance•Beyond ten feet, had to use only certain types of lasers•Could slow-down camera and let pixels saturate (too complicated)6L C SLarry Rudolph & Shalini AgarwalCamera watching laser point on screen (cont)•Camera Interface*Click at point (x,y)•Hold laser at same location for 5 seconds*Select horizontal line ( (x1,y1) , (x1,y2) )•Sweep laser back and forth, line is diameter of ellipse*Select object centered at point (x,y)•Sweep laser in circle, point is center of circle*Previous or Next •Click in left (right) 1/8 of screen7L C SLarry Rudolph & Shalini AgarwalMicrophone listening to speaker•Microphone–Many technologies;–Lapel-mic; mic array; room microphone–Current approach: ipaq*Continuous recognition*Push to speak•Audio server on ipaq–Detects start and stop *Best results when human pushes to start and releases to stop–Audio wave file sent to Galaxy speech system•Galaxy output actions via CGI-script–A nice unifying mechanism–One more complicated component8L C SLarry Rudolph & Shalini AgarwalSpeaker controlling presentation via ipaq•Ipaq output to CGI-script Server–Same actions as from speech server•Action are*Next slide, Previous slide, Goto slide #n, Goto slide named <xxx>*Next item, Previous item, Goto item #n, Goto item named <xxx>*Next animations, previous animation, goto animation #n*Start presentation <name>, End presentation, Pause presentation*Initialize Camera, test microphone•Handheld (Ipaq) display–GUI generated from speechbuilder grammar–List of slides, items per slides*Currently use ad-hoc solution where power-point sends lists to ipaq. Need more automatic solution9L C SLarry Rudolph & Shalini AgarwalOutput to projector, handheld, archive•Unlimited number of video / audio output producers –E.g. powerpoint just one producer of output–At any time, each output device has an associated producer*This producer can receive input from several producers•Handheld has proxy –To reduce bandwidth to ipaq–Current slide, list of slides, list of commands•Archive–Each slide shown, audio (from a different microphone) sent to archive*Currently just gif of current slide10L C SLarry Rudolph & Shalini AgarwalProcessing – controlling session•Do not let powerpoint control the world–Slide viewer; movie player; program execution; browser; etc–Want to mix all types of applications–Presenter has control of the output*Eg: Switch output producer from powerpoint to media player•Remove interrupting technologies–Dynamically disconnect any input / output source•All done via core language–Or some other glue language, e.g. meta-glue–Which does all the other infrastructure issuesMulti-Modal InputShalini AgarwalOxygen ConferenceJanuary 8th, 200212L C SLarry Rudolph & Shalini AgarwalInitial Experience With Presentation ManagerOne Single Monolithic ContextCommand within slide, between slides, between applicationsProblemToo many false positivesPreliminary SolutionSlide tracking*e.g. recognize “Next Slide” command only after at least 60% of words on slide have been said*e.g. recognize “Show Demo” only after slide 17Still lots of problems*Many slide styles hard to track (e.g. figures not words on slide)*Tracking for within slide different than for between slides13L C SLarry Rudolph & Shalini AgarwalA Better Solution: Multiple ContextsVery Active Research AreaIntelligent-room project; Galaxy; OthersThree layers, each having its own context1. Slide (Next Item, Next Animation)2. Presentation (Next Slide, Goto Conclusion, Goto Example)3. Session (Start Presentation, Switch to Browser, Show Questions)Challenges–Each context requires its own speech recognition system–Multicasting sound wave to each system–Selecting the best result14L C SLarry Rudolph & Shalini AgarwalExtending the Galaxy System•Start with context for speech and then extend•Note, our goals are similar but not identical to those of the Spoken Language Group•We are not dialog-based•Exploit their


View Full Document

MIT 6 893 - An Oxygenated  Presentation Manager

Documents in this Course
Toolkits

Toolkits

16 pages

Cricket

Cricket

29 pages

Quiz 1

Quiz 1

8 pages

Security

Security

28 pages

Load more
Download An Oxygenated  Presentation Manager
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view An Oxygenated  Presentation Manager and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view An Oxygenated  Presentation Manager 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?