MIT 16 412J - SPATIAL INTENTION RECOGNITION USING OPTIMAL MARGIN CLASSIFIERS - D2769020

Home> Schools> Massachusetts Institute of Technology> (16) > 16 412J> SPATIAL INTENTION RECOGNITION USING OPTIMAL MARGIN CLASSIFIERS

DOC PREVIEW

MIT 16 412J - SPATIAL INTENTION RECOGNITION USING OPTIMAL MARGIN CLASSIFIERS

School name Massachusetts Institute of Technology

Course 16 412j- Cognitive Robotics

Pages 6

This preview shows page 1-2 out of 6 pages.

Save

View full document

Premium Document

Do you want full access? Go Premium and unlock all 6 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 6 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Premium Document

Do you want full access? Go Premium and unlock all 6 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Unformatted text preview:

SPATIAL INTENTION RECOGNITION USING OPTIMAL MARGIN CLASSIFIERS Thomas Coffee Shuonan Dong Shen QuIntroduction The high costs of human spaceflight operations favor large investments to optimize astronauts’ time usage during extravehicular activity. These have included extensive expenditures in training, tool development, and spacecraft design for serviceability. However, astronauts’ space suits themselves still encumber more than aid, a focus of several current research programs. Potential improvements face tight integration between suits and astronaut activities, resulting in many mechanical and computational challenges. One major area of work aims to alleviate the difficulties of conducting precise or prolonged movements within a pressurized garment. Powered prosthetic assistance may provide a solution to this problem, but creates key operational challenges. Standard digital or verbal user command interfaces may prove incompatible with such devices, limited by low bandwidth and nonintuitive control structures. Tactile control using, for example, hand or finger gestures seems far more suitable for controlling mechanical effectors, providing high speed and intuitive spatial relationships between command signals and desired actions. Flexibility and robustness in controllers like these will likely require personalized command recognition tailored to individual astronauts. The need for speed and natural facility will make this capability even more indispensable than in, say, speech recognition. Command recognition systems should dynamically adjust their interpretation rules as training data is accumulated, improving their precision and following long-term trends as astronauts develop their working behaviors throughout a career’s worth of extravehicular activity. In this project, we propose a relatively simple gesture-based spatial command recognition system as an analog to more advanced systems suitable for augmenting extravehicular activities with robotic assistance. We aim initially to achieve discrete pattern recognition, with a possible extension to continuous parameter spaces, which may ultimately find favor in many spatial applications. Problem Statement We propose a software agent capable of identifying spatially motivated commands among a finite set indicated by short two-dimensional gestures within the continuous movement stream of a pointing device such as a computer mouse. The agent will construct optimized interpretation rules based on training data sets corresponding to single human users over a period of time, with identifying rules adjusted dynamically during further use. The system may be extended to allow command spaces parameterized by continuous variables. It may also allow users to refine agent interpretations post facto by providing optional explicit clarification after initial training is completed. Background Material Here we provide some background from the most relevant recent literature on statistical learning. This description is adapted from earlier review by the first author. We aim to go beyond elementary classification methods such as binary decision trees to achieve morepowerful yet still feasible pattern recognition (discrete case) and functional estimation (continuous case) capabilities. Weiss et al. (1995) show how to generalize traditional discrete decision trees used for classification to regression trees used for functional estimation. Like decision trees, regression trees perform partitioning based on a disjunctive normal form (DNF) strategy, which has advantages for clarity of knowledge organization and traceability to features. However, for pattern recognition applications, they propose a more general rule-based approach to regression, which eliminates the DNF constraint and can potentially find much more compact representations. This can be important for large spaces, and can potentially find rules with substantially clearer interpretations. Through a number of real-world examples, they show that rule-based regression algorithms using numerical optimization techniques can significantly outperform tree-based methods both in performance and speed. They also show how this approach can be effectively combined with partitioning and nearest-neighbor methods (e.g., bounding pseudo-classes on the basis of a fixed neighborhood population) in order to improve performance still further. They also pursue sample storage compression enhancements with some success. The methods presented here move in the desired direction in terms of generalizing the approach to classification and estimation, and provide concrete algorithms and examples demonstrating their effectiveness in certain situations. However, the fundamental methodology still relies upon complete storage of training samples and a somewhat discretized pseudo-classification approach to pattern decomposition. That is, rule-based regression may not be general enough to provide the kind of dynamic adaptability and scaling to training data desirable in our agent. By contrast, Boser et al. (1992) take a step beyond both tree-based and rule-based parametric decision methods by devising a method to effectively reparameterize the space of observations according to the most useful global indicators. That is, they construct a dynamically generated basis for the input space using "support vectors" optimally chosen to maximize the resolution of the boundary between decision classes. This provides significant improvement on traditional regression-based approaches, which tend to smooth over any atypical patterns as represented in the original input basis. Their training algorithm also grows dynamically with new input data, while incorporating many other linear and nonlinear methods as special cases. Moreover, the authors show how to construct a dual space representation (of reduced dimension) for the actual decision kernel, which allows the underlying quadratic optimization problem to be solved efficiently using standard numerical techniques. The authors demonstrate empirical performance on many classical pattern recognition problems (such as handwritten digit recognition) significantly exceeding other leading algorithms, in some cases even those with pre-defined task-specific models for those problems. This approach demonstrates both generality for uninformed pattern recognition and dynamic adaptation and scaling to enlarging training data sets that we desire for our agent. By comparison to Weiss et

View Full Document