Unformatted text preview:

Part 23 Data MiningCopyright  1971-2002 Thomas P. Sturm Data Mining Part 23, Page 2 OLAP (On-Line Analytical Processing) Oracle OLAP Products Oracle Express Oracle Express Web Agent Oracle Express Objects Oracle Express Analyzer Oracle Financial Analyzer Oracle Financial Controller Oracle Sales Analyzer OLAP Uses Lets users play with information stored in the data warehouse Can view the data from a variety of angles Can analyze sales over time, by region, by product typeCopyright  1971-2002 Thomas P. Sturm Data Mining Part 23, Page 3 Categories of OLAP Tools DOLAP - Desktop OLAP PC tools with limited analysis of small warehouses ROLAP - Relational OLAP Server-based tools for savvy relational database users MOLAP - Multidimensional OLAP Server-based tools for read-only pre-computed cubes of dataCopyright  1971-2002 Thomas P. Sturm Data Mining Part 23, Page 4 Star SchemasCopyright  1971-2002 Thomas P. Sturm Data Mining Part 23, Page 5 Data Mining Discover hidden patterns in data Also known as Knowledge Discovery in Databases (KDD) Pattern has not yet been identified by the end user Discover anomalies in the data One pattern is inconsistent with other patterns Identify bad data, bad reporting, bad modeling, vs. real differences that need to be dealt with or exploited Pattern: A pattern is an inherent relationship and/or distributions in the data under consideration Data: Data is the collection stored in the Data Warehouse environmentCopyright  1971-2002 Thomas P. Sturm Data Mining Part 23, Page 6 Data Mining Initial Process Problem identification Determine the scope of the problem Determine if the data is available to address the problem Data preparation Data collection Data cleaning Data reduction, including removal of useless variables and sampling of large datasets Data transformation to a limited set of numerical values Model formulation Select a model and technique for mining the data Data analysis/Pattern exploration/Data classification Exact approach will depend on modelCopyright  1971-2002 Thomas P. Sturm Data Mining Part 23, Page 7 Data Mining Continuing Process Validation and iteration Test model against unknown cases that were not used to generate the model Verify validity of model Modify as necessary Must be able to predict unusual cases Knowledge acquisition Run model against all cases Generate final statistics Interpretation Translate numbers into words Build a theory of behavior based on subject matter knowledge that is consistent with the facts Implementation Develop a plan of action that is theoretically sound under the theory developedCopyright  1971-2002 Thomas P. Sturm Data Mining Part 23, Page 8 Fields used by Data Mining Statistics Means, variances, normal distribution, regression (single and multiple, linear and non-linear), ANOVA Artificial Intelligence Pattern recognition, deductive reasoning, vision algorithms, expert systems, genetic algorithms Databases Clustering, pattern bit maps Operations Research Linear programming, Lagrangian multipliers, branch and bound, integer programmingCopyright  1971-2002 Thomas P. Sturm Data Mining Part 23, Page 9 Business Applications of Data Mining Market segmentation Market basket analysis Fraud detection Customer retention Forecasting Investment management Manufacturing and productionCopyright  1971-2002 Thomas P. Sturm Data Mining Part 23, Page 10 Data Mining Tasks Classification Clustering Association Prediction Summarization VisualizationCopyright  1971-2002 Thomas P. Sturm Data Mining Part 23, Page 11 Data Mining Methods Statistical regression Neural networks Decision trees Multivariate statistical methods Mathematical programming Case-based reasoning Genetic algorithmsCopyright  1971-2002 Thomas P. Sturm Data Mining Part 23, Page


View Full Document

UST QMCS 450 - Part 23 Data Mining

Download Part 23 Data Mining
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Part 23 Data Mining and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Part 23 Data Mining 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?