DOC PREVIEW
SJSU CS 157A - Association Rule

This preview shows page 1-2-3-4 out of 11 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 11 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 11 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 11 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 11 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 11 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

Association RuleData MiningAssociation Rule MiningMarket Basket AnalysisFamous & Interesting FindingWhy beer and Diapers??Two Certainty IndicesExample: SupportExample: ConfidenceFull example from WikiReferencesAssociation RuleBy Kenneth LeungData MiningThe process of extracting valid, previously unknown, comprehensible, and actionable information from large databases, and using it to make crucial business decisions.Make decision based on previous experience or observationAssociation Rule MiningFormal: To find interesting associations and/or correlation relationships among large set of data items. Association rules show attribute value conditions that occur frequently together in a given dataset.Informal: “If – Then” relationship. If this happen, what is most likely to happen next.Obesity => DiabetesMarket Basket AnalysisA typical and widely-used example of association rule mining.Example:Data are collected using bar-code scanners in supermarkets. Each record will consist of all items in a single purchase transaction.Managers would be interested to know if certain groups of items are consistently purchased together.They could use this data for adjusting store layouts (placing items optimally with respect to each other), for cross-selling, for promotions, for catalog design and to identify customer segments based on buying patterns.Famous & Interesting FindingBeer & Diaper“A number of convenience store clerks noticed that men often bought beer at the same time they bought diapers. The store mined its receipts and proved the clerks' observations correct. So, the store began stocking diapers next to the beer coolers, and salesskyrocketed”Why beer and Diapers??Moms are stressed out by their naughty babies, and they need some beers for relief?Diapers boxes for putting oldbeer bottles. Very environmentalFriendly, and easy handling.Two Certainty IndicesDetermine whether a rule is goodSupport of AR: percentage of transactions that contain X and Y (X and Y are two items)Confidence of AR: Ratio of number of transactions that contain X and Y to the number that contain XThe higher, the more reliable.Example: SupportSupermarket has 100,000 transactions.2000/100,000 transactions include beer800/2000 transactions contain diapersSupport for the rule “beer->diapers” is 800 or 800/100,000 = 0.0008, or 0.8%Example: ConfidenceSupermarket has 100,000 transactions.2000/100,000 transactions include beer800/2000 transactions contain item diapersConfidence for the rule “beer->diapers” is 800/2000 = 0.4, or 40%Full example from Wiki1. {Cold, Raining} => No2. {Calm, Dry} => Yes3. {Dry} => No4. {Windy} => No1. {Cold, Raining} => NoSupport: 2/5 = 40%Confidence: 2/2 = 100%=> Good2. {Calm, Dry} => YesSupport: 2/5 = 40%Confidence: 2/2 = 100%=> Good3. {Dry} => NoSupport: 1/5 = 20%Confidence: 1/3 = 33.3%=> Bad4. {Windy} => NoSupport: 0/5 = 0%Confidence: 1/1 = 100%=>BadReferenceshttp://www.resample.com/xlminer/help/Assocrules/associationrules_intro.htmhttp://en.wikipedia.org/wiki/Association_rule_learningDr Sin-Min Lee’s lecture


View Full Document

SJSU CS 157A - Association Rule

Documents in this Course
SQL

SQL

18 pages

Lecture

Lecture

44 pages

Chapter 1

Chapter 1

56 pages

E-R Model

E-R Model

16 pages

Lecture

Lecture

48 pages

SQL

SQL

15 pages

SQL

SQL

26 pages

Lossless

Lossless

26 pages

SQL

SQL

16 pages

Final 3

Final 3

90 pages

Lecture 3

Lecture 3

22 pages

SQL

SQL

25 pages

Load more
Download Association Rule
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Association Rule and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Association Rule 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?