Purdue CS 490D - Introduction to Data Mining

Unformatted text preview:

1CS490D:Introduction to Data MiningProf. Chris CliftonApril 19, 2004More on Associations/RulesProject: Association• Several people have suggested association rule mining– Good idea• Issues– Most data is not “itemsets”, but scaled values– Is apriori the right algorithm?• Ideas– Try Tertius– Decision rules (need to define class)2Tertius• First-order logic learning– Similar to ILP we have been discussing• Provide structural schema definition– Individual: What is an individual (for counting purposes)– Structural: Define relationships between individuals– Properties: Things that describe an individual•http://www.cs.bris.ac.uk/Research/MachineLearning/Tertius/index.htmlTertius: Use• Call: Specify number of literals, number of variables in rules– Finds strongest k rules• Predicate File– Parent 2 person person cwa– Daughter 2 person person cwa– Male 1 person cwa• Fact File– Parent(Chris, Denise)– Daughter(Denise, Chris)– Female(Denise)3JRIP/Ripper• Decision Rules– Like association rules– But need target class (right hand side)• Idea:– Grow rule– Prune rule– If good then keep– Repeat• Growth: Based on information gain• Prune: p+N-n /


View Full Document

Purdue CS 490D - Introduction to Data Mining

Download Introduction to Data Mining
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Introduction to Data Mining and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Introduction to Data Mining 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?