Data Mining: ApplicationsApplications and Trends in Data MiningData Mining ApplicationsBiomedical Data Mining and DNA AnalysisDNA Analysis: ExamplesData Mining for Financial Data AnalysisFinancial Data MiningData Mining for Retail IndustryData Mining in Retail Industry: ExamplesData Mining for Telecomm. Industry (1)Data Mining for Telecomm. Industry (2)Slide 12How to choose a data mining system?How to Choose a Data Mining System? (2)How to Choose a Data Mining System? (3)Examples of Data Mining Systems (1)Examples of Data Mining Systems (2)Slide 18Visual Data MiningVisual Data Mining & Data VisualizationData Mining Result VisualizationSAS Enterprise Miner: scatter plotsAssociation rules in MineSet 3.0Visualization of a decision tree in MineSet 3.0Cluster groupings in IBM Intelligent MinerData Mining Process VisualizationData Mining Processes by ClementineInteractive Visual Data MiningAudio Data MiningScientific and Statistical Data Mining (1)Scientific and Statistical Data Mining (2)Scientific and Statistical Data Mining (3)Theoretical Foundations of Data Mining (1)Theoretical Foundations of Data Mining (2)Data Mining and Intelligent Query AnsweringData Mining and Intelligent Query Answering (2)Slide 37Is Data Mining a Hype or Will It Be Persistent?Life Cycle of Technology AdoptionSocial Impacts: Threat to PrivacyProtect Privacy and Data SecuritySlide 42Trends in Data Mining (1)Trends in Data Mining (2)Slide 45SummaryReferences (1)References (2)References (3)Data Mining: ApplicationsDr. Hany SaleebApplications and Trends in Data MiningData mining applicationsData mining system products and research prototypesAdditional themes on data miningSocial impact of data miningTrends in data miningSummaryData Mining ApplicationsData mining is a young discipline with wide and diverse applicationsThere is still a nontrivial gap between general principles of data mining and domain-specific, effective data mining tools for particular applicationsSome application domains (covered in this chapter)Biomedical and DNA data analysisFinancial data analysisRetail industryTelecommunication industryBiomedical Data Mining and DNA AnalysisDNA sequences: 4 basic building blocks (nucleotides): adenine (A), cytosine (C), guanine (G), and thymine (T). Gene: a sequence of hundreds of individual nucleotides arranged in a particular orderHumans have around 100,000 genesTremendous number of ways that the nucleotides can be ordered and sequenced to form distinct genesSemantic integration of heterogeneous, distributed genome databasesCurrent: highly distributed, uncontrolled generation and use of a wide variety of DNA dataData cleaning and data integration methods developed in data mining will helpDNA Analysis: ExamplesSimilarity search and comparison among DNA sequencesCompare the frequently occurring patterns of each class (e.g., diseased and healthy)Identify gene sequence patterns that play roles in various diseases Association analysis: identification of co-occurring gene sequencesMost diseases are not triggered by a single gene but by a combination of genes acting togetherAssociation analysis may help determine the kinds of genes that are likely to co-occur together in target samplesPath analysis: linking genes to different disease development stagesDifferent genes may become active at different stages of the diseaseDevelop pharmaceutical interventions that target the different stages separatelyVisualization tools and genetic data analysisData Mining for Financial Data AnalysisFinancial data collected in banks and financial institutions are often relatively complete, reliable, and of high qualityDesign and construction of data warehouses for multidimensional data analysis and data miningView the debt and revenue changes by month, by region, by sector, and by other factorsAccess statistical information such as max, min, total, average, trend, etc.Loan payment prediction/consumer credit policy analysisfeature selection and attribute relevance rankingLoan payment performanceConsumer credit ratingFinancial Data MiningClassification and clustering of customers for targeted marketingmultidimensional segmentation by nearest-neighbor, classification, decision trees, etc. to identify customer groups or associate a new customer to an appropriate customer groupDetection of money laundering and other financial crimesintegration of from multiple DBs (e.g., bank transactions, federal/state crime history DBs)Tools: data visualization, linkage analysis, classification, clustering tools, outlier analysis, and sequential pattern analysis tools (find unusual access sequences)Data Mining for Retail IndustryRetail industry: huge amounts of data on sales, customer shopping history, etc.Applications of retail data mining Identify customer buying behaviorsDiscover customer shopping patterns and trendsImprove the quality of customer serviceAchieve better customer retention and satisfactionEnhance goods consumption ratiosDesign more effective goods transportation and distribution policiesData Mining in Retail Industry: ExamplesDesign and construction of data warehouses based on the benefits of data miningMultidimensional analysis of sales, customers, products, time, and regionAnalysis of the effectiveness of sales campaignsCustomer retention: Analysis of customer loyaltyUse customer loyalty card information to register sequences of purchases of particular customersUse sequential pattern mining to investigate changes in customer consumption or loyaltySuggest adjustments on the pricing and variety of goodsPurchase recommendation and cross-reference of itemsData Mining for Telecomm. Industry (1)A rapidly expanding and highly competitive industry and a great demand for data miningUnderstand the business involvedIdentify telecommunication patternsCatch fraudulent activitiesMake better use of resourcesImprove the quality of serviceMultidimensional analysis of telecommunication dataIntrinsically multidimensional: calling-time, duration, location of caller, location of callee, type of call, etc.Data Mining for Telecomm. Industry (2)Fraudulent pattern analysis and the identification of unusual patternsIdentify potentially fraudulent users and their atypical usage patternsDetect attempts to gain fraudulent entry to customer
View Full Document