UCSB ECE 181B - L18(Cortina) - D2830239

Home> Schools> University of California, Santa Barbara> East Asian Cultural Studies (ECE) > ECE 181B> L18(Cortina)

DOC PREVIEW

UCSB ECE 181B - L18(Cortina)

School name University of California, Santa Barbara

Course Ece 181b- Introduction to Computer Vision

Pages 46

This preview shows page 1-2-3-22-23-24-44-45-46 out of 46 pages.

Save

View full document

Premium Document

Do you want full access? Go Premium and unlock all 46 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 46 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 46 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 46 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 46 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 46 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 46 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 46 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 46 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Premium Document

Do you want full access? Go Premium and unlock all 46 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Unformatted text preview:

Slide 1TaskOutlineContent-based Image Retrieval on the WWWPART I: A System for Large-scale, Content-based Image Retrieval on the WWWSystem OverviewVisual Features describe the ImagesCollateral Text as an additional FeatureRetrieval in 2 StepsRetrieval: TextRetrieval – Visual Features (MPEG-7)Retrieval – Challenges for Visual FeaturesA Combined Distance for the MPEG-7 FeaturesClustering speeds up the searchRelevance Feedback Improves the ResultsRelevance Feedback: Query Vector MovementRelevance Feedback: Weight AdapationImplementation – Software and HardwarePart II: The Semantics WithinSemantics: Combining Text and Visual FeaturesIdentifying a Method to find the SemanticsMethod: Data Mining for Semantic CluesFrequent Itemsets and Association RulesExample & AdvantagesUsing FIMI to find the itemsetsDiapers and Beer !!?Characteristics of the Itemsets and RulesExploiting the Itemsets and RulesSelecting Interesting Low-Level Clusters based on RulesThe Visual LinkThe Visual Link: A Graph-Based ApproachThe Visual Link: An ExampleThe Visual Link: An ApproximationDiscussion & DemoDiscussion: PrecisionBefore we continue … some numbersAnd now … the moment you’ve all been waiting for …ConclusionsQuestionsOutlookThanksWhich Rules are of Interest?Characteristics and ChallengesCharacteristics of the Itemsets and Rules - OverallWhy keyword filtering of the results does not workProposal: Semantic ClustersThursday, May 27, 2004A System for Large-scale, Content-based Web Image Retrieval- and the Semantics withinTill QuackThursday, May 27, 2004Create a content-based image retrieval system for the WWWLarge-scale, one order of magnitude larger than existing systems. Means O(106) itemsRelevance FeedbackExplore and exploit the semantics withinTake large-scale, content-based image retrieval one step closer to commercial applicationsTaskThursday, May 27, 2004OutlineContent-based Image Retrieval on the WWWPART I: A System for Image Retrieval on the WWWFeaturesRetrievalRelevance FeedbackSoftware DesignPART II: The Semantics withinIdentifying a Method to find SemanticsData Mining for Semantic CluesFrequent Itemset Mining and Association RulesThe Visual LinkDiscussion & DemonstrationConclusions & OutlookThursday, May 27, 2004Content-based Image Retrieval on the WWWCharacteristics of the data repositorySize: 4.2 billion documents in Google’s indexDiversity: Documents in any context, languageControl: Anybody can publish anythingDynamics: Ever changingSystem RequirementsFASTSCALABLEMake use of all the information availableMotivation for a new systemExisting systems •Either pure text (Google)•Or pure content-basedLarge-ScaleThursday, May 27, 2004PART I: A System for Large-scale, Content-based Image Retrieval on the WWWUllrich MoenichTill QuackLars ThieleThursday, May 27, 2004System OverviewKeywordsVisualFeaturesImage SpiderWorld Wide WebDMOZDataKeywordExtractionFeatureExtractionImage DescriptionImages(Binaries)KeywordIndexingClusteringKeyword RequestNearest Neighbor SearchMatching ImagesUser picksrelevantimagesMatching ImagesInverted Indexkeyid | imageidRetrievalOfflineCluster nCluster 2Cluster 1Cluster nCluster 2Cluster 1Cluster nCluster 2Cluster 1Cluster nCluster 2Cluster 1mySQLThursday, May 27, 2004Visual Features describe the ImagesGlobal Features from MPEG-7 StandardCurrently no Segmentation •Reasons: Scalability and the diversity of the dataTexture FeaturesEdge Histogram Descriptor (EHD)•Histogram of quantified edge directions. 80 dimensionsHomogeneous Texture Descriptor (HTD)•Output of Gabor filter-bank. 62 dimensions.Color FeaturesScalable Color Descriptor (SCD)•Color Histogram. 256, 128, 64 or 32 dimensionsDominant Color Descriptor (DCD)•Up to 8 dominant colors (3d color-space) and their percentages–32 “dimensions”•“Bins” defined for each imageThursday, May 27, 2004Collateral Text as an additional FeatureALT Tag and Collateral Text around imagesVERY uncontrolled annotationStemming: Porter StemmerExample: training -> trainMore matching terms for boolean queriesBut also some new ambiguities •train: to train [verb] / the train [noun]Thursday, May 27, 2004Retrieval in 2 StepsKeywordsVisualFeaturesImage SpiderWorld Wide WebDMOZDataKeywordExtractionFeatureExtractionImage DescriptionImages(Binaries)KeywordIndexingClusteringKeyword RequestNearest Neighbor SearchMatching ImagesUser picksrelevantimagesMatching ImagesInverted Indexkeyid | imageidRetrievalOfflineCluster nCluster 2Cluster 1Cluster nCluster 2Cluster 1Cluster nCluster 2Cluster 1Cluster nCluster 2Cluster 1mySQL1. Text Retrieval2. Visual Nearest Neighbor SearchThursday, May 27, 2004Retrieval: TextOptionsBoolean query on inverted indexVector Space ModelLSI etc.ChoiceRanked boolean queries on inverted indexRanking: tf*idfReasonsSpeedSparsity of data:•600 000 Keywords in total•1 document: 10-50 wordsKeyword ImageId tfshoe 1233 1sport 1233 1red 1233 1banana 1234 1fruit 1234 2Order 1234 1Keyid ImageId tf124 1233 1341 1233 1345 1233 1445 1234 175 1234 2875 1234 1Thursday, May 27, 2004Retrieval – Visual Features (MPEG-7)K-Nearest Neighbor search (K-NN)Find K closest candidates ci to query image q in a vector spaceDistance: Minkowsky Metrics for distance d(ci,q) namely L1 and L2 normsMost MPEG-7 descriptors are high-dimensional vectorsThe “dimensionality curse” appliesHigh dimensional spaces behave “weirdly”In particular the distances are not too meaningfulThursday, May 27, 2004Retrieval – Challenges for Visual FeaturesWe have several (visual) feature types How can we combine them?Our database is very large.How can we search it fast enough?i.e. how can we avoid comparing the query vector with each database entry?Thursday, May 27, 2004A Combined Distance for the MPEG-7 FeaturesWe use a combined distance of all the visual feature types The individual distances occupy different ranges in different distributionsThe distributions were transformed to a normal distribution in the range [0,1]The distances are then combined linearlyThursday, May 27, 2004Clustering speeds up the searchProblemMillions of items in DBLinear search over the whole dataset too slowLooking only for the K nearest neighbors anyway(One) SolutionPartition the data into Clusters, identified by representative, the

View Full Document