Johns Hopkins EN 600 461 - Computer Vision Grouping and Segmentation - D2030037

Home> Schools> Johns Hopkins University> EN Computer Science (EN 600) > EN 600 461> Computer Vision Grouping and Segmentation

DOC PREVIEW

Johns Hopkins EN 600 461 - Computer Vision Grouping and Segmentation

School name Johns Hopkins University

Course En 600 461- Computer Vision

Pages 17

This preview shows page 1-2-3-4-5-6 out of 17 pages.

Save

View full document

Premium Document

Do you want full access? Go Premium and unlock all 17 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 17 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 17 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 17 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 17 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 17 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Premium Document

Do you want full access? Go Premium and unlock all 17 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Unformatted text preview:

110/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. ForsythComputer VisionGrouping and SegmentationProfessor Hagerhttp://www.cs.jhu.edu/~hager10/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. ForsythGrouping and Segmentation• G&S appear to be one of the early processes in human vision• They are a way of *organizing* image content into “semantically related” groups• In some applications, segmentation is the crucial step (e.g. some types of aerial image interpretation).• We’ll just touch on a few simple approaches of image grouping and segmentation• We’ll also present a couple of useful geometric grouping methods.210/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. ForsythGrouping• Grouping is the process of associating similar image features together10/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. ForsythGrouping• Grouping is the process of associating similar image features together• The Gestalt School:– Proximity:tokens that are nearby tend to be grouped.– Similarity:similar tokens tend to be grouped together.– Common fate:tokens that have coherent motion tend to be grouped together.– Common region:tokens that lie inside the same closed region tend to be grouped together.– Parallelism:parallel curves or tokens tend to be grouped together.– Closure:tokens or curves that tend to lead to closed curves tend to be grouped together.– Symmetry:curves that lead to symmetric groups are grouped together.– Continuity:tokens that lead to “continuous ” (as in “joining up nicely ”, rather than in the formal sense): curves tend to be grouped.– Familiar Conguration:tokens that, when grouped, lead to a familiar object,tend to be grouped together310/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. Forsyth10/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. Forsyth410/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. Forsyth10/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. Forsyth510/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. Forsyth10/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. ForsythThe Hough Transform• How can we detect (group) extended line structures in images?y = m x + bxybm610/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. ForsythThe Hough Transform• How can we detect extended line structures in images?b = - x m + ymbxy10/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. ForsythHough Transform Idea• Each edge point in an image is a constraint on line parameters:– constraint is a line– each unique point adds another constraint• Algorithm:– Initialize a 2-D array of counters to zero.– For each edge point (x,y), increment any counter which contains a parameter point (b,m) satisfying b = -x m + y– Threshold counters– Group edgels that belong to above threshold counters into contours– Optional: Refit lines to get higher precision.710/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. ForsythHough Transform Variation• Problem:– m and b are, in principle, unbounded• rewrite as sin(t) x + cos(t) y = d• For each discrete value of t from 0 to pi, increment counter with nominal d minimal distance from exact value– coarse grid leads to grouping of distinct lines• in post-fitting stage, re-hough at higher resolution• Generalizations– Any linear in parameters model: e.g a x2+ b y2= 1 can use the same algorithm– For f(x,a) = 0, choose any cell ac s.t. f(x,ac) < t for some threshold t.• Limitations:– the curse of dimensionality10/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. ForsythRANSAC: A More General Method• The Hough transform has the nice feature that it provides a method for detecting geometric structure in clutter.• The HT suffers from the curse of dimensionality.• RANSAC (Random Sample Consensus) is a robust estimation technique that avoids (to some degree) the curse of dimensionality810/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. ForsythRANSAC: General IdeaHow to find the line in this?10/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. ForsythRANSAC: General IdeaBasic observation: if we wereto choose the “right” pair of points910/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. ForsythRANSAC: General IdeaBasic observation: if we wereto choose the “right” pair of points then it would be fairlyobvious that many other pointswould agreeAlgorithm:For some number of tries Npick a pair of pointsfit a line to these pointscount the number of other pts that agreeReturn: line parameters with bestagreement10/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. ForsythRANSAC: General IdeaAlgorithm:For some number of tries Npick a pair of pointsfit a line to these pointscount the number of other pts that agreeReturn: line parameters with bestagreementQuestions: 1) How many points2) What do we mean “agree”1010/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. ForsythRANSAC• # of points is N = log(1-p)/log(1-(1-ε)s)– assumes sampling is independent– p is the probability one of the samples is all inliers– s is sample size (2 for us)–ε is percentage of outliers• Consensus:– Assume the points (inliners) are contaminated with Gaussian noise– Distance to the line is a χ2mmodel where (in our case) m =1• Choose a probability threshold from χ21giving a distance t= 3.82 σ2 for α = .95• inliner if distance2< t2• outlier if distance2>= t210/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. ForsythImage Segmentation: A Brief Overview• Segmentation– criteria– region group and counting1110/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. ForsythSegmentation: Definitions• An affinity measure d(R1,R2) Æ or a homogeneity measure m(R) – note possibly d(R1,R2) = |m(R1) – M(R2)|• An threshold τ (could operate on distance or homogeneity)• A region definition (e.g. square tiles)• A neighborhood definition– 4 neighbors– 8 neighborsx x 0 xxx x xx 0 xx x x10/2/2003CS 461, Copyright G.D. Hagerwith slides shamelessly stolen from D. ForsythSimple

View Full Document