This preview shows page 1 out of 2 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 2 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 2 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

Optical mapsIntroductionOptical mapping is a technology that allows us to experimentally determine the relative position of certain landmarks (usually restriction sites - places where a specific restriction enzyme can cut the DNA - usually recognized by a specific 6-8 bp sequence) along a stretch of DNA. Unlike sequencing, optical maps provide long range information - the fragments being mapped are usually on the order of several 100s of kbp - however the information provided is sparse. Thus optical mapping is useful as a complement to sequencing. Also, it can be used as a cheaper way (than sequencing) of getting information about the global structure of a genome.Computationally an optical map is just an ordered list of sizes, together with estimates of the error in the size estimates, representing the list of gaps between adjacent restriction sites.Mapping algorithmI'll focus here on just the problem of aligning an experimentally determined optical map to an in silico optical map constructed, e.g. from the output of an assembler.Formally, the experimental map is represented as the array:emap={ok, sk, k =1, n }where ok and sk are the size and standard deviation for fragment k.The in silico map is:ismap={ek, k=1, m }where ek is the size of the corresponding fragmentAligning the two maps can be performed pretty easily using a dynamic programming algorithm similar to the sequence alignment algorithm.Specifically, V[i,j] is the score of aligning the first i fragments from the experimental map to the first j fragments from the in silico map.The recurrence equation is:V[i,j] = mink<i,l<j { V[k,l] + score(k..i, l..j)}where score(k..i, l..j) is a score of how well the set of fragments between k..i, and l..j, match each other. This score can be defined as a combination of a Χ2 score and a penalty for missed sites:scorek..i , l..j =∑s= kios−∑t = ljet2∑s= kiss2C i−kl− jwhere C is a constant that can be used to tune the contribution of the two components.Interesting research directions• Can optical maps be used to guide genome assembly?• How do you efficiently align maps to an already sequenced genome to identify structural variants?• How do you efficiently align two maps to each other to identify structural


View Full Document

UMD CMSC 858W - Optical maps

Download Optical maps
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Optical maps and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Optical maps 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?