DOC PREVIEW
CMU BSC 03711 - lecture

This preview shows page 1-2-15-16-31-32 out of 32 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 32 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 32 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 32 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 32 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 32 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 32 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 32 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

1Pairwise sequence alignment (global and local)Multiple sequence pqalignmentlocalglobalSubstitution matricesDatabase searchingBLASTSequence ttitiEvolutionary tree reconstructionRNA structure predictionGene FindingProtein structure predictionstatisticsComputational genomics…Ernst Haeckel(1834-1919)2PhylogenyPhylogeny: An evolutionary tree.Ahypothesisconcerningthe evolutionaryA hypothesis concerningthe evolutionary history of a group of taxaTaxon: a unit of classification–strain, species, individual, gene,p , ,g– also called OTUs(Operational Taxonomic Units)Properties of Trees• Leaf nodes – contemporary taxa•Internal nodes-ancestral taxa•Internal nodes -ancestral taxa• Topology – relationships between species• Branch lengths – degree of changeAAAACTAT3Properties of Trees• If the mutation rate is constant in all lineages (molecular clock hypothesis),lineages (molecular clock hypothesis), the branch lengths are proportional to time.Trees represent the order of branching onlyABC DE†FGFGCDE†AB4One of these trees is not like the othersABCDEABCDEABCDEABCDEABCDEEDCABABCDEVarious types of trees you will seeRRR5ABCDEFABROne of these trees is not like the othersABCDEFRCDEFRABEDCABRFEDCRFPhylogeny reconstructionGiven data observations of contemporary taxa, reconstruct the evolutionarytaxa, reconstruct the evolutionary history.6Data for phylogeny reconstruction•Morphology•Morphology• Behavior• Biochemistry• Molecular and sequence data• Character data: shared characteristics• Distance data: difference between speciesAmerican Scientist7American ScientistSome other examples:•Niche (e g what finches eat)•Niche (e.g., what finches eat)• Biochemistry: serum:anti-serum reactions.• Behavior: Firefly flashing patterns8Multiple Sequence Alignment Glb2;CASgl; ~~~~ALTEKQEALLKQSWEVLKQNIPAHSLRLFALIIEAAPESKYVFSFLKDSGlb2;CASgl; ~~~MALTEKQEALLKQSWEVLKQNIPAHSLRLFALILEAAPESKYVFSFLKDSGlb2;CASgl; ~~~MALTERQEALLKQSWEVLKQNIPGHSLRLFALIIEAAPESKYVFSFLKDSGlb2;CASgl; ~~~~~~~~~~EALLKQSWEVLKQNIPGHSLCLFALIIEAAPESKYVFSFLKDSHUMANMKWVTFISLL FLFSSAYSRG V FRRDA H KSEVAHRFKD LGEENFKALVFour class 2 globins from Casuarina glaucaHUMANMKWVTFISLL FLFSSAYSRG V..FRRDA.H KSEVAHRFKD LGEENFKALVRABIT MKWVTFISLL FLFSSAYSRG V..FRREA.H KSEIAHRFND VGEEHFIGLVPIG ~~WVTFISLL FLFSSAYSRG V..FRRDT.Y KSEIAHRFKD LGEQYFKGLVCHICK MKWVTLISFI FLFSSATSRN LQRFARDAEH KSEIAHRYND LKEETFKAVAAlbumin in four speciesGene Trees vs Species TreesGene sequences can be used to infer the history of speciation–infer the history of speciation– infer the history of gene families9Genes evolve within species17Mouse Rat HumanA species tree represents the evolution of populationsIndividualPopulation18Mouse Rat Human10Allelic variationAllelic variation20Mouse Rat Human21Mouse Rat Human1122Mouse Rat HumanPhylogenetics (usually) assumes that times between species divergences are long enough that allelic variation can be ignored.23Mouse Rat Human12Phylogenetics (usually) assumes that times between species divergences are long enough that allelic variation can be ignored.24Mouse Rat HumanGene Trees vs Species TreesGene sequences can be used to infer the history of speciation–infer the history of speciation– infer the history of gene familiesCaveat emptor: the history of the gene, may not be the same as the history of the organism– gene duplications– horizontal gene transfer13DuplicationA gene tree that doesn’t agree with the species tree.LOSS26G1_FishG2_ChknG1_MouseG2_HumanHuman Mouse Chicken FishLOSSLOSSWhy phylogeny reconstruction?Some applications14Species relationshipsFly Amphioxus Hagfish Lamprey Bony fish MammalsSharkHemoglobin gene family in humanalphaalphaExample: hemoglobinbetabetaα ζ ε γ β δbetaalphaζε αγ αβ, αδtime0 wks 8 wks birthEmbryo Fetus Adult15Character evolutionAmerican Scientist16Bi hBiogeographyFiForensics17Phylogeny reconstruction…atgcaaggagtcgcagagc……atgcgaggtctcgtagtgt…atgggaggtctcccagtgtMdlGiven …atgggaggtctcccagtgt… …atgcgacgtcacgtattgg……atgtgtggtctggcagtga……atgcgacctctcggagaat…Model– Multiple sequence alignment– Model of sequence evolutionfind the (binary) tree that is the best explains the data with respect to the model.Finding the optimal treeGiven k taxa, –Consider all trees withkleaves–Consider all trees with kleaves– Score each tree with respect to chosen evolutionary model.– Select highest scoring tree(s)Phylogeny reconstruction is NP-complete:Except in special cases when the data obeys specific constraints, the only way to find the best tree is to consider all trees.18Character dataPrimitive character:winglessPrimitive character:winglessWingsBees WingsMoths WingsAnts No wingsCentipedes No wingsBeesAntsCentipedesMothsDistance dataMoths Ants CentipedesBees a b cMoths d eAnts fuvBeesAntsCentipedesMothsxyzw19Finding the optimal treeGiven k taxa, –Consider all trees withkleaves–Consider all trees with kleaves¾Score each tree with respect to chosen evolutionary model.– Select highest scoring tree(s)Criteria for evaluating which tree best fits the data:Criteria for evaluating which tree best fits the data:• Maximum parsimony (character data)• Minimum evolution (distance data)• Maximum Likelihood (character data)Multiple Sequence Alignmentas Character Data Each column (or site) is one character.~~~~ALTEKQEALLKQSWEVLKQNIPAHSRLFALIIEAA…~~~MALTEKQEALLKQSWEVLKQNIPAHSRLFALILEAA…~~~MALTERQEALLKQSWEVLKQNIPGHSRLFALIIEAA…~~~~~~~~~~EALLKQSWEVLKQNIPGHSCLFALIIEAA…C1 C2 C3 C4Bees A H S RMoths A H S RAnts G HSRCentipedes G HSC20Multiple Sequence Alignmentas Distance Data HumanRabbitPigChicken~~~~ALTEKQEALLKQSWEVLKQNIPAHSLRLFALIIEAA…~~~MALTEKQEALLKQSWEVLKQNIPAHSLRLFALILEAA…~~~MALTERQEALLKQSWEVLKQNIPGHSLRLFALIIEAA…~~~~~~~~~~EALLKQSWEVLKQNIPGHSLCLFALIIEAA…ChickenPigRabbitHuman HumanRabbitPigChickenHuman 0 3 7 9 Rabbit 0 6 8 Pig 0 6 Chicken 0 Finding the optimal treeGiven k taxa, –Consider all trees withkleaves–Consider all trees with kleaves¾Score each tree with respect to chosen evolutionary model.– Select highest scoring tree(s)Criteria for evaluating which tree best fits the data:Criteria for evaluating which tree best fits the data:¾Maximum parsimony (character data)• Minimum evolution (distance data)• Maximum Likelihood (character data)21Maximum Parsimony: Nature is thriftyThe


View Full Document

CMU BSC 03711 - lecture

Documents in this Course
lecture

lecture

8 pages

Lecture

Lecture

3 pages

Homework

Homework

10 pages

Lecture

Lecture

17 pages

Delsuc05

Delsuc05

15 pages

hmwk1

hmwk1

2 pages

lecture

lecture

6 pages

Lecture

Lecture

10 pages

barnacle4

barnacle4

15 pages

review

review

10 pages

Homework

Homework

10 pages

Midterm

Midterm

12 pages

lecture

lecture

11 pages

Lecture

Lecture

7 pages

Lecture

Lecture

17 pages

Lecture

Lecture

12 pages

Lecture

Lecture

21 pages

Lecture

Lecture

11 pages

Lecture

Lecture

28 pages

Homework

Homework

13 pages

Logistics

Logistics

11 pages

lecture

lecture

11 pages

Lecture

Lecture

8 pages

Lecture

Lecture

9 pages

lecture

lecture

8 pages

Problem

Problem

6 pages

Homework

Homework

10 pages

Lecture

Lecture

9 pages

Problem

Problem

7 pages

hmwk4

hmwk4

7 pages

Problem

Problem

6 pages

lecture

lecture

16 pages

Problem

Problem

8 pages

Problem

Problem

6 pages

Problem

Problem

13 pages

lecture

lecture

9 pages

Problem

Problem

11 pages

Notes

Notes

7 pages

Lecture

Lecture

7 pages

Lecture

Lecture

10 pages

Lecture

Lecture

9 pages

Homework

Homework

15 pages

Lecture

Lecture

16 pages

Problem

Problem

15 pages

Load more
Download lecture
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view lecture and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view lecture 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?