Unformatted text preview:

Integrative Biology 200A "PRINCIPLES OF PHYLOGENETICS" Spring 2006 University of California, Berkeley B.D. Mishler February 26, 2004. MOLECULAR DATA IN SYSTEMATICS I. Techniques - kinds of data -- intrinsically distance-based data: -- immunology (cross reaction of antibodies) -- DNA - DNA hybridization -- AFLPs - RAPDs -DNA fingerprinting -- microsatellites -- character-based data: -- allozymes -- restriction enzyme sites -- sequencing methods -- direct -- cloning -- PCR -- genomic data (gene arrangement) II. Special features of molecular data -- purported advantages: -- closer to (or equal to) the genetic information. -- huge numbers of potential characters, especially useful in organisms with simple morphology. -- ability to homologize across very broad groups. -- independence from morphological characters which are perhaps more subject to adaptive convergence. -- ability to model or weight, because of relatively simple models of change. -- $$$. -- purported weaknesses: -- simplicity of characters (i.e., no ontogeny, few possible character states) leading to special problems with homoplasy. -- sampling problems. -- fossil taxa generally can't be included. -- highly conserved regions, used to reconstruct deep branching points, are perhaps more subject to adaptive convergence. -- $$$. Properties of a good marker, as compared between molecules (i.e., DNA sequence data) and morphology. molecules morphology 1) COMPLEXITY AND COMPARABILITY – + 2) DISCRETE STATES + – 3) HERITABILITY + – 4) INDEPENDENCE ? ? 5) LOW RATE OF CHANGE (λ) ? ? 6) MANY POSSIBLE CHARACTER STATES – +III. Methods of analysis (an overview for now -- more later in the class) A. Phenetic -- molecular systematics is the last hold-out of phenetic methods as used for phylogenetic reconstruction. -- disadvantages: -- usually assumes molecular clock. -- many distance measures used are non-metric, therefore one can't interpret branch lengths. -- hides homoplasy. -- throws away the information on individual characters that was so laboriously obtained. -- advantages: -- ??? (at best able to mimic the results of a phylogenetic analysis) -- Averaging across whole genome? -- Avoiding problem of reticulation? (some argue phenetic methods are OK below species level, as in the field of "phylogeography"). B. Phylogenetic -- many molecular systematists are deeply concerned with adapting standard character-based methods of phylogenetic analysis to these data; most of the issues we have already discussed are involved: -- homology (including alignment problems) -- what is a character? -- character correlations -- physical rearrangements (i.e., deletions, inversions) -- allozymes -- restriction sites: --RFLP's --mapping -- nucelotide positions -- comparing genomes -- synteny, rearrangements, insertions/deletions -- the gene "annotation" problem -- multigene families -- paralogy vs orthology -- the fate of duplicated genes: ghost genes, subfunctionalization -- Exon shuffling -- weighting: -- gains versus losses -- transitions versus transversions -- purines A G -- pyrimidines C T U -- codon positiion bias -- compensatory substitutions in RNA (due to secondary structure) -- comparing different data sets (e.g., morphology versus molecules)-- maximum parsimony versus maximum likelihood (more later) -- the "window of opportunity": how to tell when it's open? -- considerations of lambda (λ) -- compatibility, "signature nucleotides" (i.e., the "true" synapomorphy approach in a new guise) IV. Recommendations (Mishler's Aphorisms): -- treat these data as any other; if the object is phylogeny reconstruction, use phylogenetic methods. -- include all available data in an analysis, even if your own focus has been on molecules; it makes no sense to ignore older data just because newer data have been generated. -- be wary of consensus tree approaches; they may be worthwhile as part of the analysis, but it is probably best to combine all putative homologies into one matrix (perhaps with weighting if this can be independently justified). -- for reconstructing deep splits, it is much better to sequence portions of several different genes, scattered around the nuclear and organelle genomes, than it is to concentrate on extensive sequencing of a single gene (because of the problem of tight selective constraints on any one highly conserved region). Or for that matter, use morphology or genome structure. -- it is probably better to break large surveys down into reasonable local analyses, to avoid spurious homoplasy (e.g., instead of putting all eukaryotes into one huge matrix, work on relationships within smaller, a priori justified monophyletic groups, and later link those groups together using archetypes: "compartmentalization" -- see earlier lecture on characters). -- molecular evolutionary studies and phylogeny reconstruction using molecules are two very different goals; for the former purpose, one should use phylogenies based on morphology (and other characters, perhaps including molecules -- but not the molecules that are being studied evolutionary). Simplified cladogram of the 'many-to-many' relationships of classical nuclear receptors. Triangles indicate expansion within one lineage; bars represent single members. Numbers in parentheses indicate the number of paralogues in each group.From: Biochemistry & Molecular Biology of Plants edited by Bob B. Buchanan, Wilhelm Gruissem, Russell L. Jones. 2000.


View Full Document

Berkeley INTEGBI 200A - MOLECULAR DATA IN SYSTEMATICS

Documents in this Course
Quiz 1

Quiz 1

2 pages

Quiz 1

Quiz 1

4 pages

Quiz 1

Quiz 1

5 pages

Quiz 2

Quiz 2

4 pages

Quiz 1

Quiz 1

2 pages

Quiz 1

Quiz 1

2 pages

Notes

Notes

3 pages

Quiz 2

Quiz 2

3 pages

Load more
Download MOLECULAR DATA IN SYSTEMATICS
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view MOLECULAR DATA IN SYSTEMATICS and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view MOLECULAR DATA IN SYSTEMATICS 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?