1Computational Genomics and Molecular BiologyDannie DurandFall 2004Lecture 1OutlineA whirlwind review of molecular biologyAn overview of computational molecular biologyNew problems in genomicsGenes Encode ProteinsGTGCACCTGACTCCTGAG...V H L T P E...A gene is a DNA sequenceA protein is an amino acid sequenceA protein folds into a 3D structureDNA forms a double stranded helixATGCDNA replication…aggaggcctcgcctctcccagcatgggctggggctcctgtcccccactgtgtgtgcctggggcctggccaggactcccagtga…chromosomeDNAcell2Protein SynthesisDNAProtein Synthesis: TranscriptionDNARNARNA transcriptiondouble-stranded DNA sequence...GTGTCGCTGACTCCTTGGCTACCCGAG......CACAGCGACTGAGGAACCGATGGGCTC...GTGCACRNA transcriptionopen DNA helixGTGCAC aCTGACTCCTTGGCTACCCGAG...GaTGTCGGTGCACCTGACTCCTGAGGTGCACCTGACTCCTGAGCACGTGGACTGAGGACTC...CACaAGCa..CACGTG aGACTGAGGAACCGATGGGCTC...GTGCAC aCTGACTCCTTGGCTACCCGAG...GaTGTCGGTGCACCTGACTCCTGAGGTGCACCTGACTCCTGAGCACGTGGACTGAGGACTC...CACaAGCaCACGTG aGACTGAGGAACCGATGGGCTC...RNA transcriptionRNA transcriptCUGACUCCUUGGCUACCCGAG...RNA• Adenine, Guanine, Cytosine, Uracil (AGCU)• Single stranded• Secondary structure...CUGACUCCUUGGCUACCCGAG......GACTGAGGAACCGATGGGCTC...3RNA Secondary StructureCCGUGAACGUGUACCGGAUUUUUAUUCCC...Protein Synthesis: TranslationtranslationRNAProtein Translation...GUGCACCUGACUCCUGAG...UGAGGATVLHP....messenger RNAamino acid sequencetransfer RNAribosometransfer RNAsecondary structuretertiary structureECUCProtein Translation...GUGCACCUGACUCCUGAG...UGATVLHGGAP....messenger RNAamino acid sequencetransfer RNAribosomeCUCEProtein Translation...GUGCACCUGGUGCACCUGACUCCUGAGGUGCACCUGACUCCUGAGCUCCUGAG...messenger RNAamino acid sequencetransfer RNAribosome....tertiary protein structrure4Gene RegulationIf I have the same set of genes in every cell, how come my liver cells look so different from my skin cells?Only a small number of genes are being translated into protein at any one time…aggaggcctcgcctctcccagcatgggctggggctcctgtcccccactgtgtgtgcctggggcctggccaggactcccagtga…protein coding sequenceA gene is a location on a chromosome that encodes a proteinTranslating Genes into Proteins Bacteriamessenger RNA:amino acid sequence:transcriptiontranslationRNA polymeraseDNA:promoter geneGene Regulation Controls When a Gene Is TranscribedRNA polymeraseDNA:promoter geneGene Regulation Controls When a Gene Is TranscribedRNA polymeraseDNA:promoter generepressortryptophanGene Regulation Controls When a Gene Is TranscribedDNA:promoter generepressortryptophan5Gene Regulation Controls When a Gene Is TranscribedDNA:promoter generepressortryptophanRNA polymeraseTranslating Genes into Proteins:Multicellular organismsmRNA:RNA splicingDNA:intronsexon1 exon2 exon3 exon4amino acid sequencetranslationtranscriptionRNA transcript:Alternative splicingAlternate splice forms:exon6exon1 exon2 exon3 exon5exon1 exon2 exon3 exon4DNA:mRNA:exon1 exon2 exon3 exon4exon6exon5exon1 exon2 exon3 exon4exon6exon5femalemaleGene RegulationIn single cell organisms, gene regulation orchestrates– Responses to changing environment– Cell cycleIn multicellular organisms, gene regulation orchestrates– Tissue type differentiation– Development from embryo to adulthoodA whirlwind review of molecular biologyAn overview of computational molecular biology– Sequence comparison– Reconstruct evolutionary history – Gene prediction – Predict structure from sequence• RNA • ProteinsNew problems in genomicsOutlineThis courseComputational Structural Biology 15-879The Origins of Computational BiologySanger-Coulson sequencingMaxam-Gilbert sequencingGilbert, Sanger winNobel PrizeCongress establishes GenbankHuman Genome Project beginsGenBank goes online.197019801990ARPANETFirst royal emailUSENET newgroupsTCP/IPInternetWorld Wide Web, GopherNCSA MosaicPizza Hut goes on line6Growth of sequence data during the ’90’sCollins et al, Science, Oct 1998National Center for Biotechnology InformationOutlineA whirlwind review of molecular biologyAn overview of computational molecular biology– Sequence comparison– Reconstruct evolutionary history – Gene prediction – Predict structure from sequence• RNA • ProteinsNew problems in genomicsSequence Comparisonvtfisll v..frrda.h ksevahrfkd lgeenfkalv…vtfisll v..frrea.h kseiahrfnd vgeehfiglv…vtfisll v..frrdt.y kseiahrfkd lgeqyfkglv…vtlisfi lqrfardaeh kseiahrynd lkeetfkava……mkwvtfisll flfssaysrg v..frrda.h kseva…mkwvtfisll flfssaysrg v..frrea.h kseia…~~wvtfisll flfssaysrg v..frrdt.y kseia…mkwvtlisfi flfssatsrn lqrfardaeh kseialocal multiple alignmentglobal multiple alignment…atgcaaggagtcccagagcctgagctgactacgt……atgcgaggtctcccagtgtctgaactgactaagt……atgcaag_cgtcccagtgccagaactccctacgt……acc_gtggtctccgagtggctgaactgac_aaca…global pairwise alignment local pairwise alignmentApplications• Database searching• RNA structure• Evolutionary tree reconstruction• Gene finding• Sequence assembly….…atgccaggactcccagtga… …atgccaggactcccagtga……atgccaggactcccagtga…early mammal…atgcaaggagtcccagagc……atgcgaggtctcccagtgt……atgggaggtctcccagtgt……atgcgaggtctcgtagtgt……atgcaaggagtcgcagagc…mousehuman ratWhy sequence data is so powerful: Sequences are related!…atgcaaggagtcgcagagc……atgcgaggtctcgtagtgt……atgggaggtctcccagtgt……atgcaaggagtcccagagc……atgcgaggtctcccagtgt…Sequence similarity => functional similarity…LWDEFNQLGTEMIVTKAGRRMFPTFQVKLFGMDPMADYMLLMDFVPVDDKRYRYAFHS…O…LWDPTFQVEFNQLG……TMFPTFEMIVTKAG……RRMFPTFQVPTFQV……KLFMFPTFGEMDPM……ADYMMCFPTFLLMD……FVPVDDKPTSFQVR…...BLASTOutlineA whirlwind review of molecular biologyAn overview of computational molecular biology– Sequence comparison– Reconstruct evolutionary history – Gene prediction – Predict structure from sequence• RNA • ProteinsNew problems in genomics7…atgccaggactcccagtga…early mammal…atgcaaggagtcccagagc……atgcgaggtctcccagtgt……atgggaggtctcccagtgt…chimpanzee…atgcaaggagtcgcagagc…gorilla…atgcgaggtctcgtagtgt…humanReconstructing Evolutionary History …atgcaaggagtcgcagagc……atgcgaggtctcgtagtgt……atgggaggtctcccagtgt……atgcaaggagtcgcagagc……atgcgaggtctcgtagtgt……atgggaggtctcccagtgt…OutlineA whirlwind review of molecular biologyAn overview of computational molecular biology– Sequence comparison– Reconstruct evolutionary history– Gene prediction– Predict structure from sequence• RNA • ProteinsNew problems in genomicsGene Recognition
View Full Document