DOC PREVIEW
Stanford CS 374 - Regulatory Motif Finding

This preview shows page 1-2-3-4-25-26-27-51-52-53-54 out of 54 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 54 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 54 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 54 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 54 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 54 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 54 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 54 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 54 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 54 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 54 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 54 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 54 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

Regulatory Motif FindingOutlineRegulation of GenesOverview of Gene ControlTranscriptional RegulationSlide 6Slide 7Slide 8Slide 9What is a motif?daf-19 Binding Sites in C. elegansMotif RepresentingMotif Logos: an ExampleMeasure of ConservationSlide 15Finding Regulatory MotifsIdentifying Motifs: ComplicationsCurrent Motif Discovery MethodsMotif Finding: Comparative AnalysisMotif Discovery ProcedureAlignment of promoters & 3’ UTRsMotif Conservation Score (MCS)MCSConservation Properties of Regulatory MotifsSlide 25Results: motifs in promotersResults: motifs in 3’ UTRsProperty1: strand specificityProperty2Properties => miRNAThe microRNA pathwayRelationship with miRNA8-mer motifs ->new miRNA genesPrevalence of miRNA regulationSummary: comparative genome analysisNow…Motif Finding: Structural KnowledgeStructure-based approachCys2His2 Zinc Finger protein familyCys2His2 Zinc Finger: Canonical DNA binding modelCys2His2 Zinc Finger: DNA Binding ModelCys2His2 Zinc Finger: Compiling datasetProfile HMMExample: full profile HMMSlide 45Cys2His2 Zinc Finger: Probabilistic ModelEM algorithmEstimate DNA-recognition preferencesApply on TFs from the same familyEvaluationCompare with other preferencesSummaryDiscussionThank you!Regulatory Motif FindingWenxiu MaCS374 Presentation11/03/20052OutlineRegulation of genesRegulatory MotifsMotif RepresentationCurrent Motif Discovery Methods3Regulation of Genes What turns genes on (producing a protein) and off? When is a gene turned on or off? Where (in which cells) is a gene turned on? How many copies of the gene product are produced?4Overview of Gene ControlThe mechanisms that control the expression of genes operate at many levels.source: Molecular Biology of the Cell (4th ed.), A. Johnson, et al.5Transcriptional RegulationThe transcription of each gene is controlled by a regulatory region of DNA relatively near the transcription start site (TSS). two types of fundamental componentsshort DNA regulatory elementsgene regulatory proteins that recognize and bind to them.6Regulation of GenesGeneRegulatory ElementRNA polymerase(Protein)Transcription Factor(Protein)DNAsource: M. Tompa, U. of Washington7Regulation of GenesGeneRNA polymeraseTranscription Factor(Protein)Regulatory ElementDNAsource: M. Tompa, U. of Washington8Regulation of GenesGeneRNA polymeraseTranscription FactorRegulatory ElementDNANew proteinsource: M. Tompa, U. of Washington9OutlineRegulation of genesRegulatory MotifsMotif RepresentationCurrent Motif Discovery Methods10What is a motif?A subsequence (substring) that occurs in multiple sequences with a biological importance.Motifs can be totally constant or have variable elements.Protein Motifs often result from structural features.DNA Motifs (regulatory elements)Binding sites for proteinsShort sequences (5-25)Up to 1000 bp (or farther) from geneInexactly repeating patterns11daf-19 Binding Sites in C. elegansGTTGTCATGGTGACGTTTCCATGGAAACGCTACCATGGCAACGTTACCATAGTAACGTTTCCATGGTAAC che-2 daf-19 osm-1 osm-6 F02D8.3-150 -1source: Peter Swoboda12Motif RepresentingConsensus sequence: a single string with the most likely sequence(+/- wildcards)Regular expression: a string with wildcards, constrained selectionProfile: a list of the letter frequencies at each positionSequence Logo: graphical depiction of a profile conservation of elements in a motif.13Motif Logos: an Example(http://www-lmmb.ncifcrf.gov/~toms/sequencelogo.html)14Measure of ConservationRelative heights of letters reflect their abundance in the alignment.Total height = entropy-based measurement of conservation.Entropy(i) = -SUM { f(base, i)* ln[f(base, i)] }over all basesConservation(i) = 2- Entropy(i)Units of conservation = bits of informationEntropy measures variability/disorder.High conserved = low entropy = tall stackVery variable = high entropy = low stack15OutlineRegulation of genesRegulatory MotifsMotif RepresentationCurrent Motif Discovery Methods16Finding Regulatory MotifsGiven a collection of genes with common expression,Find the (TF-binding) motif in common...17Identifying Motifs: ComplicationsWe do not know the motif sequenceWe do not know where it is located relative to the genes startMotifs can differ slightly from one gene to anotherHow to discern it from “random” motifs?18Current Motif Discovery MethodsGOAL: comprehensive identification of all the regulatory motifs in genomes.by overrepresentationMEME, Gibbs samplingby phylogenetic footprintingFootprinterCross species comparative analysisCombine structure information19Motif Finding: Comparative AnalysisSystematic discovery of regulatory motifs in human promoters and 3' UTRs by comparison of several mammals. Xie, X. et al., Nature (2005).Identify motifs based on comparative analysis of human, mouse, rat and dog genomesA systematic catalogue of human gene regulatory motifsShort, functional sequences (6-10bp) used many times in a genomeFocus regionsPromoters3’ untranslated regions (3’ UTRs)microRNAs (miRNAs)post-transcriptional regulation20Motif Discovery ProcedureAlignment of promoters & 3’ UTRsMotif conservation score (MCS) Measure the extent of excess conservation“Highly conserved motifs”MCS>6Clustering21Alignment of promoters & 3’ UTRsconstruct a whole-genome alignment for the four mammalian genomes Blastz1 and Multiz2Extract the aligned promoter and 3’ UTRs portions respectively.Coordinates: the annotation of NCBI reference sequences (RefSeq)22Motif Conservation Score (MCS)Consensus sequence representationAlphabet size: 11 (A,C,G,T,[AC], [AG], [AT], [CG], [CT], [GT], [ACGT])conserved occurrence of a motif m is an instance in which an exact match to this motif is found in all four species.conservation rate p = ratio of conserved occurrences to total occurrences in humanExpected conservation rate p0 = avg. conservation rate of 100 random motifs,given same length and redundancy.23MCSMCS = # of s.d. by which the observed conservation rate of a motif p exceeds the expected conservation rate p0.p = k/nBinomial probability of observing k out of nEstimated by way of Normal approximation to the binomial Dist. 0 0!( ) (1 )!( )!k n knP k out of n p pk n k-= --0 0 0( ), , (1 )kz where np np pmm ss-= = = -24Conservation Properties of Regulatory MotifsKnown 8-mer


View Full Document

Stanford CS 374 - Regulatory Motif Finding

Documents in this Course
Probcons

Probcons

42 pages

ProtoMap

ProtoMap

19 pages

Lecture 3

Lecture 3

16 pages

Load more
Download Regulatory Motif Finding
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Regulatory Motif Finding and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Regulatory Motif Finding 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?