DOC PREVIEW
Stanford CS 262 - Computational Genomics

This preview shows page 1-2-3-4-5-6 out of 18 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 18 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 18 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 18 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 18 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 18 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 18 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 18 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

Computational GenomicsBiology in One Slide: 2 ParadigmsHigh Throughput BiologyGoals of this courseSlide 5Topics in CS262Slide 7Course responsibilitiesReading materialTopic 1. Sequence AlignmentComplete genomesEvolutionEvolution at the DNA levelEvolutionary RatesSequence conservation implies functionSequence AlignmentWhat is a good alignment?Scoring FunctionComputational GenomicsLecture 1, Tuesday April 1, 2003Biology in One Slide: 2 ParadigmsMolecular ParadigmEvolution ParadigmHigh Throughput BiologyBiology is becoming an information science…ACGTGACTGAGGACCGTGCGACTGAGACTGACTGGGTCTAGCTAGACTACGTTTTATATATATATACGTCGTCGTACTGATGACTAGATTACAGACTGATTTAGATACCTGACTGATTTTAAAAAAATATT…Gene ExpressionDNA SequencingGoals of this course•Introduction to Computational BiologyBasic biology for computer scientistsBreadth: mention many topics & applications•In-depth coverage of Computational GenomicsAlgorithms for sequence analysisCurrent applications, trends, and open problems• Coverage of useful algorithmsHidden Markov modelsDynamic ProgrammingString algorithmsApplications of AI techniquesTopics in CS262Part 1: In-depth coverage of basic computational methods for analysis of biological sequencesSequence Alignment & Dynamic ProgrammingHidden Markov modelsThese methods are used heavily in most genomics applications:DNA sequencingComparison of DNA and proteins across organismsDiscovery of genes, promoters, regulatory sitesTopics in CS262Part 2: Topics in computational genomics, more algorithms, and areas of active researchDNA sequencing & assembly: reading a complete genome such as the human DNAGene finding: marking genes on the DNA sequenceLarge-scale comparative genomics: comparing whole genomes from multiple organismsMicroarrays & regulation: understanding the regulatory code, and potential disease-causing genesRNA structure: predicting the folding of RNAPhylogeny and evolution: quantifying the evolution of biological sequencesCourse responsibilities•Homeworks [72%]4 challenging problem sets, 4-5 problems/psetCollaboration allowed – please give creditHws due Thursday, solutions explained FridayTwo worst problems in all hws do not count•Final [18%]Takehome, 1 dayCollaboration not allowedBasic questions – much easier than homeworks•Scribing [10%]Due one week after the lecture, except special permissionReading material•Books“Biological sequence analysis” by Durbin, Eddy, Krogh, Mitchinson•Chapters 1-4, 6, (7-8), (9-10)“Algorithms on strings, trees, and sequences” by Gusfield•Chapters (5-7), 11-12, (13), 14, (17)•Papers•Lecture notesTopic 1. Sequence AlignmentComplete genomesEvolutionEvolution at the DNA level…ACGGTGCAGTCACCA……ACGTTGCAGTCCACCA…CSEQUENCE EDITS REARRANGEMENTSEvolutionary Rates OKOKOKXXStill OK?next generationChanges in non-functional sites are OK, so will be propagatedMost changes in functional sites are deleterious and will be rejectedSequence conservation implies functionInterleukin region in human and mouse100%40%Sequence Alignment-AGGCTATCACCTGACCTCCAGGCCGA--TGCCC---TAG-CTATCAC--GACCGC--GGTCGATTTGCCCGACDefinitionGiven two strings x = x1x2...xM, y = y1y2…yN,an alignment is an assignment of gaps to positions0,…, M in x, and 0,…, N in y, so as to line up each letter in one sequence with either a letter, or a gapin the other sequenceAGGCTATCACCTGACCTCCAGGCCGATGCCCTAGCTATCACGACCGCGGTCGATTTGCCCGACWhat is a good alignment?Alignment: The “best” way to match the letters of one sequence with those of the otherHow do we define “best”?Alignment:A hypothesis that the two sequences come from a common ancestor through sequence editsParsimonious explanation:Find the minimum number of edits that transform one sequence into the otherScoring Function•Sequence edits:AGGCCTCMutations AGGACTCInsertionsAGGGCCTCDeletionsAGG.CTCScoring Function:Match: +mMismatch: -sGap: -dScore F = (# matches)  m - (# mismatches)  s – (#gaps) 


View Full Document

Stanford CS 262 - Computational Genomics

Documents in this Course
Lecture 8

Lecture 8

38 pages

Lecture 7

Lecture 7

27 pages

Lecture 4

Lecture 4

12 pages

Lecture 1

Lecture 1

11 pages

Biology

Biology

54 pages

Lecture 7

Lecture 7

45 pages

Load more
Download Computational Genomics
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Computational Genomics and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Computational Genomics 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?