DOC PREVIEW
Stanford CS 262 - Functional Genomics & Network Integration

This preview shows page 1-2 out of 5 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 5 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 5 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 5 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

CS 262 – Computational Genomics Prof. Serafim Batzoglou Functional Genomics & Network Integration March 14, 2006 –Guest Lecture: Balaji S. Srinivasan Scribe: Mike Polcari What we’ve learned this semester - sequencing & aligning genomic data – has a unifying goal. That is: to use the genomic data as a platform for biological analysis. This lecture covers the following topics: Functional Genomics: How can we use methods of functional genomics to assay individual proteins or make inferences about pairs of proteins? Network Integration: How can we stack multiple data-sets together in systematic ways, to discover? Further Directions Functional genomics covers: -determining function on a genomic scale. This has only been possible in last 10 years with whole genomes -large scale assays for DNA, RNA, protein properties -The “omics” (proteomics,metabolomics, glycomics) are very fashionable Examples: Micro-arrays – study different times, conditions etc in each experiment and cluster genes based on their reaction. Pheno-arrays – take mutant strains and see how they survive –this can tell us: what is the function of the underlying deleted gene?Taxonomy- presence or absence across species can tell us what a protein does. Each of these techniques can yield answers to particular questions: When is a particular gene expressed? What phenotype does a particular mutant exhibit? Which species have a particular protein? However, what if we want to discover system level info? To discover system level info, we need to discover pair wise similarities. ie How does the above data co-vary with other genes/proteins/etc? Which genes co-express with a particular gene? System level info is clearly very interesting but is (as usual) computationally demanding. For a typical microbe with 3700 genes, there are 6.9 × 106 pairs of genes. Even with just pairwise correlations from just one dataset, we can discover revealing information. For example, by correlating gene-expression time during the cell-cycle, wecan identify different modules & group them into hierarchy of function. Imagine what we could discover if we used multiple data sources! Functional genomics can provide both gene level and system level info, how can we systematically combine them for even more-revealing discoveries? Network Integration When co-expression & co-inheritance tell us different things about which proteins should be linked, we need a systematic way to resolve & combine results from different datasets. Functional genomics provides dozens of ways to discover covariance – we need a systematic way to integrate all of these findings: We can create many protein interaction networks – each corresponding to the results from a functional genomics experiment. Each of these networks are on completely different scales – how do we combine them? Training sets We assay different methods on known true/false interactions. By measuring these known covariances & how they relate to functional linkage, we can determine higher-dimensional decision rules for categorizing proteins as functionally linked. We can take another step back and view this as a classical machine-learning problem where the dozens of functional genomics readings represent dozens of dimensions of the input data & a binary classification Machine Learning algorithm (an SVM, for example) is used to classify proteins as either functionally linked or not.Network integration can provide insight into ‘hidden biology’ - linkages that would not be evident if either of the data-dimensions is used in isolation: 30-60% of interactions fall in this region!Scope & Scale – it scales to very large datasets: Applied to 230 sequenced microbes 3-8 dimensions ~4.89 billion possible protein pairs, but only 2.81million predicted interactions. Example of an application: We can make interesting predictions about Caulobacter crescentus – in particular, what are the interaction partners of mreB ? When looking at this through our network browser, we can see strong linkages to proteins we’d expect (which is encouraging). More interestingly, we can see moderate linkages that would have been missed without network integration. One of these linkages was later confirmed! The moral of the story: We can discover new & interesting relationships! Now that we have hundreds of interactive networks – what can we do with them? 1) Network alignment: Can we systematically compare interaction networks to find conserved components? 2) Can we suggest proteins within a species to assay in the lab? Network alignment: Sequence alignment is about finding conserved proteins – network alignment is about finding conserved modules – both conserved sequence & interaction. If we can discover something about the interaction or behavior of a module in a particular species, we can use network alignment to translate those findings to a forward hypothesis in novel organisms. Protein recommender: Use a partial set of proteins to recommend other proteins which are likely involved. These recommendations are then tested


View Full Document

Stanford CS 262 - Functional Genomics & Network Integration

Documents in this Course
Lecture 8

Lecture 8

38 pages

Lecture 7

Lecture 7

27 pages

Lecture 4

Lecture 4

12 pages

Lecture 1

Lecture 1

11 pages

Biology

Biology

54 pages

Lecture 7

Lecture 7

45 pages

Load more
Download Functional Genomics & Network Integration
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Functional Genomics & Network Integration and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Functional Genomics & Network Integration 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?