DOC PREVIEW
UMD CMSC 838T - Parallel Detection of Regulatory Elements with gMP

This preview shows page 1-2-3-26-27-28 out of 28 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 28 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 28 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 28 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 28 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 28 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 28 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 28 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

Parallel Detection of Regulatory Elements with gMPMotivationTalk OverviewTechniqueSlide 5gRNA frameworkgRNA - APIsgRNA environmentgRNA GridgMPSlide 11REDUCE algorithmSlide 13REDUCE methodSlide 15....Table: Finding significant motifsREDUCE parallelised with gMP......ExperimentResultsAnalysisRelated workSlide 24ComparisonObservationsSlide 27ConclusionsParallel Detection of Regulatory Elements with gMP Bertil Schmidt, Lin Feng, Amey Laud, Yusdi SantosoDamayanti GuptaCMSC 838 PresentationCMSC 838T – PresentationMotivationFundamental questionHow are expression levels of thousands of genes regulated ?Very importantUnderstanding of gene functionResponse to environmentUnderstand genetic causes of diseases Evaluate effects of drusDetect mutationsRememberSets of genes -> Pathways -> Genetic NetworksGene regulationControl decisions turn genes on/offGene Regulation NetworkCMSC 838T – PresentationTalk OverviewOverview of talkMotivationTechniqueExperimentRelated workConclusionsCMSC 838T – PresentationTechniqueMotifs upstream of genes regulate gene expressionMotifs are sites of regulatory activityIdentify regulatory motifs by combiningGene expression dataDetect common motifs occuring upstream of genesHuge datasetsUtilise parallel computingCMSC 838T – PresentationTechniquegRNAJava development frameworkgMPJava communication libraryREDUCEAlgorithm to identify regulatory motifsREDUCE parallelised with gMPIncrease computing powerGet motifs ranked in statistical significanceCMSC 838T – PresentationgRNA frameworkConsists of APIsCMSC 838T – PresentationgRNA - APIsInteract with data sourcesProvide functionality from biologyPipelines tasks into unified processRepository of resourcesDistributed programmingCMSC 838T – PresentationgRNA environmentgRNA GridClustered computing environmentApplication written for gRNAMultiple-tier applicationApplications operate from client computerCommunicates with cluster through single computerHosts EJB serverServer identifies processing nodeseach of these perform tasksCMSC 838T – PresentationgRNA GridCMSC 838T – PresentationgMPJava based message passing toolBuilt on top of socketsManages virtual processors to run on available machinesScalableMachines added/removed easilyCMSC 838T – PresentationgMPProcesses are groupedCommunication primitives provided for sending and receiving dataCollective communication to several nodes enabled modularly and efficientlyEnables functions to be implemented on dataCMSC 838T – PresentationREDUCE algorithmBased on model Upstream motifs contribute additively to expression level of each geneQuantify the extent to which these motifs contribute to expression dataFit log of expression ratio to sum of activating and inhibitory termsFind stastically most significant motifsPlots of fitting parameters suggest biological functionCMSC 838T – PresentationREDUCE algorithmTermsOccurence vectorMeasure of how often a motif is foundExpression vectorMeasure of gene expressionCMSC 838T – PresentationREDUCE methodConsists of1) Motif frequency countercounts occurrences of DNA motifs upstream of each ORFmotifs are about 7~11 nucleotides in lengthget occurence vectorsCMSC 838T – PresentationREDUCE algorithm2) Significant motif finderUsei) Normalised occurrence vector made for each motif nμii) Normalised vector of logs of gene expression ratio vectors- aTake dot product of these (a . nμ) ,and square.Can be considered as frequency of occurence X expressive power of regulatory motifIt is squared to get rid of negativesCorrelate gene expression with occurence of motifLargest dot product is most significant motifCMSC 838T – Presentation....a is modified to remove effect of this motifresidual gene expression vectorProcess repeated until motifs are rankedCMSC 838T – PresentationTable: Finding significant motifsUses a - (.5816,.2522,.2886,-.5947, -.1595, -.3683)CMSC 838T – PresentationREDUCE parallelised with gMP...Parallel motif frequency counterSplit set of ORFs equallyDistribute across available nodesEach node calculates in parallel to get occurence vectorsMatrix transpositionOccurence vectors scattered across nodesAdvantageous to store each vector in single nodeTranspose motif frequency matrixFor each ORF can only calculate fraction of occurence frequencies for all motifsBut the entire occurence frequency is neededCMSC 838T – Presentation...Parallel significant motif finderNormalises occurence vector within each nodeAt each node, most significant motif calculatedGlobal most significant motif calculatedProcess iterated to rank occurence vectorsInterface in gRNA allows ease of implementationCMSC 838T – PresentationExperimentUse Compaq Alpha systemConsists of cluster of 8 AlphaServer SC/ES45 Connected by high-speed Alpha SC 16-Port switch and ELAN PCI adapter cards.Each server contains 4 Alpha EV68 processorsCMSC 838T – PresentationResultsUse 7090 gene expressions of yeastORFs of length 600Motifs upto length 7 Throughput (in MBytes/s) also shown20 most significant motifs computed.CMSC 838T – PresentationAnalysisRuntime scales well with number of processing nodesFrequency counter scales perfectlyMotif finder also scalesCannot achieve perfect scaling because of communication overhead.CMSC 838T – PresentationRelated workDiscoveryLinkProvides configurable wrappers as interfaces to multiple data sourcesKleisli systemSystematically manages and integrates external databasesUses functional query language to perform correlation across databasesToolkits designed with functionality for specialised areas BioJava, BioPerl, PALSequence AnalysisEnsembl initiative, DASprovide extensible approach to issue of annotating genomic dataCMSC 838T – PresentationRelated workPrevious approaches using Java for high performance computingBindings into native message-passing APIs(e.g.MPI)Does not allow easy integration into larger Java applicationsPure Java message passing interfacesJMPI, CCJBoth implemented on top of Java RMI–Slower than using raw socketsCCJ tries to overcome–optimised RMI implementation–not portableBoth cannot handle integrationCMSC 838T –


View Full Document

UMD CMSC 838T - Parallel Detection of Regulatory Elements with gMP

Documents in this Course
Load more
Download Parallel Detection of Regulatory Elements with gMP
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Parallel Detection of Regulatory Elements with gMP and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Parallel Detection of Regulatory Elements with gMP 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?