Berkeley COMPSCI 258 - Highly Available ACID Memory - D2531129

Home> Schools> University of California, Berkeley> Computer Science (COMPSCI) > COMPSCI 258> Highly Available ACID Memory

DOC PREVIEW

Berkeley COMPSCI 258 - Highly Available ACID Memory

School name University of California, Berkeley

Course Compsci 258- Parallel Processors

Pages 19

This preview shows page 1-2-3-4-5-6 out of 19 pages.

Save

View full document

Premium Document

Do you want full access? Go Premium and unlock all 19 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 19 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 19 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 19 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 19 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 19 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Premium Document

Do you want full access? Go Premium and unlock all 19 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Unformatted text preview:

Highly Available ACID MemoryIntroductionProject goalQuestions to answerOutlineAcid Memory APIImplementationEvaluationSlide 9Slide 10Slide 11Replication for availabilityArchitectureSlide 14Slide 15ConclusionsAdditional SlidesEvaluation, w.r.t. logging based approachHow to make file cache persistentHighly Available ACID Memory Vijayshankar RamanIntroductionWhy ACID memory?non-database apps:•want updates to critical data to be atomic and persistent•synchronization useful when multiple threads are accessing critical datadatabases•concurrency control and recovery logic runs through most of database code.•Extremely complicated, and hard to get right•bugs lead to data loss -- disastrous!Project goalTake recovery logic out of appsBuild a simple user-level library that provides recoverable, transactional memory.all the logic in one place => easy to debug, maintaineasy to to make use of hardware advancesuse replication and persistent memory for recovery -- instead of writing logs+simpler to implement+simpler for applications to use ??Questions to answerprogram simplicity vs. performancehow much do we lose by replicating instead of logging?on a cluster, can we use replication directly for availability?traditionally availability handled on top of the recovery systemOutlineIntroductionAcid Memory APISingle Node design & implementation EvaluationHigh Availability: multiple node design and implementationEvaluationConclusionAcid Memory APITransaction manager interface•TransactionManager(database name, acid memory area)Transaction interface•beginTransaction()•getLock(memory region1, READ/WRITE)•getLock(memory region2, READ/WRITE)•...–memory region = virtual address prefix•commit/abort() -- all locks released combine concurrency control with recovery•recovery done on write-locked regionssupports fine granularity locking => cannot use VM for recoveryapplications can modify data directlyImplementationassume non-volatile memory (NVRAM, battery backup)assume persistent file cacheacid memory area mmap’d from file persistence => writes are permanentgetLock(WRITE) -- copy the region onto mirror areatransaction abort / system crash undo changes on all writelocked regions using copy in mirror areaonly overhead of recovery is a memcpy on each write lockDisk filemaster copymirrorAcid memory areammapEvaluationOverhead of acid memoryread lock:  35usec (lock manager overhead)write lock:  35usec + 5.5usec/KB (memcpy cost)much lesser than methods that write log to diskEase of programmingapplication needs to only acquire locks to become recoverablecan manipulate the data directly -- do not have to call special function on every updateExample: suppose I want to transfer 1M $ from A’s account to B’s With ACID memory/* a points to A’s account *//* b points to B’s account */trans = new Transaction(transMgr);trans->getLock(a, WRITE);trans->getLock(b, WRITE);a = a - 1000000; b = b + 1000000;trans->commit(); Using loggingBeginTransaction();getLock(A’s account, WRITE);getLock(B’s account, WRITE);read(A’s account, a);read(B’s account, b);a = a - 1000000; b = b + 1000000;Update(A’s account, a);Update(B’s account, b);commit();(Update() creates the needed logs)Performance comparison: acid memory vs. loggingconsider a transaction updating integers in a 1KB data-structurelogging each individual update is a bit faster, to an extentacid memory gives okay performance with very easy programmabilityNumber of integer writesTime (in microseconds)Acid memory: write-lock the data-structureLogging: write-lock the structureand update each integer separatelyOutlineIntroductionAcid Memory APISingle Node design & implementation EvaluationHigh Availability: multiple node design and implementationEvaluationConclusionReplication for availability traditionally, availability has been handled in a separate layer -- above recoverycan we handle both recovery and availability via same mechanism?Transaction processing monitorDBMSDBMS DBMSreplicateArchitectureTransactions run by transaction handlerall lock requests must go to ownerdata in all replicas must be kept in syncbalance load by partitioning datadifferent owner for each partitionfailure modelfail-stop: nodes never send incorrect messages to othersfailed nodes never recover data after crashnetwork never failsOwnerdatalock managerdata datadata dataTransaction handlerreplicasclientReads: client gets data from random replicaWrites: must update all replicason commit, transaction sends new data to ownerowner propagates update atomically to all replicas•3 phase non-blocking commit protocol. Always ensure that there is someone to take over the propagation if you crashif owner crashes, fail-over to a replicaOwnerdatalock managerdata datadata dataclientTransaction handlerEvaluationVery fast recovery -- 424 usecs+get fast transactions without non-volatile memorywrites are slower- 4n messages at commit if n replicas-still, this is faster than logging to disk–homogeneous software: susceptible to bugsConclusionsAcid memory easier to usePerformance relative to logging not too badreplication gives fast recoveryUsing cache for replicationwhen/how much to replicate?Future WorkAdditional SlidesEvaluation, w.r.t. logging based approachEase of implementationvery little to code, mostly lock manager stuffwhereas in a traditional dbms•specialized buffer manager•log manager•complex recovery mechanismHow to make file cache persistentRio (Chen et. Al, 1996)place file cache in non-volatile memoryprotect it against OS crashes using VM protectionflush pages in file cache to disk files on

View Full Document