DOC PREVIEW
CORNELL CS 514 - Lecture Notes

This preview shows page 1-2-3-21-22-23-42-43-44 out of 44 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 44 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 44 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 44 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 44 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 44 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 44 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 44 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 44 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 44 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 44 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

CS514: Intermediate Course in Operating SystemsReal-world time-critical systemsA “system of systems”Slide 4Examples: AmazonWhy is this “time critical”?AkamaiMilitary exampleAir Traffic Control ExampleSlide 10Issues? Let’s focus on scalingTempestSlide 13Slide 14How to solve such problems?Too many choices!How would Amazon answer?Best technology for Amazon?Consistency in TempestTempest CollectionsSlide 21Two level implementationEvaluationSlide 24ExperimentPerformanceDelay to order pending updatesRecovery under loadServices characteristicsPetStoreSlide 31SummaryWhat would an Air Traffic System want?Replicated componentsChoice we saw last timeMore choicesHow would we pick?Picking between Paxos and VsyncMore practical questionsChallenges of request duplicationThen….Raises a questionGeneralized questionSlide 44CS514: Intermediate Course in Operating SystemsProfessor Ken BirmanVivek Vishnumurthy: TAReal-world time-critical systemsThe challenge:Suppose I need to build a rapidly responsive systemI want to handle large scaleI plan to use a modular architectureCan this be done in a web services setting?A “system of systems”We use the term “system of systems” or SoS to capture this conceptExamples will help clarify the ideaBasic structure:FrontEndBack endBack endBack endA “system of systems”Or might interconnect systems at different data centers to give a reasonably integrated “picture”FrontEndBack endBack endBack endFrontEndBack endBack endBack endExamples: AmazonAmazon would often use the front end to build a web page for a userThe back-end systems fill in contentProduct popularityCurrent inventoryGreat deals on related productsProducts other people who did a similar search ultimately purchased…Why is this “time critical”?Amazon is graded by quick accurate responseGood grade: You buy the bookBad grade: You use Google and shop elsewhereFor Amazon’s line of business, this SoS configuration is as critical as it gets!AkamaiCorporate site controls a large number of satellite systemsGoal: Move content to be close to users who are likely to access that contentTime critical aspect: Akamai is paid by hosts seeking to ensure snappy load times for their web sitesMilitary exampleTeam comes under fire, calls for helpCommander needs to knowWhat resources are available?What’s the terrainWhere have enemy forces been seen?Is there an evacuation option?… and needs a fast responseAir Traffic Control ExampleNew radar ping detectedTrack formation system should fit this to existing tracks (or create a new one)Flight plan lookup should check for known aircraft that might match this trackWarnings system should check for proximity rulesLong term planner should schedule a landing slotAir Traffic Control ExampleAlso see issues from controller to controllerWhen A hands off to B need to ensure continuous coverageAnd when centers talk to each otherFrance has 5 ATC centers… Europe has hundreds…Issues? Let’s focus on scalingScalability allows us to handle more load and also provides fault-toleranceEach service becomes a replicated group of servers that cooperateThey replicate data by multicasting updatesAnd the reads are load-balancedIssues are specific to time-criticality?Castor 4/07TempestStart with a standard web services applicationPerhaps, builds web pages for air traffic controllerWS front-endServicesServicesServicesCastor 4/07TempestWe’ll scale it out by replicating the components… and automate management, repair, adaptation even when faults occurWS front-endServicesServicesServicesWS front-endWS front-endWS front-endWS front-endServicesServicesServicesServicesServicesServicesServicesServicesServicesCastor 4/07WS front-endServicesServicesServicesWS front-endWS front-endWS front-endWS front-endServicesServicesServicesServicesServicesServicesServicesServicesServicesTempestThen interconnect data centersWS front-endServicesServicesServicesWS front-endWS front-endWS front-endWS front-endServicesServicesServicesServicesServicesServicesServicesServicesServicesWS front-endServicesServicesServicesWS front-endWS front-endWS front-endWS front-endServicesServicesServicesServicesServicesServicesServicesServicesServicesHow to solve such problems?Tools in our toolkitUDP multicast – very fast, unreliableRON – routes around problems, unreliableBitTorrent – receivers cooperate to offload work from the senderVirtual synchrony – strong consistencyQuorums – even stronger (but slower)CASD or Ricochet: real-time multicastToo many choices!Need to askHow strong does the consistency property need to be for the application of interest?How harsh is the runtime environment?How critical is timing?Is the system “safe” if the primitive is unreliable?How would Amazon answer?To guarantee fast response, they bought lots of hardware… now they damn well expect speedups!Selling a book that is actually out of stock isn’t a disasterFast matters more than “real time” of the provable, conservative kindBest technology for Amazon?Probably something like Ricochet would work best for themGets the update through FASTUses pro-active FEC to recover from likely patterns of lossBackground gossip mechanism repairs any losses not caught by FECHow might inconsistency “look” to users?Consistency in TempestRecall that transactional services offer strong data consistency modeleach read operation returns the result of the latest writeTempest implements a weaker model called sequential consistencyevery replica sees the operations on the same data item in the same orderorder may be different than the order updates were issuedTempest CollectionsPersistent service state = collection of objectsEach object (obj) is naturally represented by the tuple 〈 Histobj, Pendingobj〈Hist is the state of the objectcurrent value or list of updatesPending is the set of updates that cannot be applied yetapplied when ordering consistent acrossA Tempest ServiceA = sell(“IBM”, 108)B = sell(“IBM”, 163)C = buy(“IBM”, 32)Hist = Pending ={ F = sell(“IBM”, 81)E = sell(“IBM”, 76) }TempestCollectionReplica 1A = sell(“IBM”, 108)B = sell(“IBM”, 163)Hist = Pending ={ C = buy(“IBM”, 32)D = buy(“IBM”, 53)E = sell(“IBM”, 76) } TempestCollectionReplica 2A =


View Full Document

CORNELL CS 514 - Lecture Notes

Documents in this Course
LECTURE

LECTURE

29 pages

LECTURE

LECTURE

28 pages

Load more
Download Lecture Notes
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Lecture Notes and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Lecture Notes 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?