DOC PREVIEW
U of I CS 525 - Characteristic Studies of Distributed Systems

This preview shows page 1-2-17-18-19-36-37 out of 37 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 37 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 37 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 37 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 37 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 37 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 37 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 37 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 37 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

Slide 1Distributed System ImplementationsSlide 3Modeling of P2P SystemsObserving P2PData LogKazaa ObjectsUsers are PatientUsers Slow DownUsers Slow DownClient ActivityWorkloadObject DynamicsZipfNon-ZipfNon-Zipf modelReally Non-Zipf?Locality AwarenessDiscussion PointsSlide 20Cluster ComputingAmazon AWSAn Example Application- GrepTheWebAn Example ApplicationAWS InterfaceAWS SecurityS3 Evaluation-Test ScenariosAverage Daily Read ThroughputCDF of Read and Write Throughput March 22-April8ResultsQuery VarianceResultsSlide 33Other ExperimentsExperience with EC2 and SQSConclusionDiscussion PointsCharacteristic Studiesof Distributed SystemsM a ryam R a hm a n i h e r i s & D a n ie l U hl i g1Distributed System ImplementationsCharacteristics of systems in real worldHow users and systems behaveP2P Track clients and traffic on KazaaExplore and model usage Cloud Computing Experiment with Amazon Web ServicesMeasure performance metrics2Krishna P. Gummadi, Richard J. Dunn, Stefan Saroiu, Steven D. Gribble, Henry M. Levy, And John ZahorjanMeasurement, Modeling, and Analysis of Peer-to-Peer File-Sharing Work LoadPeer To Peer CharacteristicsPresented by Daniel Uhlig3Modeling of P2P SystemsObserve clients and users behavior in real worldUsed large P2P system: KazaaClosed P2P protocalFasttrack network (still exists?)Peaked at 2.3 million users in 2003 (more than Napster’s peak)Now subscription DRM music store4Observing P2PRecorded all Kazaa traffic at U. of Washington60,000 faculty, staff, and students203 day trace (late spring semester past the end of fall semester)Protected privacy by making data anonymousAnalyze data, develop modelcompared to web: Is there a better comparison?5Data Log•Recorded all Kazaa traffic–Incoming and outgoing data transfer and searches–HTTP traffic with username in header–KazaaLite showed up 60 days into trace•Username hardcoded so used IP to differentiated. •20 terabytes of incoming data•1.6 Million Requests •25,000 users•8.85 TB of unique objects•Paper used requests from university peers to external peers. •Will this bias results?6Kazaa ObjectsFetch at most once 94% of Kazaa files vs 57% of web objects99% of Kazaa files fetch at most twiceClients download objects just once What is an object?Authors assume immutable object that is uniqueSame song Still fetch at most once?- Different filename?- Different bitrate?- Different Length?- Encoded by different user?7Users are Patient8•Web users want instant access, P2P users will wait •P2P users are patient •Small Objects –30%  1 hr –10% nearly a day•Large Objects–50%  1 day–20% wait 1 week•Is this accurate since client automatically restarts requests?Users Slow DownTook 30 day traces of ‘new’ usersBytes requested decrease with agePossible reasons?Loss of interestNew P2P appNew ID9Users Slow DownUsers leave the system foreverUsers request less data as they ageCore clients have constant activity levelBut, less data request 10Client Activity•How to measure activity: –Logged in vs downloading?•Average session length 2.4 minutes–Sessions could be split with a short term break in transfer•Activity over lifetime = 5.5%•Average transfer 17 minutes, but average session 2.4 minutes•Many transactions fail, looking for new host peer11WorkloadLarge (>100 MB) vs Small (<10 MB) filesRequest Volume vs Transfer volumeAudio clips vs Video clips12Object Dynamics13Clients fetch objects at-most-oncePopular objects quickly cycleNew objects are most popularMost requests are for old objectsSmall Objects Large ObjectsTop 10 Top 100Top 10Top 100Overlap between first and last 30 days0 of 10 5 of 1001 of 1044 of 100# of popular objects that a less 30 days old6 of 10 73 of 952 of 9 47 of 56How does the idea of distinct objects affect this?Zipf Distribution where most popular objects are most fetched. Classical result for web pages14Non-Zipf•Authors propose that Kazaa traffic is NOT modeled by Zipf distribution.•P2P differences from web –‘fetch-at-most once’–Immutable object (cnn.com changes regularly, a multimedia file does not)•Simulated model of behaviors and observed 15Non-Zipf model•Zipf seen as model in many places–Video on demand, video rental, movie tickets•Non-Zipf might better explain some of these models•Common characteristics–Birth of new objects–Fetch at most once–Immutable•Characteristics sometimes seen–Expensive to get object–Object remains•Does their non-zipf model explain everything?16Really Non-Zipf?•Multiple copies of the same object?–Does fetch at most once still hold•Requests for established files handled by internal users? (‘cached’)•Are objects immutable?–Changing names–Is new album/song a new object or an update from artist? •Non-Zipf in other multimedia–YouTube, Video Rental, DVD purchase, movie tickets?17Locality Awareness•Conserve University P2P bandwidth•86% of objects requested already at U of W–Cache data (legal issues)–Redirector so request stay internal when possible•Few key nodes can save significant bandwidth18Discussion Points•What is an unique item–Does this affect the distribution of popular objects?•Are objects immutable•Apply ideas to other multimedia: –YouTube video popularity–Still fetch at most once–Non-Zipf for DVD rental or purchase•How to define a unique object•Should P2P handle large and small objects differently•Caching or other forced locality vs P2P built in locality 19P r e s e n t e d b y : Maryam RahmaniherisU n i v e r s i t y o f I l l i n o i s a t U r b a n a - C h a m p a i g nC S 5 2 5 - S p r i n g 2 0 0 9 An Evaluation of Amazon's Grid Computing Services: EC2, S3, SQSSimon L. Garfinkel20Cluster ComputingBuilding your own clusterCostlySpaceCooling systemStaff…UnderutilizationOverutilization Cloud computing Computing as a utilityTime-multiplexing of resourcesNo need fro planning aheadElasticityAmazon’s AWSYou only need a working credit card21Amazon AWSEC2 (Elastic Compute Cloud)Linux virtual machines for 10 cents per CPU hourS3 (Simple Storage Service):Data storage for 15 cents per gigabyte per monthSQS (Simple Queue Service):Messaging service for 10 cents per thousands messages22An Example Application- GrepTheWeb23An Example


View Full Document

U of I CS 525 - Characteristic Studies of Distributed Systems

Documents in this Course
Epidemics

Epidemics

12 pages

LECTURE

LECTURE

7 pages

LECTURE

LECTURE

39 pages

LECTURE

LECTURE

41 pages

P2P Apps

P2P Apps

49 pages

Lecture

Lecture

48 pages

Epidemics

Epidemics

69 pages

GRIFFIN

GRIFFIN

25 pages

Load more
Download Characteristic Studies of Distributed Systems
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Characteristic Studies of Distributed Systems and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Characteristic Studies of Distributed Systems 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?