UCLA EE 202A - A Hierarchical CPU Scheduler - D2358658

Home> Schools> University of California, Los Angeles> (EE) > EE 202A> A Hierarchical CPU Scheduler

DOC PREVIEW

UCLA EE 202A - A Hierarchical CPU Scheduler

School name University of California, Los Angeles

Course Ee 202a-

Pages 24

This preview shows page 1-2-23-24 out of 24 pages.

Save

View full document

Premium Document

Do you want full access? Go Premium and unlock all 24 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 24 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 24 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 24 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Premium Document

Do you want full access? Go Premium and unlock all 24 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Unformatted text preview:

A Hierarchical CPU Scheduler for Multimedia Operating SystemsMotivationFrameworkFramework ExampleRequirements of Scheduling at Non-leaf NodesStart-time Fair Queuing (SFQ) NotationSFQ AlgorithmSFQ Algorithm contd.SFQ Scheduling ExampleSFQ PropertiesSFQ Properties contd. -Bounds on throughput and delaySlide 12ImplementationImplementation contd.Slide 15Experimental ResultsExperimental Results contd.Slide 18Slide 19Slide 20Related WorkRelated Work contd.ConclusionReferencesA Hierarchical CPU Scheduler for Multimedia Operating SystemsEE202A, Fall 2001Prof. Mani SrivastavaHanbiao Wang, Arun SomasundaraMotivation• Various application classes (hard real-time, soft real-time, best efforts) in multimedia system need different scheduling algorithms •Hierarchical CPU scheduling supports different scheduling algorithms for different application as well as protects application classes from one anotherFramework•A tree structure, each node has a weight and a scheduler•Each leaf node represents an aggregation of threads, and hence an application class•Each non-leaf node represents an aggregation of application classesFramework ExampleRequirements of Scheduling at Non-leaf Nodes•Fairness among child nodes•No a priori knowledge of time duration a task executes before blocked•Bounds on minimum throughput and maximum delay•Computationally efficientStart-time Fair Queuing (SFQ)Notationrfweight of thread fqjfjth quantum of thread fljflength of qjf in unit of instructionsA(qjf) qjf request time. It’s transition time when f block->runnable; otherwise, it’s time at which f’s previous quantum finishesSfstart tag for thread fFffinish tag for thread fv(t) virtual time at time tSFQ Algorithm•Virtual time v(t)–Initially 0–Start tag of the thread in service, when the CPU is busy at time t–Maximum finish tag assigned to any thread, when the CPU is idle at time tSFQ Algorithm contd.•Start tag Sf–Sf = max{v(A(qjf)), Ff}, stamped to thread f when qjf is requested•Finish tag Ff–Initially 0–Ff = Sf + ljf/rf, incremented when qjf finishes execution•Threads serviced in increasing order of start tags; ties broken arbitrarilySFQ Scheduling ExamplerA:rB = 1:2qA = qB = 10mslA = lB = 10SFQ Properties•Fair allocation of CPU regardless of bandwidth variation. Any threads f, m–|Wf(t1,t2)/rf – Wm(t1,t2)/rm|  lfmax/rf + lmmax/rm–Wx(t1,t2) is the aggregate work done by CPU in interval [t1,t2] for thread x, lxmax is the maximum length of quantum for which thread x is scheduled•No need of a priori knowledge of the quantum length since scheduling relies on increasing order of start tagsSFQ Properties contd.-Bounds on throughput and delay•Interrupted CPU modeled as Fluctuation Constrained server (FC) with average rate C (instructions/sec) and burstiness (C) (instructions), effective bandwidth:–W(t1, t2)  C * (t2 - t1) - (C) •Throughput of thread f (rf)is also FC:–(rf, rf * (n{lnmax} + (C) ) / C + lfmax) •Execution Complete Time of qjf:–L(qjf)  EAT(qjf)+(nf{lnmax}+ljf+ (C))/C–where EAT(qjf) = max{A(qjf),EAT(qj-1f)+lj-1f/rf}SFQ Properties contd.-Bounds on throughput and delay•Interrupted CPU modeled as Exponentially Bounded Fluctuation (EBF) server with (C, B, , (C)), effective bandwidth distribution:–P{W(t1, t2) < C * (t2 - t1) - (C) - }  Be- •Throughput of thread f (rf)is also EBF:–(rf, B, rf *  /C, rf * ( n{lnmax} + (C)) / C + lfmax)•Execution Complete Time of qjf:–P{L(qjf)  EAT(qjf)+(nf{lnmax}+ljf+(C)+ )/C}  1 - Be- Implementation•SFQ Scheduling is used for intermediate nodes (non-leaf)•Leaf nodes can use any algorithm•Each node–Weight, start tag, finish tag•Each non-leaf node–list of child nodes–list of runnable child nodes sorted by start tags–virtual time of the child node having the minimum start tag•Each leaf node–pointer to a function (Scheduling for thread).Implementation contd.•some functions–hsfq_mknod, hsfq_parse, hsfq_move, hsfq_rmnod (only if no child nodes), hsfq_admin, hsfq_schedule, hsfq_update (start and finish tags of all ancestors updated), hsfq_setrun, hsfq_sleep•Priority inversion–Not done between classes–Only within the same leaf•if rate monotonic, Priority inheritance•if SFQ: transferring weights of blocked to the blocking thread.Implementation contd.Experimental Results•Framework–Sun SPARCstation 10 with 32MB RAM.–Multiuser mode–all system processes running–Dhrystone benchmark•Scheduling structureExperimental Results contd.•Achieves predictable resource allocation•Throughputs of 5 threads running both schemes..Experimental Results contd.•Scheduling overhead–impact of more threads–impact of depthExperimental Results contd.•Achieves fair allocation•nodes isolated from each other and throughput depends only on weight.Experimental Results contd.•Fair allocation when bandwidth is dynamically changedRelated Work•Weighted Fair Queuing (WFQ)–not fair when bandwidth fluctuates–Requires length of quantums to be known a priori–Computation of round number expensive•Fair Queuing based on Start-time (FQS)–modification of WFQ, when quantums not known–increasing order of start tags–computationally expensive–does not provide fairness when bandwidth fluctuatesRelated Work contd.•Self Clocked Fair Queuing (SCFQ)–Approximates round number with finish tag–increasing order of finish tags–larger delay guarantee than SFQ•Lottery Scheduling–Fairness only over large time intervals–Supports hierarchical partitioning but•only one scheduling algorithm•computational overhead.Conclusion•Enables co-existence of heterogeneous schedulers•Protects application classes from each other•Computationally efficientReferences•P. Goyal, X. Guo, and H. M. Vin. A Hierarchical CPU Scheduler for Multimedia Operating Systems. Operating Systems Review, vol.30, spec. issue., (Second USENIX Symposium on Operating Systems Design and Implementation (OSDI), Seattle, WA, USA, 28-31 Oct. 1996.) ACM, 1996. p.107-21.•P. Goyal, H. M. Vin, and H. Cheng. Start-time Fair Queuing: A Scheduling Algorithm for Integrated Services Packet Switching Networks. Proceedings of ACM SIGCOMM’96, pages 157-168, August

View Full Document