MIT 6 854J - Problem Set #2 - D1712988

Home> Schools> Massachusetts Institute of Technology> Electrical Engineering and Computer Science (6) > 6 854J> Problem Set #2

MIT 6 854J - Problem Set #2

School name Massachusetts Institute of Technology

Course 6 854j- Advanced Algorithms

Pages 2

Download Save

Unformatted text preview:

Massachusetts Institute of Technology Handout 3 6.854J/18.415J: Advanced Algorithms Wednesday, September 14, 2005 David Karger Problem Set 2 Due: Wednesday, September 21, 2005. Notice that one problem is marked noncollaborative. As you might expect, this prob-lem should be done without any collaboration. Problem 1. Describe a data structure that represents an ordered list of elements under the following three types of operations: access(k): Return the kth element of the list (in its current order). insert(k, x): Insert x (a new element) after the k th element in the current version of the list. reverse(i, j) Reverse the order of the ith through jth elements. For example, if the initial list is [a, b, c, d, e], then access(2) returns b. After reverse(2,4), the represented list becomes [a, d, c, b, e], and then access(2) returns d. Each operation should run in O(log n) amortized time, where n is the (current) number of elements in the list. The list starts out empty. Hint: First consider how to implement access and insert using splay trees. Then think about a special case of reverse in which the [i, j] range is represented by a whole subtree. Use these ideas to solve the real problem. Remember, if you store extra information in the tree, you must state how this information can be maintained under various restructuring operations. This data structure is useful in eﬃciently implementing the Lin Kernighan heuristic for the travelling salesman problem. Problem 2. Given the theorem about access time in splay trees, it is tempting to con-jecture that splaying does not create trees in which it would take a long time to ﬁnd an item. Show that this conjecture is false by showing that for large enough n, it is possible to restructure any binary tree on n nodes into any other binary tree on n nodes by a sequence of splay operations (implying that there is some access sequence that turns a tree into a path). Problem 3. Let S be a search data structure that performs insert, delete and search in O(log n) time, where n is the number of elements stored. An empty data structure S can be created in O(1) time. We will construct a static data structure with n elements that is worst-case optimal in total access time, given the number of times an element is accessed in an access sequence.� 2 Handout 3: Problem Set 2 The data structure is constructed as follows. Search data structure Sk holds the 22k most frequently occurring items in the access sequence. A search on v is done on S0, S1, . . . until an Si holding v is encountered. Notice that all elements in Si are held in Si+1. (a) Show that the above data structure is asymptotically comparable to the optimal static tree in terms of the total time to process the access sequence. Recall from class that the statically optimal data structure achieves average access time O(− pi log pi) where pi is the fraction of accesses to item i. (b) Make the data structure capable of insert operations. Assume that the number of searches to b e done on v is provided when v is inserted. The cost of inse rt should be O(log n) amortized time, and total cost of searches should still be worst case optimal (non-amortized). (c) Improve your solution to work even if the frequency of acce ss is not given during the insert. Your data structure now satisﬁes the same static optimality theorem as splay trees. (d) Optional. Make your data structure satisfy the working set theorem on splay trees. Ignore the static optimality condition. Problem 4. Worked example. (a) Build an uncompressed suﬃx trie for “banana$”. Show the structure and node traversal path for e ach suﬃx insertion. Mark the suﬃx links that are actually used as shortcuts in the eﬃcient construction algorithm. (b) Draw the compressed suﬃx tree f or “banana$”. NONCOLLABORATIVE Problem 5. In this problem, we will see how to construct a suﬃx tree on multiple texts, and what some useful properties of such a suﬃx tree are. Suppose you are given n texts T1, T2, . . . Tn. (a) Suppose you build a common suﬃx tree of all the texts T1, T2, . . . Tn, i.e., a trie that contains all the suﬃxes of all the n texts. Argue that you can do this in time O(|T1+ T2+ . . . Tn). Be careful not to produce suﬃxes that cross from one | | | | |text to another (which would happen if you simply concatenated all the texts). (b) Suppose that we add a diﬀerent unique terminating symbol $i to each of the texts Ti. Consider a node N in the common suﬃx tree, and let s be the string corresponding to this node (i.e., the string on the path from the root to the node). How can you determine whether the string s is a substring of all the n texts by looking at the subtree rooted at N? (c) Using the above approach, explain how you can ﬁnd the largest common substring of the two texts T1, T2 in time O(|T1+ T2) (there’s a simple generalization to | | |more

View Full Document


School:
Email:
New Password:
Confirm Password:

MIT 6 854J - Problem Set #2

Sign up for free to view:

Please select your school