Unformatted text preview:

MASSACHUSETTS INSTITUTE OF TECHNOLOGY Sloan School of Management 15.565 – INTEGRATING INFORMATION SYSTEMS: TECHNOLOGY, STRATEGY, AND ORGANIZATIONAL FACTORS 15.578 – GLOBAL INFORMATION SYSTEMS: COMMUNICATIONS & CONNECTIVITY AMONG INFORMATION SYSTEMS Spring 2002 Homework Assignment 4 Due – Lecture #21 Question 1. The excerpt below represents data obtained from http://quote.yahoo.com when ticker symbol INTC is entered (output is slightly reformated). INTEL CORPORATION Symbol INTC Last trade 30.06 4:00 PM Change -0.04 -0.13% Volume 34,154,200 a) In class we have referred to such web sites as “semi-structured”. Explain what that means. b) In considering this web site as a database returning database records, what are the 7 field values that are returned by that web page? Give names to each of the fields and briefly (informally) explain the meaning of each (as best you understand them). c) For each of the 7 fields, explain, in your own terms, what context knowledge is needed and indicate what you believe to be the context definitions for each of the 7 fields in this particular example. Clearly state any assumptions that you feel are appropriate. - 1 -Question 2. This question addresses the issue of schema integration. Assume that the schema below are used by the Sloan Student database, the Sloan Alumni database, and the Career Development Office (CDO) Recruit database. We want to produce an integrated schema. Perform a step-by-step process, similar to Lecture 16. Show the results of each step: (1) identify equivalent entities and attributes, (2) convert attributes to entities, and (3) integrate the two schema. Briefly explain the reasons, justifications, and assumptions behind each step. Sloan Students Database name year (1,2) Subject #Term Type (home, Stree Student Subject Previous Degrees school, summer) Degree (BS, MS, PhD) Year (e.g., 19XX) College Address t City CDO Recruit Database NameCompany Addresssds Interview Student Date Position Name Sloan Alumni Database NameCorporation Alumni MIT Degree Name Title Address street Address city Type (BS, MS, PhD) Graduated (e.g., 19XX) - 2 -Question 3. For this question, let us consider the distributed homogenous SQL database depicted below. SQL SQL SQL SQL HP UNIX IBM 390 Sun Solaris IBM NT POS Code Positio n Company Dat e Nam e Company POS Cod e DATABASES POSITIONTB ALUMNIT COMPANYTB B Let us consider the SQL query to print the names of all alumni that are “vice presidents” of companies that are interviewing on “December 14”. The query might be: select a.Name from alumnitb a, positiontb p, companytb c where p.Position = “vice president” and p.Poscode = a.Poscode and c.Date = “December 14” and c.Company = a.Company a) Explain what “retrieval transparency” means in the context of the above situation. b) Explain what “performance transparency” means in the context of the above situation. c) Describe at least two different ways to accomplish “performance transparency” for the above situation (i.e., two different sequences for processing the query.) Make maximum use of concurrency and explain which steps can be performed concurrently. d) Of the two approaches described above, which one is better? Explain your reasoning and any assumptions made (or information that you would need.) If helpful, make hypothetical assumptions about table sizes, etc. - 3


View Full Document

MIT 15 565J - Study Guide

Download Study Guide
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Study Guide and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Study Guide 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?