Slide titles:
This Week
Textbooks for COSC 6340
Lectures in COSC 6340
Tentative Schedule COSC 6340
Tentative Schedule COSC 6340 Part2
Exams and Homeworks COSC 6340
Why are integrated databases popular?
Popular Topics in Databases
Data Model
Schema for the Library Example using the E/R Data Model
Relational Schema for Library Example in SQL/92
Referential Integrity in SQL/92
Example of an Internal Schema for the Library Example
Example: Stored Database
Slide 15
Disks and Files
Why Not Store Everything in Main Memory?
Components of a Disk
Accessing a Disk Page
Review: The ACID properties
Example
Atomicity of Transactions
Concurrency in a DBMS
Example (Contd.)
Summary

Christoph F. Eick, Introduction Data Management

This Week
1. Introduction to Databases
2. Course Information
3. Grading and Other Things
4. Questionnaire
5. The E/R Data Model

Textbooks for COSC 6340
Required Text: Raghu Ramakrishnan and Johannes Gehrke, Database Management Systems, McGraw Hill, Third Edition, 2002 (complication: the chapter numbers in the new edition are different!)
Recommended: Jiawei Han and Micheline Kamber, Data Mining: Concepts and Techniques, Morgan Kaufmann Publishers, 2001, ISBN 1-55860-489-8 (4 chapters will be covered)
Other books with relevant material: Ramez Elmasri and Shamkant Navathe, Fundamentals of Database Systems, Third Edition, Addison Wesley, ISBN 0-8053-1755-4

Lectures in COSC 6340
I: Basic Database Management Concepts --- Review of basic database concepts, techniques, and languages (9 classes; Chapters 1-5, 8-11, and 16 of the textbook)
II: Introduction to KDD and Data Warehousing, centering on data warehouses, OLAP, and data mining, with more detailed coverage of querying and mining data streams and database clustering (5 classes; Chapters 1, 2, 6, and 7 of the Han/Kamber book, chapter 25 of our textbook, and additional material)
III: Relational Database Design (2 classes; chapter 19)
IV: Implementation of Relational Operators, Query Optimization, and Physical Database Design (3-4 classes; Chapters 12, 14, and 20)
V: Internet Databases and XML (1 class; chapter 27 of the textbook)
VI: Other: discussion of homeworks and exams, student presentations, discussion of course projects (3 classes)

Tentative Schedule COSC 6340
Jan. 18: Introduction to COSC 6340
Fast Review of Undergraduate Material (Jan. 20-Feb. 15)
Jan. 20: Entity-Relationship Data Model (more detailed than textbook)
Jan. 25: Entity-Relationship Data Model
Jan. 27: Relational Data Model
Feb. 1: Mapping E/R Diagrams to Relations
Feb. 8/10/15: Index & storage structures, B+-trees, and hashing, PDBD
Feb. 15/17: Relational Algebra
Feb. 22: Writing SQL Queries (somewhat short)
Feb. 24: Leftovers / Review
March 1: Exam0 (Undergraduate Review Exam)

Tentative Schedule COSC 6340 Part2
II: KDD and Data Warehousing (approx. 5.5 classes)
March 3: Introduction to KDD
March 8: Similarity Assessment
March 10: Clustering
March 22: Association Rule Mining
March 29: Spatial Databases
March 31: Data Warehouses and OLAP
April 8 (30 minutes): Spatial Data Mining
April 5+7+[8]: III: Relational Database Design (2.5 classes)
April 14: Midterm Exam (30 minutes review on April 12, 2005)
April 19+26+[28]: Student Presentations
IV: Physical Database Design and Query Optimization (3 classes)
April 8 (makeup): Implementation of Relational Operators
April 8 (makeup)/12: Introduction to Query Optimization
April 12/19: Physical Database Design I
V: Internet Databases (1 class)
April 21: Introduction to XML and Semi-Structured Data
April 28: Course Summary + Teaching Evaluation + Leftovers

Exams and Homeworks COSC 6340
Tu., March 1: Undergraduate Material Review Exam
Th., April 14: Midterm Exam
3 graded homeworks; deadlines: Feb. 17, April 11, April 28
Ungraded homeworks
Final Exam: Tu/Th., May 10, 11a-1:30p
Qualifying Exam Part2: Fr., May 13, 10-11:30a

Why are integrated databases popular?
Avoidance of uncontrolled redundancy
Making knowledge accessible that would otherwise not be accessible
Standardization --- uniform representation of data facilitating import and export
Reduction of software development (through the availability of data management systems)
[Figure labels: Integrated Database; Bookkeeping Device; Car Salesman]

Popular Topics in Databases
Efficient algorithms for data collections that reside on disks (or which are distributed over multiple disk drives, multiple computers, or over the internet)
Study of data models (knowledge representation, mappings, theoretical properties)
Algorithms to run a large number of transactions on a database in parallel; finding efficient implementations for queries that access large databases; database backup and recovery, ...
Database design
How to use database management systems as an application programmer / end user
How to use database management systems as a database administrator
How to implement database management systems
Data summarization, knowledge discovery, and data mining
Special purpose databases (genomic, geographical, internet, ...)

Data Model
[Figure labels: Data Model; Schema (defines a set of database states); Current Database State; arrow label: is used to define]

Schema for the Library Example using the E/R Data Model
[Figure: E/R diagram. Legend: Many-to-Many, 1-to-1, 1-to-Many, Many-to-1. Entity types: Book, Person. Relationship type: Check_out. Attribute labels: title, author, B#, when, phone, name, ssn. Cardinalities: (0,35), (0,1)]

Relational Schema for Library Example in SQL/92
CREATE TABLE Book (
    B# INTEGER,
    title CHAR(30),
    author CHAR(20),
    PRIMARY KEY (B#));
CREATE TABLE Person (
    ssn CHAR(9),
    name CHAR(30),
    phone INTEGER,
    PRIMARY KEY (ssn));
CREATE TABLE Checkout (
    book INTEGER,
    person CHAR(9),
    since DATE,
    PRIMARY KEY (book),
    FOREIGN KEY (book) REFERENCES Book,
    FOREIGN KEY (person) REFERENCES Person);

Referential Integrity in SQL/92
SQL/92 supports all 4 options on deletes and updates.
Default is NO ACTION (delete/update is rejected)
CASCADE (also delete all tuples that refer to deleted tuple)
SET NULL / SET DEFAULT (sets foreign key value of referencing tuple)
CREATE TABLE
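
The referential actions named on the slide can be tried out with a small sketch. It uses Python's sqlite3 module rather than an SQL/92 system, so it is only an approximation of the behavior described above. Several things here are assumptions and not part of the slides: the column B# is renamed to b_no (SQLite does not accept # in an unquoted identifier), the choice of NO ACTION for the book reference and CASCADE for the person reference, the sample rows, and the helper names build_library and demo.

```python
import sqlite3


def build_library(conn):
    """Create the library tables and a few sample rows (all sample data is made up)."""
    # SQLite only enforces foreign keys when this pragma is switched on.
    conn.execute("PRAGMA foreign_keys = ON")
    conn.executescript(
        """
        CREATE TABLE Book (
            b_no INTEGER,          -- stands in for B# on the slide
            title CHAR(30),
            author CHAR(20),
            PRIMARY KEY (b_no));
        CREATE TABLE Person (
            ssn CHAR(9),
            name CHAR(30),
            phone INTEGER,
            PRIMARY KEY (ssn));
        CREATE TABLE Checkout (
            book INTEGER,
            person CHAR(9),
            since DATE,
            PRIMARY KEY (book),
            -- assumed choice: deleting a checked-out book is rejected
            FOREIGN KEY (book) REFERENCES Book ON DELETE NO ACTION,
            -- assumed choice: deleting a person also deletes their checkouts
            FOREIGN KEY (person) REFERENCES Person ON DELETE CASCADE);

        INSERT INTO Book VALUES (1, 'Sample Title A', 'Sample Author A');
        INSERT INTO Book VALUES (2, 'Sample Title B', 'Sample Author B');
        INSERT INTO Person VALUES ('111111111', 'Sample Person', 5550100);
        INSERT INTO Checkout VALUES (1, '111111111', '2005-01-20');
        """
    )


def demo():
    """Return (delete_rejected, checkouts_left, books_left) after two deletes."""
    conn = sqlite3.connect(":memory:")
    build_library(conn)

    # NO ACTION: book 1 is still referenced by a Checkout row, so the delete fails.
    try:
        conn.execute("DELETE FROM Book WHERE b_no = 1")
        delete_rejected = False
    except sqlite3.IntegrityError:
        delete_rejected = True

    # CASCADE: deleting the person removes the Checkout row that refers to them.
    conn.execute("DELETE FROM Person WHERE ssn = '111111111'")

    checkouts_left = conn.execute("SELECT COUNT(*) FROM Checkout").fetchone()[0]
    books_left = conn.execute("SELECT COUNT(*) FROM Book").fetchone()[0]
    conn.close()
    return delete_rejected, checkouts_left, books_left


if __name__ == "__main__":
    print(demo())
```

Swapping ON DELETE CASCADE for ON DELETE SET NULL in the person reference would instead keep the Checkout row and set its person column to NULL, which is the third option listed on the slide.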

