This WeekTextbooks for COSC 6340Lectures in COSC 6340Tentative Schedule COSC 6340Tentative Schedule COSC 6340 Part2Exams and Homeworks COSC 6340Why are integrated databases popular?Popular Topics in DatabasesData ModelSchema for the Library Example using the E/R Data ModelRelational Schema for Library Example in SQL/92Referential Integrity in SQL/92Example of an Internal Schema for the Library ExampleExample: Stored DatabaseSlide 15Disks and FilesWhy Not Store Everything in Main Memory?Components of a DiskAccessing a Disk PageReview: The ACID propertiesExampleAtomicity of TransactionsConcurrency in a DBMSExample (Contd.)SummaryChristoph F. EickIntroduction Data ManagementThis Week1. Introduction to Databases2. Course Information 3. Grading and Other Things4. Questionnaire5. The E/R Data ModelChristoph F. EickIntroduction Data ManagementTextbooks for COSC 6340Required Text: Raghu Ramakrishnan and Johannes Gehrke, Data Management Systems, McGraw Hill, Third Edition, 2002 (complication: the chapter numbers in the new edition are different!!)Recommended: Jiawei Han and Micheline Kamber, Data Mining: Concepts and Techniques, Morgan Kaufman Publishers, 2001, ISBN 1-55860-489-8 (4 chapters will be covered)Other books with relevant material: Ramez Elmasri and Shamkant Navathe, Fundamentals of Database Systems, Third Edition Addison Wesley ISBN: 0-8053-1755-4Christoph F. EickIntroduction Data ManagementLectures in COSC 6340I: Basic Database Management Concepts --- Review of basic database concepts, techniques, and languages (9 classes, Chapters 1-5, 8-11, and 16 of the textbook). III: Introduction to KDD and Data Warehousing centering on data warehouses, OLAP, and data mining; moreover, more detailed coverage of querying and mining data streams and database clustering (5 classes; Chapters 1, 2, 6, and 7 of the Han/Kamber book & chapter 25 of our textbook and additional material)III: Relational Database Design (2 classes, chapters 19)IV: Implementation of Relational Operators, Query Optimization, and Physical Database Design (Chapters 12+14+20, 3-4 classes)V: Internet Databases and XML (1 class, chapter 27 of the textbook)VI: Other: discussion of home works and exams, student presentations; discussion of course projects (3 classes)Christoph F. EickIntroduction Data ManagementTentative Schedule COSC 6340Jan. 18: Introduction to COSC 6340 Fast Review of Undergraduate Material (Jan. 20-Feb. 15)Jan. 20: Entity-Relationship Data Model more detailed than textbookJan. 25: Entity-Relationship Data ModelJan. 27: Relational Data Model Feb. 1: Mapping E/R Diagrams to Relations Feb. 8/10/15: Index & storage structures, B+-trees, and hashing, PDBDFeb. 15/17: Relational Algebra Feb. 22: Writing SQL Queries (somewhat short)Feb. 24: Leftovers / Review March 1: Exam0 (Undergraduate Review Exam)Christoph F. EickIntroduction Data ManagementTentative Schedule COSC 6340 Part2II: KDD and Data Warehousing (approx. 5.5 classes)March 3: Introduction to KDDMarch 8: Similarity Assessment March 10: Clustering March 22: Association Rule MiningMarch 29: Spatial DatabasesMarch 31: Data Warehouses and OLAP April 8 (30 minutes): Spatial Data Mining April 5+7+[8]: III: Relational Database Design (2.5 classes)April 14: Midterm Exam (30 minutes review on April 12, 2005)April 19+26+[28]: Student PresentationsIV: Physical Database Design and Query Optimization (3 classes)April 8(makeup): Implementation of Relational Operators April 8(makeup)/12: Introduction to Query Optimization April 12/19: Physical Database Design IIV: Internet Databases (1 class)April 21: Introduction to XML and Semi-Structured Data April 28: Course Summary + Teaching Evaluation + LeftoversChristoph F. EickIntroduction Data ManagementExams and Homeworks COSC 6340Tu., March 1: Undergraduate Material Review ExamTh., April 14: Midterm Exam3 graded homeworks: deadlines: Feb. 17, April 11, April 28Ungraded HomeworksFinal Exam: Tu/TH., May 10, 11a-1:30pQualifying Exam Part2: Fr., May 13, 10-11:30aChristoph F. EickIntroduction Data ManagementWhy are integrated databases popular?Avoidance of uncontrolled redundancyMaking knowledge accessible that would otherwise not be accessibleStandardization --- uniform representation of data facilitating import and exportReduction of software development (though the availability of data management systems)IntegratedDatabaseBookkeepingDeviceCar SalesmanChristoph F. EickIntroduction Data ManagementPopular Topics in DatabasesEfficient algorithms for data collections that reside on disks (or which are distributed over multiple disk drives, multiple computers or over the internet).Study of data models (knowledge representation, mappings, theoretical properties)Algorithms to run a large number of transactions on a database in parallel; finding efficient implementation for queries that access large databases; database backup and recovery,…Database designHow to use database management systems as an application programmer / end user.How to use database management systems as database administratorHow to implement database management systemsData summarization, knowledge discovery, and data miningSpecial purpose databases (genomic, geographical, internet,…)Christoph F. EickIntroduction Data ManagementData ModelData ModelSchema (definesa set of databasestates)Current Database Stateis used to defineChristoph F. EickIntroduction Data ManagementSchema for the Library Example using the E/R Data ModelMany-to-Many1-to-1 1-to Many Many-to-1titleauthorB#whenphonenamessnCheck_out PersonBook(0,35) (0,1)Christoph F. EickIntroduction Data ManagementRelational Schema for Library Example in SQL/92CREATE TABLE Book (B# INTEGER, title CHAR(30), author CHAR(20), PRIMARY KEY (B#));CREATE TABLE Person (ssn CHAR(9), name CHAR(30), phone INTEGER, PRIMARY KEY (ssn));CREATE TABLE Checkout( book INTEGER, person CHAR(9), since DATE, PRIMARY KEY (book), FOREIGN KEY (book) REFERENCES Book, FOREIGN KEY (person) REFERENCES Person));Christoph F. EickIntroduction Data ManagementReferential Integrity in SQL/92SQL/92 supports all 4 options on deletes and updates.Default is NO ACTION (delete/update is rejected)CASCADE (also delete all tuples that refer to deleted tuple)SET NULL / SET DEFAULT (sets foreign key value of referencing tuple)CREATE TABLE
View Full Document