Ashley Heisner
BMGT301 Final Study Guide

Big Data

What is Big Data?
• Companies generate and store Big Data
  o Over 235 terabytes of data in each
  o More data in each company than the US Library of Congress
• A corporate asset

Business Benefits
• Increased revenue
• Avoided costs
• Retaining profitable customers
  o Improved customer service
• Avoiding unprofitable customers
• Keep/improve profitable activities
• Stop/correct unprofitable activities

Business Intelligence Systems
• Use data created by other systems
• Provide reporting and analysis for organizational decision making
  o Collect data and information
  o Discern patterns and meaning in the information
  o Respond and act on the resultant information

Data Rich, Information Poor
• 2 real life examples
• High amounts of data
  o Transactional systems: every business interaction
  o Operational systems: every business process
  o Ex: census data, forecast, weather map
• Independent, unrelated databases
• Now: social media
• GIGO: Garbage In, Garbage Out
• The ideal: all data can be related; any system can use it
  o Transactional, operational, external, observational
• With Business Intelligence, time IS money

Data Mining: analyze data to extract information
• Find hidden patterns
• Information is NOT found in raw data alone
• Tools
  o Statistics
  o Intelligent agents
  o Queries
  o Reports
  o Multi-dimensional analysis
• Cluster Analysis: technique used to divide an information set into mutually exclusive groups so that the members of each group are as close together as possible and the different groups are as far apart as possible
• Association Detection: patterns in dependencies
• Market Basket Analysis: determining which products customers buy together and how an organization can use this to cross-sell more products or services
  o Predicts future behavior by identifying affinities among customers' choices of products and services
• Statistical Analysis: performs such functions as information correlations, distributions, calculations, and variance analysis
  o Beer sales don't necessarily rise in warm weather
  o Cloud coverage, wind, and temperature affect lotion sales most
  o Healthy snack sales can increase during both good and bad weather, depending on location
  o In winter and summer, NY sales rise with above-average temperature and below-average cloud coverage
  o Bottled water is affected depending on the season; humidity can matter for sales more than temperature
• Affinity Grouping: group formed around a shared interest or common goal
• Forecasting: process of making statements about events whose actual outcomes have not been observed
• Estimation: determine values for an unknown continuous variable behavior or estimated future value
• Time Series of Information: sequence of data points measured at successive points in time, spaced at uniform time intervals

Moore's Law

• Volatile Memory: wiped clean when power is cut off from a device
• Random Access Memory (RAM): very fast, chip-based volatile storage in a computing device
• Non-volatile Memory: storage in a computing device that retains data even when powered down
  o Ex: flash memory, hard disk, read/write CDs and DVDs
• Read Only Memory (ROM): memory that is non-volatile and can only be read; cannot be added to or updated
  o Ex: read-only CDs, read-only DVDs, movie film, burned memory chips
• Microprocessor / Central Processing Unit (CPU): responsible for performing all of the operations of the computer; the brain of the computing device
  o Arithmetic Logic Unit (ALU): performs math and logical operations
  o Control Unit: fetches program instructions, decodes instructions, retrieves data, stores results
• Grid Computing: collection of many computers or microprocessors that uses special software which allows them to work together to reach a common goal
• Supercomputers: computers that are among the fastest of any in the world at the time of their introduction
• Massively Parallel: many microprocessors working together to solve problems
  o Ex: IBM's Watson won on Jeopardy
• Utility Computing: on-demand computing rented from an external provider; paid for on an as-needed basis
• Cloud Computing: instead of running apps yourself, they run on a shared data center
  o Costs less and is fast to get started
  o Just log in, customize the app, and use it
• Moore's Law: Dr. Gordon Moore of Intel, in the 1970s
  o Hypothesis: computer processing performance would double every 18 months
• Moore's Law Constraints
  o Manufacturing
  o Heat and power
  o Reliability
  o Size of chip and speed of light
  o Lengths, widths, and complexities of chip pathways
• Multicore microprocessors now mainstream
  o More powerful chip
  o Most new PCs and laptops sold have at least a two-core (dual-core) processor
  o Can run older software written for single-brain chips, but that software will use only one core at a time
• Moore's Law for Storage
  o Faster and cheaper hard drives
  o Not directly part of Moore's Law
  o Ex of groups that benefit: Amazon, Google, Netflix, Facebook
• E-Waste: old and obsolete electronics and computers
  o Hard to dispose of because the process of separating out the densely packed materials is extremely labor-intensive and expensive
• 3 interrelated forces that slow advancement of Moore's Law: size, heat, and power

Disruptive Technology

Why disruption?
• Information technology is everywhere and in almost everything
• Combinations of technologies could multiply impact
• The nature of work will change, and millions of people will require new skills

4 Characteristics of a Disruptive Technology
1. Technology is rapidly advancing or experiencing breakthroughs
2. The potential scope of impact is broad
3. Significant economic value could be affected
4. Economic impact is potentially disruptive

Disruptive Trend
• $5 million vs. $400: cost of a computer in 1975 vs. an iPhone 4 with equal performance

Potential Disruptors
• Mobile Internet: increasingly inexpensive and capable mobile computing devices and internet connectivity
• Automation of knowledge work: intelligent software systems that can perform knowledge work tasks involving unstructured commands and subtle judgments
• The Internet of Things: networks of low-cost sensors and actuators for data collection, monitoring, decision making, and process optimization
  o Links machinery, equipment, and other physical assets with networked sensors and actuators
  o Captures data and manages performance
  o Enables machines to collaborate and even act on new information independently
  o Application of Internet of Things
    ▪ Remote monitoring of assets, systems, and people
    ▪ Improving preventive maintenance and performance management using real-time data
    ▪ Optimizing performance of complex systems, including through closed-loop decision making
    ▪ Providing Quantified Self applications for people to
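
Worked example for Market Basket Analysis (from the Data Mining notes above). This is only a minimal sketch of the "which products do customers buy together" idea, not something from the lecture: it counts how often each pair of items shows up in the same basket and reports that as a share of all baskets (often called support). The baskets, the item names, and the function name pair_support are all made up for illustration.

```python
from collections import Counter
from itertools import combinations


def pair_support(baskets):
    """Return {(item_a, item_b): share of baskets that contain both items}."""
    counts = Counter()
    for basket in baskets:
        # sorted(set(...)) drops duplicate items and gives every pair
        # one consistent ordering, so (a, b) and (b, a) are counted together.
        for pair in combinations(sorted(set(basket)), 2):
            counts[pair] += 1
    total = len(baskets)
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Hypothetical baskets, invented purely for illustration.
baskets = [
    ["chips", "salsa", "soda"],
    ["chips", "salsa"],
    ["soda", "bottled water"],
    ["chips", "soda"],
]

support = pair_support(baskets)

# List pairs from most to least frequently bought together.
for pair, share in sorted(support.items(), key=lambda kv: -kv[1]):
    print(pair, share)
```

Pairs with a high share are the candidates for cross-selling. A real analysis would likely also look at measures such as confidence or lift, which this sketch leaves out.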