A History of SupercomputingOutlineDefinitionsNames and PlacesEarly days: ColossusColossusEarly days: EniacEniacOther early supercomputersMainframesMainframes: LARCMainframes: Stretch and HarvestMainframes: CDC 6600CDC 6600 ConsoleMainframes: IBM 360/91IBM 360/91 ConsoleMainframes: CDC 7600Two CDC 7600sSIMD arrays: Illiac IVILLIAC IVSIMD Arrays: PEPE and BSPOne-bit Arrays: STARAN, DAP, MPPOne-bit Arrays: Connection MachineVector pipelining: TI ASCVector Pipelining: CDC Star 100Amdahl’s lawVector Pipelining: Cray-1Cray-1VLIW: FPS, Multiflow, and CydromeShared Memory: Cray Vector SystemsShared Memory: MultithreadingShared Memory: CC-NUMADistributed Memory: Cosmic CubeDistributed Memory: Classic MPPsDistributed Memory: ClustersThe Top 500 ListJapanese Systems: Fujitsu, NEC, HitachiJapanese National ProjectsDARPA Strategic Computing ProgramThe DOE ASCI ProgramWhat Next?Apologies: Companies Not Mentioned1A History of SupercomputingBurton SmithMicrosoft2OutlineDefinitionsEarly days: Colossus and EniacMainframes: Univac LARC to CDC 7600SIMD systemsProcessor arrays: Illiac IV, PEPE, and BSPOne-bit arrays: Staran, DAP, and the Goodyear MPPVector pipelining: TI-ASC, CDC Star-100, and Cray-1 Wide instructions: FPS, Multiflow, and CydromeMIMD systemsShared memory: Cray X-MP to SGI AltixDistributed memory: Cosmic Cube to Blue GeneJapan-US competitionWhat next?3DefinitionsSupercomputer: the world’s fastest computer at the timeSIMD: Single Instruction Stream, Multiple Data StreamOne instruction at a time, each operating on an array of dataMIMD: Multiple Instruction Stream, Multiple Data StreamMultiple processors asynchronously issuing instructionsShared Memory: MIMD computer in which a common memory is accessible by all processorsUMA: Uniform memory access by all processorsNUMA: Non-uniform memory access, based on placementDistributed Memory: MIMD computer in which the memory is partitioned into processor-private regionsRISC: Reduced Instruction Set ComputerUses fast and simple instructions to do complex thingsPipelining: Processing in assembly-line style4Names and PlacesBMD/ATC: Ballistic Missile Defense Advanced Technology Center, Huntsville, ALBRL: Ballistics (now Army} Research Laboratory, Aberdeen, MDDARPA: Defense Advanced Research Projects Agency, VANASA-Ames: Ames Research Center, Mountain View, CANASA-Goddard: Goddard Space Flight Center, Greenbelt, MDLivermore: Lawrence Livermore National Laboratory, Livermore, CALos Alamos: Los Alamos National Laboratory, Los Alamos, NMNSA: National Security Agency, Fort Meade, MDSandia: Sandia National Laboratories, Albuquerque, NM5Early days: ColossusUsed at Bletchley Park, England during WW IIfor cryptanalysis of Lorenz SZ40/42 rotor systems (“Fish”)Only recently declassified by Her Majesty’s GovernmentConcepts by Max Newman et al.Designed and built by Tommy Flowersat the Post Office Research Station, Dollis HillFeatures:Paper tape loop data input at 30 mph = 5000 characters/sec1500 vacuum tubes (“valves”) in the Mark I, 2400 in Mark IIProgrammed by switchesNot a general-purpose computer6Colossus7Early days: EniacBuilt at Penn’s Moore School for BRL, Aberdeen, MDThe motivating application was artillery firing tablesIts first use was evaluating H-bomb feasibilityDesigned and built by J. Presper Eckert and John MauchlyUnveiled at the Moore School on February 15, 1946It had 17,468 vacuum tubes and consumed 174 KWReliability was a concern, so it was left on continuouslyIt was 80 feet long and weighed 30 tonsProgramming was via switches and patch cablesNot a general-purpose computer8Eniac9Other early supercomputersZuse Z3 (1941)The IAS machines (1952)Manchester/Ferranti Mark I (1951)Univac 1 (1951)10MainframesThe 50’s and 60’s saw fast progress in computer technologyMost of it was first used in “mainframe” supercomputersThese systems were used for both business and scienceLater, supercomputers became much more science-orientedExamples of new technology included:Magnetic core memoryTransistor logic circuitsFloating point hardwarePipeliningHigh level languages and compilers11Mainframes: LARCBegun in 1955 for Livermore and delivered in 1960Had dual processors and decimal arithmeticEmployed surface-barrier transistors and core memory12Mainframes: Stretch and Harvest• IBM 7030 (STRETCH)• Delivered to Los Alamos 4/61• Pioneered in both architecture and implementation at IBM• IBM 7950 (HARVEST)• Delivered to NSA 2/62• Was STRETCH + 4 boxes• IBM 7951 Stream unit• IBM 7952 Core storage• IBM 7955 Tape unit• IBM 7959 I/O Exchange13Mainframes: CDC 6600Seymour Cray and Bill Norris left Univac to start CDCCray’s CDC 1604 was the first successful transistor systemHe wanted to build a scientific supercomputerThe 6600, built with silicon transistors, shipped Sept. 1964The chief architects were Seymour Cray and Jim ThorntonThe 6600 had several notable featuresA very simple instruction set, lending itself to more speedNo decimal arithmetic needed or wanted10 arithmetic function units able to process in parallelThis overlapped execution style is now called superscalarIt was coordinated by a famous circuit called the scoreboardParallel I/O processors using a new idea called multithreadingSome say RISC means “Really Invented by Seymour Cray”A futuristic operator’s console was provided at no extra charge14CDC 6600 Console15Mainframes: IBM 360/91IBM’s Tom Watson was angry: "Last week, Control Data ...announced the 6600 system. I understand that in the laboratory...there are only 34 people including the janitor. Of these, 14 are engineers and 4 are programmers. Contrasting this modest effort with our vast development activities, I fail to understand why we have lost our industry leadership position by letting someone else offer the world's most powerful computer." The 360/91 (AKA 370/195) was IBM’s answerIt was delivered to NASA-Goddard in October 1967Killer App: Passenger Airline Reservation Systems (1964)Jointly developed by IBM, American, and United AirlinesOriginally written in IBM assembly (i.e. machine) languageIt was very close to the CDC 7600 in performanceThe 360/91 also had a pretty impressive console…16IBM 360/91 Console17Mainframes: CDC
View Full Document