About Coring and Distancing
(Translation of the German original in c't by Marcel Sieslack)
Hot chips and hot decisions hailed from Palo Alto in California: the press release from HP about the detachment of its PC business and revelations by the developers at the Hot Chips conference.
German renovator at work: Just as "Mr. Turnaround" Eckard Pfeiffer once pulled Compaq out of the loss area and turned it into the largest PC manufacturer with a two-fisted renovation program – together with the acquisition of Tandem and Digital Equipment –, now, twenty years later, his successor Léo Apotheker intends to realign Hewlett-Packard, the company that bought Compaq, for the challenges of the future.
Better leave a possibly sinking PC ship and sink the capital into business software instead – that's probably how one has to interpret the HP boss' decision, who is thinking to buy the English-American software company Autonomy for the steep price of 7 billion British pounds in order to strengthen HP's software-wise position against IBM, Oracle and others.
The unveiled Bulldozer die measures 315 mm² and should have around 1.6 billion transistors. With his estimate of 320 mm², chip architect Hans de Vries made a good early guess one year ago.
Driving Forces
Well, conventional databases and dumb search engines are outdated and Autonomy provides modern software for search and knowledge management that can handle unstructured requests and thousands of different file formats for text, audio and video.
But who will want to buy HP's Personal Systems Group? Samsung, Acer, EMC², Google, Abu Dhabi? Michael Dell, at least, makes fun of HP's decision on Twitter: "HP.... They are calling it a separation but it feels like a divorce", "...maybe they will call it Compaq?" and other, similar comments.
But the times when PC manufacturers were the driving force of PC technology are long gone, anyway – today, they mostly have Chinese partners assemble their machines from building bocks –, and now processor firms like Intel and AMD in liaison with software companies like Microsoft have taken over. And to ensure it stays that way, Intel and Microsoft will coincidentally hold their big developers conferences IDF and BUILD at the same time this September. Prior to that, they will be having a joint Rütli day to swear fealty to x86 on some hill in San Francisco – for a while, it had indeed appeared as though Windows 8 had only been designed for ARM.
At the Hot Chips conference at the famous Stanford University, mere three miles from HP's headquarters in Hanover Street in Palo Alto, ARM dinosaur Simon Segars had the privilege of giving the keynote speech. Boldly, he foretold that the superphones will make notebooks and netbooks obsolete in no more than two years: golden times for ARM – if Intel doesn't manage to gain a foothold here. On the following day, speculations ran wild again: at the bargain price of 10 billion pounds, Intel could just buy the lively competitor up. However, the analysts at Wall Street know that not only the European Commission, the EU's antitrust authority, but also the comparatively tolerant American Federal Trade Commission (FTC) would strongly protest against such a deal.
At the same conference, AMD presented the eagerly expected Bulldozer chip in more detail, in all of its 11 metal layers. It was disclosed that the chip, with its eight cores housed by four modules, measures a total of 315 mm² and that the "contacted gate pitch" – the distance between the outer transistor connectors – for the lower three metal layers is only 104 nm. For comparison: Intel's Sandy Bridge packs four cores plus GPU on 216 mm², manages with two layers less in the current 32 nm process (P1268), but the critical layers have a slightly larger pitch of 112.5 nm. And so it's not surprising that Intel's chips require slightly more space per transistor.
The possibly more important question, right now, of when the Bulldozer will roll out, remained unanswered, though – at least officially. It appears that, in spite of a certain jocose video clip, the Bulldozer won't be around before mid-October – but then maybe in its C0 stepping already.
Intel was conspicuously tight lipped concerning the next processor generation: no news regarding the Sandy Bridge E/EP or the successor Ivy Bridge. The audience only got a more detailed insight into the power management and the turbo boost 2.0 of the current generation. However, Intel shed some more light on the Itanium processor Poulson, planned for 2012. With eight cores and 3.1 billion transistors it will have less transistors per core than the current Itanium Tukwila. Thanks to the Instruction Replay Technology (IRT), Poulson can cancel and re-execute commands when an error occurs. Additionally, it supports "Dual Domain Multithreading": frontend (fetch commands and decode them) and backend (execute and write back results) can thread independently from each other. That doesn't sound too bad and maybe the chip is so good that HP is going to restructure its business model because of it. At least, the Autonomy software also supports HP-UX-Itanium.
Slimmer but Higher
The company Oracle intends to enter the game, too. But instead of further software for the Itanium, it's counting on SPARC hardware. To this end, the former Sun developers had to significantly "core down" the Niagara 3 processors in order to better attune them to the needs of the Oracle software. While elsewhere the motto continues to be "more cores", Oracle now moves into the opposite direction, as announced by Larry Ellison about nine months ago: the T4 (Yosemite Falls), expected at the end of 2011, will only have 8 instead 16 cores, but they will be more powerful with almost double the clock speed, local L2 caches, out-of-order execution and 2-issue superscalar. So at least the single-thread performance of the upcoming waterfalls will "explode". As proof, Oracle cites comparative results from the SPEC-CPU2006 benchmark, which show a factor five and factor seven performance increase, for the integer and floating-point suite respectively, in comparison to the predecessor T3. However, a possible auto-parallelization of the compiler might have strongly distorted the results.
Strangely enough, Oracle doesn't provide any SPECrate values for multiple threads although only those have until now been estimated as important and published by Oracle/Sun/Fujitsu. In any case, in the transaction benchmark TPC-C, the new eight-core chip only has a very slight advantage over the old T3 with 16 cores. Which means that, for the chip to be recommendable, it will have to be significantly more power efficient than its predecessor.