Friday, February 6, 2009

Intel Conroe Core 2 Duo/Extreme Processors

Intel's new Core 2 architecture builds on the foundations laid down by the Core micro-architecture which debuted on Intel's current range of dual-core mobile CPUs. Again, those unfamiliar to the new terminology may know them as Yonah.

Let's be clear about this. Intel, by moving away from NetBurst and the ultra-high clockspeeds required for the Pentium 4 series to be competitive against AMD's Athlon 64 series, has decided that a new micro-architecture was required that would provide both excellent performance and be energy-efficient, too. That, then, spelt an end to the Pentium 4 as Intel's consumer CPU of choice. It's still around, of course, but Core 2 Duo/Extreme is now positioned as Intel's performance CPU.

The following table details the range of Core 2-based CPUs Intel will be offering right off the bat, along with a Pentium Extreme Edition 965 and Athlon 64 FX-62 for reference. We'll then discuss the reasons behind why Core 2 is destined to give AMD a very hard time.

Models

Intel is initially launching Core 2 Duo at a starting speed-grade of 1.83GHz. This model, designated E6300, will be equipped with 2MiB of L2 cache and falls under the codename of Allendale. We expect Intel to release a lower-clocked version in the near future. Other than a lower clock speed and 2MiB L2 cache, the E6300 is, for all intents and purposes, architecturally identical to the rest of the Duo range; it's no Celeron model. That's especially gratifying with respect to its relatively low price.

The range is headlined by the Intel Core 2 Extreme X6800, which operates at 2.93GHz and, like Extreme Editions before, is multiplier-unlocked. Deep-walleted enthusiasts will have some fun with this one.

Let's digest the information above, particularly the architectural elements, with some insightful commentary.
Architecture analysis
Here's what makes the Core 2 Duo tick, folks.

Dual-core support

Intel Core 2 Duo-based CPUs will harness two execution cores based on a single piece of silicon. The cores communicate with the rest of the system via a single bus, which will be clocked in at 1066MHz and offer around 8.5GB/s CPU-to-MCH bandwidth. Initial Core 2 Duo CPUs will be manufactured on Intel's proven 65nm process. Projections state that 45nm production will begin in Q2 2007. Core 2 Duo supports a 14-stage execution pipeline, down (read better) from the 31 present on the Prescott-based Pentium 4.

Wide Dynamic Execution

Current x86 processors can deliver 3 instructions per clock cycle. Core 2 Duo, however, has been architected to fetch, dispatch, execute and retire up to four full instructions simultaneously, offering a 33% boost over, say, a Pentium 4 CPU. Allied to this, Core 2 Duo also supports what Intel terms Macro-Fusion, which can combine certain common x86 instructions (pairs, say, compare and conditional jump) into a single instruction (micro-op) for execution, thereby reducing overall processing time. Processing efficiency is the name of the game here.

Intel Smart Memory Access and Advanced Smart Cache

Higher-end Core 2 Duo CPUs will be equipped with 4MiB of on-chip L2 cache, minimising the need to run back to system memory for frequently used data. Unlike the present Pentium 4 micro-architecture, Core 2 Duo's two cores will share the cache amongst each other. Intel's engineering team has found that forcing the cores to individually allocate and use cache is more efficient than allotting a fixed, per-core amount. By varying the amount of cache split over the two cores Intel hopes that cache misses, the bane of modern CPUs in terms of execution efficiency, will be further reduced.

Core 2 Duo also supports what Intel terms Smart Memory Access. Put simply, and falling under the banner of memory disambiguation, it's a form of out-of-order, built-in intelligence that predicts and loads the upcoming instruction data before current store instructions have been processed. Intel has designed algorithms that can accurately predict whether a load can be processed before the store, thereby, again, potentially saving overall execution time.

Coupled with a heavily optimised cache, and thinking about it how it all fits together, the Core 2 Duo's memory access latency will be better than the present Pentium 4's, for the reasons discussed above. Intel has toyed with the idea of integrating a memory controller right on the CPU die itself, a la AMD, but reckons that Core 2 Duo's intelligent architecture masks latency well enough for it to do without. We'll put this assertion to the test in our ScienceMark 2.0 memory latency analysis.

Advanced Media Boost

Increasing efficiency with Streaming SIMD (Single Instruction Multiple Data) Extentions, Core 2 Duo CPUs are able to process a 128-bit instruction in a single clock cycle, rather than requiring the incumbent two clocks that current generations employ.

Intelligent Power Capability

Intel has designed Core 2 Duo not only to perform well on a clock-for-clock basis but also to be energy-efficient whilst doing so. This is precisely where its mobile heritage shines through. Intelligent, which seems to be the watchword for Core 2 Duo, management monitors core usage and application requirements such that it can power-gate parts of the CPU when not in use; there's little need for two cores running at full power in single-threaded applications, for example. Intel reckons that it has improved the physical requirements of power-gating enough for it to offer better power consumption than previous generations'.

Intel indicates that all Core 2 Duo models (barring Extreme) will harness a 65W TDP; half that of the Pentium Extreme Edition 965 CPU. The Core 2 Duo Extreme, however, ships with a slightly higher 75W TDP.

Further, Conroe will ship with an integrated digital thermal sensor. The sensor is embedded on the die itself and will give more-accurate readings. Incidentally, the 965 Express chipset family supports Intel Quiet System Technology, which intelligently manages processor and system fan-speeds in relation to core temperature, ensuring the fan(s) are spun just fast enough to keep the processor from throttling.

Virtualisation Technology, 64-bit processing

Virtualisation Technology offers hardware-isolated virtual partitions that allow the user to run multiple operating systems on one PC, and 64-bit processing (EMT64), along with Execute Disable Bit, is carried over from the Pentium 4 line of CPUs.

No comments:

Post a Comment