Original Link: https://www.anandtech.com/show/169
The Microprocessor: Revolutionary New Architecture: Dream or Reality?
by Anand Lal Shimpi on August 13, 1998 3:06 PM EST- Posted in
- CPUs
Quite a few of you may have been involved with computers long enough to remember the highly anticipated introduction of Intel's 386 processor. Or you may have been in to hardware long enough to remember wanting to custom build a multi-thousand dollar "high end" 486 system. Then again there are those of you who's earliest memory of a processor's introduction was with the Pentium who's dark history as the 586 was masked by bug fixes and higher clock speeds. With all of these new CPU's coming out at alarming rates, do you ever wonder if manufacturers are actually telling the truth when they claim that their "Revolutionary New Microprocessor" is unlike anything the world has seen before?
A few months back AnandTech conducted a series of tests on the current processors out at the time for the Evolution of the Microprocessor Article. By making sure all processors had their L1 and L2 caches clocked at equivalent (or close to equivalent) speeds, and by essentially leveling the playing field we managed to make a few interesting discoveries about your favorite CPU's. For one thing, the Pentium II, by default, disables its on-board L2 cache if you exceed its locked Clock Multiplier Setting and also configures itself with a 2.0x multiplier setting. We also discovered that, at the lowest of levels, the Cyrix 6x86MX turned out to be the best overall processor however we also managed to illustrate the poor FPU performance of non-Intel processors in spite of their manufacturer's claims of improved performance.
There were a few flaws in the original testing brought on by limitations of motherboards and the CPU's available at the time. The Pentium II operated its memory bus at 83MHz while the rest of the processors kept their memory bus at 66MHz, also the Pentium II was crippled in some tests due to its automatic disable of its on-board L2 cache. This time the bridge between CPU architectures and interfaces can be crossed much easier with the advent of the Super7 motherboard platform and the 100MHz Front Side Bus (FSB).
Chips & Bits
How can you compare a processor that interfaces with the motherboard via a Socket to a processor that interfaces with the motherboard via a Slot? It's actually quite simple, with a little help from a motherboard standard called Super7. The VIA MVP3 Chipset allows for the memory bus to be operated at an external frequency locked to the frequency which the AGP bus operates at. In the case of the tests AnandTech ran on the processors for this article, the AGP bus ran at 2/3 of the speed of the FSB which was clocked at 100MHz, making the AGP and Memory Buses run at 66MHz. Now keep in mind that because of the 100MHz FSB the L2 cache of the Super7 test system was operating at 100MHz, and for the sake of comparison all processors were clocked at 200MHz. Why? Well, let's take a look at the Pentium II to find out.
Leveling the playing ground is a difficult task when the Pentium II is brought into the game, the Pentium II derives its L2 cache speed from the Clock Speed of the CPU, not from the FSB speed. The L2 cache on a Pentium II operates at 1/2 of the clock speed, so in order to keep things fair the Pentium II would have to run at 200MHz in order for it to be compared to the Super7 chips since a 200MHz P2 would run its cache at half the clock speed, or 100MHz. Although the Intel 440BX chipset doesn't allow for operation of the memory bus at the speed of the AGP bus, we can simulate the same effect on the ABIT BH6 BX Motherboard used for the Pentium II tests. While all of the Super7 processors ran at the 100MHz FSB x a 2.0 clock multiplier, with the Memory Bus running at 66MHz the Pentium II test system ran at the 66MHz FSB x a 3.0 clock multiplier which kept the memory bus at 66MHz while the L2 cache still operating at 1/2 the clock speed, being 100MHz.
A Truly "Fair" Comparison
As discussed in the original Evolution of the Microprocessor Article, you may believe that in order for a fair comparison one would have to disable the L1 cache on all the processors, however doing so is penalizing a processor for being "itself," the way its manufacturer intended it to be. For example, the Pentium II mutates from a hideously fast daemon to a mere tortoise when you take away its precious L1 cache, to put it into perspective, imagine running Win95 on a 286..it isn't pretty, try running it on a Pentium II without L1 cache...the scene isn't all that different. So none of the processors were penalized for any enhancements that happened to be included in the physical die of the chip (the actual chip, not any additional enhancements made outside of the chip's packaging) since you can't order a 6x86MX with only 32KB of L1 cache and you can't order a Pentium II with 64KB of L2 (not yet at least), the L1 cache is part of the processor so it would be unfair to disable it.
Each processor was run at a clock speed of 200MHz, with their L1 cache operating at 200MHz, their L2 cache operating at 100MHz, their memory bus operating at 66MHz, and with their AGP bus also running at 66MHZ.
AMD K6-2 | Cyrix M-II | Intel Pentium MMX | Intel Pentium II | |
CPU Interface | Socket-7 | Socket-7 | Socket-7 | Slot-1 |
L1 Cache (size) | 64KB | 64KB | 32KB | 32KB |
L1 Cache (speed) | 200MHz | 200MHz | 200MHz | 200MHz |
L2 Cache (size) | 512KB | 512KB | 512KB | 512KB |
L2 Cache (speed) | 100MHz | 100MHz | 100MHz | 100MHz |
CPU Clock | 200MHz | 200MHz | 200MHz | 200MHz |
External System RAM | 64MB PC100 | 64MB PC100 | 64MB PC100 | 64MB PC100 |
Memory Bus Clock | 66MHz | 66MHz | 66MHz | 66MHz |
Front Side Bus Clock | 100MHz | 100MHz | 100MHz | 66MHz |
AGP Bus Clock | 66MHz | 66MHz | 66MHz | 66MHz |
Motherboard | FIC VA-503+ 1.1B | FIC VA-503+ 1.1B | FIC VA-503+ 1.1B | ABIT BH6 |
Chipset | VIA MVP3 | VIA MVP3 | VIA MVP3 | Intel 440BX |
Each processor was benchmarked using Ziff-Davis Publications' Winstone 98, Winbench 98, and id Software's Quake 2. Each test was run a total of 3 times and an average of the three scores was taken and recorded as the final score. Each test system used 1 x 64MB Mushkin PC100 SDRAM DIMM, a 5.1GB Western Digital UltraATA HDD, a Matrox Millennium G200 (AGP) Video Card, and the latest drivers for all of the devices installed in the test system running under Windows 98. The Quake 2 tests were conducted in software mode and used no OpenGL or 3DNow! drivers. No processors produced any seemingly erroneous results and no crashes were experienced during the test7ing process.
The Results
Processor Performance Comparison - L2 Cache Enabled - Winstone 98 |
||||
AMD K6-2 | Cyrix M-II | Intel Pentium MMX | Intel Pentium II | |
Business Winstone 98 | 19.8 | 21.0 | 18.1 | 18.4 |
Winbench 98 - CPUMark32 | 631 | 538 | 462 | 492 |
Winbench 98 - FPU Winmark | 656 | 450 | 780 | 1020 |
Processor Performance Comparison - L2 Cache Enabled - Quake 2 |
||||
AMD K6-2 | Cyrix M-II | Intel Pentium MMX | Intel Pentium II | |
Timedemo Demo 1 | 9.7 | 9.5 | 11.0 | 12.5 |
Processor Performance Comparison - L2 Cache Disabled - Winstone 98 |
||||
AMD K6-2 | Cyrix M-II | Intel Pentium MMX | Intel Pentium II | |
Business Winstone 98 | 14.2 | 17.0 | 12.6 | 15.0 |
Winbench 98 - CPUMark32 | 335 | 318 | 275 | 317 |
Winbench 98 - FPU Winmark | 641 | 446 | 734 | 976 |
Processor Performance Comparison - L2 Cache Disabled - Quake 2 |
||||
AMD K6-2 | Cyrix M-II | Intel Pentium MMX | Intel Pentium II | |
Timedemo Demo 1 | 7.9 | 8.6 | 8.9 | 11.3 |
Explaining the Results
Shocking results once again wouldn't you say? Once again Cyrix steps up with the best overall Business Application performer with their M-II that consistently beat the competition in both a core-to-core comparison and a clock for clock comparison. Removing the L2 cache from the Cyrix test system left the computer relatively unaffected compared to the competition which is why the M-II still remains strong in the tests without L2 cache.
For raw FPU performance, the two Intel chips still bring the numbers to the table. Without L2 cache, the Pentium II still stands strong with the most powerful FPU which is why the cache-less Pentium II, the Celeron, has been so successful among gamers. The Pentium MMX isn't too far behind the Pentium II as far as FPU performance goes, however the difference between the two processors grows to a more noticeable degree once you remove the L2 cache.
The AMD K6-2 is the most surprising chip in this roundup, as its performance varies greatly courtesy of its incredible reliance on the performance boost the 100MHz FSB gives its L2 cache. With the L2 cache enabled, the K6-2's benefits from the 100MHz FSB are quite visible as it produces a CPUMark32 score of 631, even greater than that of a Pentium II. Because of this the K6-2 also gives the M-II a fair challenge under Winstone 98, falling behind by a total of 1.2 Winstone points. What happens when you take the L2 cache away from the K6-2? Then the system is no better than the older K6 system used in the original Evolution Article. The performance drop is incredible between a K6-2 with and without L2 cache, great enough to fully illustrate the need for the K6-2 to be run at a 100MHz FSB. A surprising result is the performance of the K6-2 under Quake 2 without L2 cache enabled and without 3DNow! drivers, the 7.9 frames per second the K6-2 cranked out was easily beat by the 8.6 the M-II test system managed to pull. Even with L2 cache enabled the K6-2 only offers a 0.2 fps improvement over the M-II illustrating the second reason for the K6-2's success, its 3DNow! instructions which boost its performance under supported games to Pentium II levels.
Conclusion
Luckily you don't have to run your processors without L2 cache, and luckily all of the above processors aren't locked at 200MHz. With the Pentium II roaming around the 450MHz mark, and the K6-2/M-II available in 300MHz (or PR2-300) flavors these tests do nothing more than illustrate how processors change over time, very little. What do you think the difference between a Pentium II 400 and a 450 is? Not much. Once you approach a new processor generation then you begin noticing differences, however until then most of the architectural differences among CPU's are negligible.