@Qaridarium
The speed is exactly the same as it is the same silicon, just with different features enabled. So the benchmarks are absolutely correct. I had only ssh access to that Opterons (running Kanotix live) but i dont know anybody with the newer ones, do you? The gfx part is maybe different but thats not tested in that benchmark. You mainly need the Xeon flavour when you want to use ECC together with a workstation chipset like C206, the cpu itself would run in any desktop s1155 board as well, you just can not use ECC.
http://ark.intel.com/compare/52214,52213,52277
All i can say is: too many cores hurt performance as there is much more work to synchronize. A smaller source code like mplayer2 and the performance of quad with higher single core performance beats 24 cores by 100% difference! All compile tests resulted in debian packages, so the time where configure was done and creating deb packages was included as well, i did the tests with cached depends in pbuilder for the kernel, so dl speed differences are not tested. It does not help much when you try artificial workloads, a kernel so minimal that you can compile (but not package) within 60s is just useless when it has to run on lots of systems.