Announcement

**milkylainen** · 08 February 2019, 02:06 PM

That cachebench result is quite interesting. I'm betting that Clang is using faster builtins for mem* on Power.
If you recompile cachebench without builtins that would probably even out the result.

**Michael_S** · 09 February 2019, 09:29 AM

Originally posted by milkylainen View Post

That cachebench result is quite interesting. I'm betting that Clang is using faster builtins for mem* on Power.
If you recompile cachebench without builtins that would probably even out the result.

I don't understand, does the use of faster builtins by Clang somehow cheat the benchmark? Or should this be a feature that GCC could implement too?

**milkylainen** · 09 February 2019, 01:04 PM

Originally posted by Michael_S View Post

I don't understand, does the use of faster builtins by Clang somehow cheat the benchmark? Or should this be a feature that GCC could implement too?

No. Absolutely no cheating. Mem* functions are notoriously difficult to implement without making them greedy prefetchers. Since cachebench is mostly c library mem* and various simple closed loop array walks, compilers should matter less, not more.

Either way, cachebench tests are rather trivial. Tests that stick out like a sore thumb in cachebench between compilers do warrant closer investigation.

It should be pretty easy to understand why either one is faster compared to more complex source.

**yurikoles** · 10 February 2019, 12:43 PM

Michael, a typo:

Originally posted by phoronix View Post

"-O3 -mtune-native mcpu=native"

Maybe "-mcpu=native" with dash?

Announcement

GCC 8/9 vs. LLVM Clang 7/8 Compiler Performance On POWER9 With The Raptor Talos II

GCC 8/9 vs. LLVM Clang 7/8 Compiler Performance On POWER9 With The Raptor Talos II

Comment

Comment

Comment

Comment