LLVM Clang Shows Off Great Performance Advantage On NVIDIA GH200's Neoverse-V2 Cores
Right away Clang was showing the ability to outperform the GCC-built benchmarks/workloads on this NVIDIA GH200 server.
Across various HPC workloads the Clang AArch64 binaries were significantly faster than using the current GCC 13 stable series. Then again we've seen competitive x86_64 and AArch64 performance for a while to GCC though typically not to some of the extremes seen in this round of testing. Though given Clang being more common on AArch64 due to its use by Apple, Android, etc, the nice performance wins aren't too surprising.
GCC 13 did pick up a few wins in some of the Zstd compression benchmarks.
Even for workloads like WebP image generation the Clang-built binaries were faster on the Neoverse-V2 CPUs.