GCC 9 Compiler Tuning Benchmarks On Intel Skylake AVX-512

Written by Michael Larabel in Software on 8 March 2019 at 12:20 AM EST. Page 4 of 4. 28 Comments.

The C-Ray multi-threaded ray-tracer always craves a good compiler tuning.

These results basically jive with our past experiences in Intel Skylake-AVX512 compiler tuning and for tweaking the CFLAGS/CXXFLAGS in general for achieving greater performance. For most workloads, it pays off if you are satisfied with building out your own software packages.

This is an overall look at the performance of the 71 benchmarks when taking their geometric mean. Hitting at least "-O2" generally captures most of the performance potential, opting for "skylake-avx512" over "skylake" on capable AVX-512 CPUs did tend to really help out select workloads but not across the board, the benefits of link-time optimization (LTO) really come down to the application being tested, and opting for the most aggressive "-Ofast" mode helped out select cases. Those wanting to dig into this data more can find all of the numbers on OpenBenchmarking.org.

The GCC 9 stable release (GCC 9.1) should be out in the next few weeks.

If you enjoyed this article consider joining Phoronix Premium to view this site ad-free, multi-page articles on a single page, and other benefits. PayPal or Stripe tips are also graciously accepted. Thanks for your support.


Related Articles
About The Author
Michael Larabel

Michael Larabel is the principal author of Phoronix.com and founded the site in 2004 with a focus on enriching the Linux hardware experience. Michael has written more than 20,000 articles covering the state of Linux hardware support, Linux performance, graphics drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated benchmarking software. He can be followed via Twitter, LinkedIn, or contacted via MichaelLarabel.com.