OpenBLAS 0.3.18 Released With LoongArch64 Support, More Optimizations

Written by Michael Larabel in Programming on 2 October 2021 at 03:02 PM EDT.
OpenBLAS 0.3.18 is out today as the latest feature update to this widely-used, open-source BLAS implementation.

OpenBLAS 0.3.18 brings a wide assortment of improvements and fixes. Some of the highlights include:

- Support for LoongArch (LoongArch64), the new CPU architecture from Loongson that succeeds the company's earlier MIPS-based designs.

- New SGEMM/DGEMM kernels for small problem sizes on Intel Skylake-X.

- Optimized SBGEMM for Intel Cooper Lake processors.

- Support for run-time CPU core/thread count detection on Haiku OS.

- Better performance for DASUM on IBM POWER10.

- Crash fixes for Apple M1 on macOS.

- Fixed building under BSDs on MIPS/MIPS64.

- When the build-time number of pre-configured threads is exceeded at run-time, OpenBLAS now allocates an auxiliary control structure for up to 512 additional threads. Previously, OpenBLAS would abort when the pre-configured thread count was exceeded.
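
For a rough idea of what that last change means in practice, here is a toy Python model of the slot policy described above. It is only a sketch and not OpenBLAS's actual code: the names `SlotTable`, `ThreadSlotError`, and `register` are made up for illustration, and raising an error once the 512 extra slots are used up is an assumption, since the release notes only mention the 512-thread limit.

```python
class ThreadSlotError(RuntimeError):
    """Stands in for the abort that older releases performed (hypothetical name)."""


class SlotTable:
    """Toy model of per-thread slot bookkeeping; not OpenBLAS's real data structure."""

    AUX_SLOTS = 512  # "up to an additional 512 threads" per the release notes

    def __init__(self, preconfigured, allow_auxiliary=True):
        self.preconfigured = preconfigured      # build-time thread limit
        self.allow_auxiliary = allow_auxiliary  # False models the prior behavior
        self.primary = set()                    # threads holding build-time slots
        self.auxiliary = None                   # allocated lazily on first overflow

    def register(self, thread_id):
        """Assign a slot to thread_id and report which table it landed in."""
        in_aux = self.auxiliary is not None and thread_id in self.auxiliary
        if thread_id in self.primary or in_aux:
            return "existing"
        if len(self.primary) < self.preconfigured:
            self.primary.add(thread_id)
            return "primary"
        if not self.allow_auxiliary:
            # Prior behavior: exceeding the build-time limit was fatal.
            raise ThreadSlotError("pre-configured thread count exceeded")
        if self.auxiliary is None:
            self.auxiliary = set()  # allocate the overflow structure on demand
        if len(self.auxiliary) >= self.AUX_SLOTS:
            # Assumption: the model simply refuses threads beyond the extra 512.
            raise ThreadSlotError("auxiliary slots exhausted")
        self.auxiliary.add(thread_id)
        return "auxiliary"


if __name__ == "__main__":
    table = SlotTable(preconfigured=2)
    print([table.register(t) for t in range(4)])
```

With a build-time limit of two, the third and fourth callers land in the auxiliary table rather than triggering an error; constructing the table with `allow_auxiliary=False` reproduces the older abort-style behavior in this model.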

Downloads and more details on OpenBLAS 0.3.18 are available on GitHub.
