AMD EPYC 9965 "Turin Dense" Delivers Better Performance/Power Efficiency vs. AmpereOne 192-Core ARM CPU

Written by Michael Larabel in Processors on 10 October 2024 at 02:00 PM EDT. Page 3 of 5. 21 Comments.
Coremark benchmark with settings of CoreMark Size 666, Iterations Per Second. AmpereOne A192-32X was the fastest.

The synthetic Coremark benchmark was one of the few cases where the AmpereOne A192-32X did come out ahead of the EPYC 9965.

Algebraic Multi-Grid Benchmark. EPYC 9965 was the fastest.

The AmpereOne A192-32X is limited to eight channels of DDR5-5200 memory, whereas AMD EPYC Turin allows 12-channel memory at DDR5-6000 speeds. Ampere Computing is supposedly shipping AmpereOne M with 12-channel DDR5 memory support this quarter, but the memory speeds have yet to be confirmed. The greater memory bandwidth for EPYC Turin benefits workloads like AMG.
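To put the channel and speed difference in rough numbers, here is a minimal sketch of the theoretical peak memory bandwidth per socket. It assumes a 64-bit (8-byte) data path per DDR5 channel and ignores real-world efficiency, so the results are upper bounds and not measured figures. The function name `peak_bandwidth_gbs` is purely illustrative.

```python
def peak_bandwidth_gbs(channels: int, mega_transfers: int, bytes_per_transfer: int = 8) -> float:
    """Theoretical peak bandwidth in GB/s.

    Assumes each channel moves `bytes_per_transfer` bytes per transfer
    (8 bytes for a 64-bit DDR5 channel) at `mega_transfers` MT/s.
    """
    return channels * mega_transfers * bytes_per_transfer / 1000


# Memory configurations as described in the article.
ampereone = peak_bandwidth_gbs(channels=8, mega_transfers=5200)
epyc_turin = peak_bandwidth_gbs(channels=12, mega_transfers=6000)

print(f"AmpereOne A192-32X: {ampereone:.1f} GB/s")   # 332.8 GB/s
print(f"EPYC 9965:          {epyc_turin:.1f} GB/s")  # 576.0 GB/s
print(f"Ratio:              {epyc_turin / ampereone:.2f}x")  # 1.73x
```

On paper that is roughly 1.7x the peak bandwidth for the EPYC platform. Sustained bandwidth in practice will be lower on both systems.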

WRF benchmark with settings of Input: conus 2.5km. EPYC 9965 was the fastest.

The greater memory bandwidth and Zen 5 advantages helped the EPYC 9965 perform much better than AmpereOne. This EPYC 9965 run was also in a sub-optimal configuration: an overheating issue with one of the DIMMs caused DDR5 memory throttling in this benchmark. But even with this subpar EPYC 9965 WRF run, it still comes out well ahead of the AmpereOne CPU in this 192-core battle.

LULESH benchmark. AmpereOne A192-32X was the fastest.

The LULESH hydrodynamics benchmark was a rare upset, with the AmpereOne A192-32X coming out ahead of the EPYC Turin Dense 192-core processor.

LAMMPS Molecular Dynamics Simulator benchmark with settings of Model: 20k Atoms. EPYC 9965 was the fastest.
miniFE benchmark with settings of Problem Size: Small. EPYC 9965 was the fastest.
GROMACS benchmark with settings of Implementation: MPI CPU, Input: water_GMX50_bare. EPYC 9965 was the fastest.
QuantLib benchmark with settings of Configuration: Multi-Threaded. EPYC 9965 was the fastest.
GPAW benchmark with settings of Input: Carbon Nanotube. EPYC 9965 was the fastest.
High Performance Conjugate Gradient benchmark with settings of X Y Z: 144 144 144, RT: 60. EPYC 9965 was the fastest.

The AMD EPYC 9965 Turin Dense processor delivered dominant performance in most of the HPC benchmarks tested compared to the AmpereOne A192-32X flagship ARM server processor.
