Intel NPU Driver Being Updated To Handle Larger AI Workloads

Written by Michael Larabel in Intel on 17 October 2024 at 04:54 PM EDT. 2 Comments
INTEL
Following the recent patch work for enabling the Intel 5th Gen NPU premiering with Panther Lake, a new patch series posted today brings a number of improvements for this Intel neural processing unit driver -- including the ability to handle larger workloads.

A set of patches posted today for the Intel IVPU accelerator driver remove support for the deprecated and unused copy engine, improve the open() performance via lazy allocating MMU page tables, error handling fixes, and support for bigger workloads. Being able to run bigger workloads on the Intel NPU comes with the VPU address ranges being extended.

Today's patch series sums it up as:
- Remove support for deprecated and unused copy engine
- Improved open() performance by lazy allocating MMU page tables
- Error handling fixes in MMU code
- Extend VPU address ranges to allow bigger workloads

The NPU copy engine was deprecated by the NPU firmware and is no longer supported/used. The compute engine is now responsible for all copy engine functionality.

Intel NPU architecture diagram


As for allowing bigger workloads to run on the Intel NPU via increasing the DMA address range, the patch sums up the work as:
Increase DMA address range to:
* 128 GB on 37xx (due to MMU limitations)
* 256 GB on other generations
Merge User and DMA ranges on 40xx and above as it is possible to access whole 256 GBs from both FW and DMA.

Increase User range on 37xx from 255MB to 511MB to allow loading very large models.

The Intel NPU 37xx IP is used with Meteor Lake SoCs.

Another patch in the series moves the secondary preemption buffer allocation to the DMA range to allow allocating more command queues with preemption buffers without running out of the address range.

These Intel IVPU driver changes are good news for allowing larger models to run with the Intel NPU hardware. Stay tuned to Phoronix for knowing when these Intel NPU driver changes are upstreamed into the mainline kernel.
Related News
About The Author
Michael Larabel

Michael Larabel is the principal author of Phoronix.com and founded the site in 2004 with a focus on enriching the Linux hardware experience. Michael has written more than 20,000 articles covering the state of Linux hardware support, Linux performance, graphics drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated benchmarking software. He can be followed via Twitter, LinkedIn, or contacted via MichaelLarabel.com.

Popular News This Week