Difference between cuda, DML and CPU onnx models?

by PaulTheHuman - opened Sep 27, 2024

Sep 27, 2024

Hi, can someone explain what the difference is between the Cuda, DML and CPU onnx models?
Are they interchangeable?
Since they are all in the onnx format shouldn't they all work on all devices?
Or is it simply that some work better on certain devices?

kvaishnavi

Microsoft org Dec 14, 2024

The originally uploaded models were optimized for each execution provider. Due to precision differences and operator support, they were not interchangeable and worked best for certain devices. With the newly uploaded models, there is now one optimized ONNX model for CPU and one optimized ONNX model for GPU. The work to support one unified, optimized ONNX model for both CPU and GPU is in progress.

kvaishnavi changed discussion status to closed Dec 14, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment