license: mit
pipeline_tag: text-generation
tags:
  - ONNX
  - DML
  - ONNXRuntime
  - phi3
  - nlp
  - conversational
  - custom_code
inference: false
language:
  - en

EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-directml

Performance Metrics

DirectML

We measured DirectML performance on an AMD Ryzen 9 7940HS w/ Radeon 78.

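| Prompt Length | Generation Length | Average Throughput (tps) |
|---------------|-------------------|--------------------------|
| 128           | 128               | -                        |
| 128           | 256               | -                        |
| 128           | 512               | -                        |
| 128           | 1024              | -                        |
| 256           | 128               | -                        |
| 256           | 256               | -                        |
| 256           | 512               | -                        |
| 256           | 1024              | -                        |
| 512           | 128               | -                        |
| 512           | 256               | -                        |
| 512           | 512               | -                        |
| 512           | 1024              | -                        |
| 1024          | 128               | -                        |
| 1024          | 256               | -                        |
| 1024          | 512               | -                        |
| 1024          | 1024              | -                        |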
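
The table above leaves the throughput column unfilled. As a rough illustration of how such figures could be collected, the sketch below times a generation call for each prompt/generation length pair and reports the mean tokens per second. It is not the procedure used for this model card: `average_throughput`, `run_grid`, and `fake_generate` are hypothetical names, and `fake_generate` is only a stand-in that sleeps instead of calling a real model. To benchmark an actual model, one would pass in a callable that runs it and returns the number of tokens produced.

```python
import time
from itertools import product
from typing import Callable, Dict, Tuple

# Grid of lengths taken from the table above.
PROMPT_LENGTHS = (128, 256, 512, 1024)
GENERATION_LENGTHS = (128, 256, 512, 1024)


def average_throughput(generate: Callable[[int, int], int],
                       prompt_len: int, gen_len: int, runs: int = 3) -> float:
    """Return the mean tokens per second over `runs` timed calls to `generate`."""
    rates = []
    for _ in range(runs):
        start = time.perf_counter()
        produced = generate(prompt_len, gen_len)  # number of tokens generated
        elapsed = max(time.perf_counter() - start, 1e-9)  # avoid division by zero
        rates.append(produced / elapsed)
    return sum(rates) / len(rates)


def run_grid(generate: Callable[[int, int], int],
             runs: int = 3) -> Dict[Tuple[int, int], float]:
    """Measure every (prompt length, generation length) pair in the grid."""
    return {
        (p, g): average_throughput(generate, p, g, runs)
        for p, g in product(PROMPT_LENGTHS, GENERATION_LENGTHS)
    }


def fake_generate(prompt_len: int, gen_len: int) -> int:
    """Hypothetical stand-in for a model call: sleeps briefly, reports gen_len tokens."""
    time.sleep(0.001)
    return gen_len


if __name__ == "__main__":
    # Numbers printed here reflect the stand-in, not the model.
    for (p, g), tps in run_grid(fake_generate, runs=1).items():
        print(f"{p:>6} {g:>6} {tps:12.1f}")
```

Averaging over several runs smooths out one-off stalls; a warm-up call before timing is also common, since the first call often includes one-time setup cost.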