⚡ WebGPU Benchmark Results (2.13x speedup) - Windows fp16 vs. fp32
#56
by
Xenova
HF staff
- opened
Batch Size | WebGPU (fp16) | WebGPU (fp32) |
1 | 11.70 | 14.10 |
2 | 45.80 | 56.10 |
4 | 41.10 | 57.70 |
8 | 97.70 | 133.30 |
16 | 153.40 | 215.70 |
32 | 338.60 | 722.50 |
- Model: Xenova/all-MiniLM-L6-v2
- Tests run: WebGPU (fp16), WebGPU (fp32)
- Sequence length: 512
- Browser: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/122.0.0.0 Safari/537.36
- GPU: vendor=nvidia, architecture=turing, device=, description=