⚡ WebGPU Benchmark Results (2.05x speedup) - Windows fp16 vs. fp32

#60
by Xenova HF staff - opened
Owner
Batch SizeWebGPU (fp16)WebGPU (fp32)
113.1014.10
249.8059.00
434.3056.90
8107.20146.40
16158.60227.70
32348.40714.10
  • Model: Xenova/all-MiniLM-L6-v2
  • Tests run: WebGPU (fp16), WebGPU (fp32)
  • Sequence length: 512
  • Browser: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/122.0.0.0 Safari/537.36
  • GPU: vendor=nvidia, architecture=turing, device=, description=

Sign up or log in to comment