Whisper Engines Collection Compiled engines for running Whisper with TRT LLM for much faster inference. • 243 items • Updated about 13 hours ago
Whisper Engines Collection Compiled engines for running Whisper with TRT LLM for much faster inference. • 243 items • Updated about 13 hours ago
baseten/writer_palmyra_fin_70b_32K_i25000_o7000_bs16_tp4_A100_0.9.0.dev2024040200 Updated Jul 8, 2024 • 13
baseten/llama3-8b-i7000-o1000-with-lora-trtllm-0.11.0.dev2024052100-h100-mig-tp1-fixed-2 Updated Jun 28, 2024 • 12
baseten/writer_palmyra_med_70b_8k_i7192_o2048_bs42_fp16_A100_tp4-tllm_0.9.0.dev2024040200 Updated Jun 23, 2024 • 10
baseten/writer_palmyra_med_70b_32K_i25000_o7000_bs16_tp4_A100_0.9.0.dev2024040200 Updated Jun 14, 2024 • 14
baseten/writer_llama3_70b_32K_i25000_o7000_bs16_tp4_A100_0.11.0.dev2024052100 Updated Jun 14, 2024 • 14