distil-whisper
/

distil-large-v3

reach-vb HF staff commited on Jun 7, 2024

Commit

871351a

verified ·

1 Parent(s): c4fbc17

Update README.md (#5)

- Update README.md (38967b8d0952dd7ebbc5634f8933bc626773d5b7)

Co-authored-by: Vaibhav Srivastav <reach-vb@users.noreply.huggingface.co>

Files changed (1) hide show

README.md CHANGED Viewed

@@ -424,6 +424,8 @@ Once a valid PyTorch version is installed, SDPA is activated by default. It can
 + model = AutoModelForSpeechSeq2Seq.from_pretrained(model_id, torch_dtype=torch_dtype, low_cpu_mem_usage=True, use_safetensors=True, attn_implementation="sdpa")
 ```
 #### Torch compile
 Coming soon...

 + model = AutoModelForSpeechSeq2Seq.from_pretrained(model_id, torch_dtype=torch_dtype, low_cpu_mem_usage=True, use_safetensors=True, attn_implementation="sdpa")
 ```
+For more information about how to use the SDPA refer to the [Transformers SDPA documentation](https://huggingface.co/docs/transformers/en/perf_infer_gpu_one#pytorch-scaled-dot-product-attention).
 #### Torch compile
 Coming soon...