Text-to-Speech
GGUF
Inference Endpoints
edwko commited on
Commit
7be995f
1 Parent(s): fb34ce3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -0
README.md CHANGED
@@ -172,6 +172,22 @@ model_config = outetts.GGUFModelConfig_v1(
172
  interface = outetts.InterfaceGGUF(model_version="0.2", cfg=model_config)
173
  ```
174
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
175
  ## Creating a Speaker for Voice Cloning
176
 
177
  To achieve the best results when creating a speaker profile, consider the following recommendations:
 
172
  interface = outetts.InterfaceGGUF(model_version="0.2", cfg=model_config)
173
  ```
174
 
175
+ # Configure the model with bfloat16 and flash attention
176
+
177
+ ```python
178
+ import outetts
179
+ import torch
180
+
181
+ model_config = outetts.HFModelConfig_v1(
182
+ model_path="OuteAI/OuteTTS-0.2-500M",
183
+ language="en", # Supported languages in v0.2: en, zh, ja, ko
184
+ dtype=torch.bfloat16,
185
+ additional_model_config={
186
+ 'attn_implementation': "flash_attention_2"
187
+ }
188
+ )
189
+ ```
190
+
191
  ## Creating a Speaker for Voice Cloning
192
 
193
  To achieve the best results when creating a speaker profile, consider the following recommendations: