eustlb
/

distil-large-v3-fr

Automatic Speech Recognition

Inference Endpoints

Model card Files Files and versions Community

eustlb HF staff commited on Jun 20, 2024

Commit

954a00f

·

1 Parent(s): ea9549d

readme update

Files changed (1) hide show

README.md +4 -3

README.md CHANGED Viewed

@@ -618,7 +618,7 @@ The model has been tested for both in-distribution (Common Voice 17 and Multilin
 ### Short-Form
-|     Model Name     |   RTF   | Common Voice 17 | Multilingual Librispeech | Voxpopuli | Fleurs |
 | :----------------: | :-----: | :-------------: | :----------------------: | :-------: | :----: |
 | distil-large-v3-fr | 310.127 |     12.681      |          5.865           |  10.851   | 7.984  |
 |    whisper-tiny    | 280.576 |     56.757      |          37.512          |  32.505   | 46.173 |
@@ -627,12 +627,13 @@ The model has been tested for both in-distribution (Common Voice 17 and Multilin
 |   whisper-medium   |  170.9  |     15.432      |          9.602           |   11.92   | 9.155  |
 |  whisper-large-v3  | 150.719 |     11.024      |          4.783           |   9.948   | 5.624  |
-*the above datasets correspond to test splits, RTF co
 ### Long-Form
-|     Model Name     |   RTF   | [long-form test set](https://huggingface.co/datasets/eustlb/french-long-form-test) |
 | :----------------: | :-----: | :--------------------------------------------------------------------------------: |
 | distil-large-v3-fr | 169.692 |                                       11.385                                       |
 |    whisper-tiny    | 125.367 |                                       28.277                                       |

 ### Short-Form
+|     Model Name     |   RTFx   | Common Voice 17 | Multilingual Librispeech | Voxpopuli | Fleurs |
 | :----------------: | :-----: | :-------------: | :----------------------: | :-------: | :----: |
 | distil-large-v3-fr | 310.127 |     12.681      |          5.865           |  10.851   | 7.984  |
 |    whisper-tiny    | 280.576 |     56.757      |          37.512          |  32.505   | 46.173 |
 |   whisper-medium   |  170.9  |     15.432      |          9.602           |   11.92   | 9.155  |
 |  whisper-large-v3  | 150.719 |     11.024      |          4.783           |   9.948   | 5.624  |
+*the above datasets correspond to test splits
+*$RTFx =\frac{1}{RTF}$, where RTF is the [Real Time Factor](https://openvoice-tech.net/wiki/Real-time-factor). To be interpreted as audio processed (in seconds) per second of processing.
 ### Long-Form
+|     Model Name     |   RTFx   | [long-form test set](https://huggingface.co/datasets/eustlb/french-long-form-test) |
 | :----------------: | :-----: | :--------------------------------------------------------------------------------: |
 | distil-large-v3-fr | 169.692 |                                       11.385                                       |
 |    whisper-tiny    | 125.367 |                                       28.277                                       |