| Model Name | Parameters | Class | Ratio | Tokens | Batch Size (Tokens) | Training Loss |
|---|---|---|---|---|---|---|
| GerbilLab/Gerbil-D-3.3m | 3.3M | D-Class | 142 | 426M | 65.5k | 5.3307 |
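
The figures in the table can be turned into two rough derived quantities. The sketch below is illustrative only and rests on two assumptions not stated in the card: that the batch size is constant at 65.5k tokens per optimizer step, and that the training loss is a mean cross-entropy in nats (natural log), so that its exponential is a perplexity. The variable names are hypothetical.

```python
import math

# Values copied from the table above.
tokens_seen = 426e6          # "Tokens" column (426M)
batch_size_tokens = 65.5e3   # "Batch Size (Tokens)" column (65.5k)
training_loss = 5.3307       # "Training Loss" column

# Assumption: constant batch size, so steps = tokens / tokens-per-step.
approx_steps = tokens_seen / batch_size_tokens

# Assumption: loss is mean cross-entropy in nats, so exp(loss) is perplexity.
approx_perplexity = math.exp(training_loss)

print(f"approximate optimizer steps: {approx_steps:,.0f}")
print(f"approximate training perplexity: {approx_perplexity:.1f}")
```

If either assumption does not hold (for example, a batch-size schedule or a loss reported in bits), these numbers would need to be recomputed accordingly.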