flax-community/roberta-swahili
Tags: Fill-Mask, Transformers, PyTorch, JAX, TensorBoard, flax-community/swahili-safi, Swahili, roberta, Inference Endpoints
4 contributors
History: 98 commits
Latest commit: 095dc7f, "Update README.md" by alokmatta, over 3 years ago
File | Scan | Size | LFS | Last commit message | Updated
--- | --- | --- | --- | --- | ---
.gitattributes | Safe | 737 Bytes | | push | over 3 years ago
README.md | Safe | 1.61 kB | | Update README.md | over 3 years ago
config.json | Safe | 671 Bytes | | set new dataset in train_tokenizer | over 3 years ago
create_config.py | Safe | 142 Bytes | | push | over 3 years ago
events.out.tfevents.1625699083.t1v-n-6a2ff29b-w-0.6154.3.v2 | Safe | 332 kB | LFS | push | over 3 years ago
events.out.tfevents.1625736354.t1v-n-6a2ff29b-w-0.169303.3.v2 | Safe | 12.7 MB | LFS | push | over 3 years ago
events.out.tfevents.1625831062.t1v-n-6a2ff29b-w-0.1152929.3.v2 | Safe | 40 Bytes | LFS | set new dataset in train_tokenizer | over 3 years ago
events.out.tfevents.1625850549.t1v-n-6a2ff29b-w-0.1178206.3.v2 | Safe | 9.15 MB | LFS | set new dataset in train_tokenizer | over 3 years ago
events.out.tfevents.1625996487.t1v-n-6a2ff29b-w-0.1982849.3.v2 | Safe | 10.5 MB | LFS | Retrain with new cleaned data | over 3 years ago
events.out.tfevents.1626116445.t1v-n-6a2ff29b-w-0.3120945.3.v2 | Safe | 12.7 MB | LFS | Retrain with new cleaned data | over 3 years ago
flax_model.msgpack | Safe | 422 MB | LFS | Retrain with new cleaned data | over 3 years ago
flax_to_torch.py | Safe | 137 Bytes | | set new dataset in train_tokenizer | over 3 years ago
pytorch_model.bin | Safe (pickle) | 422 MB | LFS | Retrain with new cleaned data | over 3 years ago
run.sh | Safe | 673 Bytes | | Update data sources in run script | over 3 years ago
run_mlm_flax.py | Safe | 74 Bytes | | add | over 3 years ago
tokenizer.json | Safe | 642 kB | | New tokenizer with cleaned data | over 3 years ago
train_tokenizer.py | Safe | 744 Bytes | | New tokenizer with cleaned data | over 3 years ago

Detected Pickle imports (4) in pytorch_model.bin: "torch.LongStorage", "torch._utils._rebuild_tensor_v2", "collections.OrderedDict", "torch.FloatStorage"