jinaai/xlm-roberta-flash-implementation
Jina AI
Transformers
94 languages
xlm-roberta
Inference Endpoints
License: cc-by-nc-4.0
Region: EU
10 contributors
History: 30 commits
Latest commit: jupyterjazz, "feat: no flash attention during inference" (e3423c0), 6 months ago
File | Scan | Size | Last commit message | Last updated
.gitattributes | Safe | 1.52 kB | initial commit | 9 months ago
README.md | Safe | 147 Bytes | add mlm model and adjust naming | 9 months ago
block.py | Safe | 17.7 kB | add stochastic_depth | 8 months ago
config | Safe | 980 Bytes | Create config | 6 months ago
configuration_xlm_roberta.py | Safe | 2.73 kB | truncate-embedding-dimension (#10) | 7 months ago
convert_roberta_weights_to_flash.py | Safe | 6.94 kB | Support for SequenceClassification (#7) | 8 months ago
embedding.py | Safe | 2.56 kB | alibi (#19) | 7 months ago
mha.py | Safe | 31.5 kB | rope-embeddings (#20) | 7 months ago
mlp.py | Safe | 6.21 kB | upload model | 9 months ago
modeling_lora.py | Safe | 13.8 kB | lora bugfix (#16) | 7 months ago
modeling_xlm_roberta.py | Safe | 52.4 kB | rope-embeddings (#20) | 7 months ago
modeling_xlm_roberta_for_glue.py | Safe | 4.45 kB | Update modeling_xlm_roberta_for_glue.py | 8 months ago
pytorch_model.bin | Safe (pickle) | 1.11 GB (LFS) | add mlm model and adjust naming | 9 months ago
rotary.py | Safe | 22.6 kB | feat: no flash attention during inference | 6 months ago
stochastic_depth.py | Safe | 3.76 kB | add stochastic_depth | 8 months ago
tokenizer.json | Safe | 9.1 MB | upload model | 9 months ago
tokenizer_config.json | Safe | 75 Bytes | Update tokenizer_config.json (#14) | 7 months ago
xlm_padding.py | Safe | 9.82 kB | add mlm model and adjust naming | 9 months ago

Detected pickle imports (3) in pytorch_model.bin: "torch.FloatStorage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2"