stefan-it/xlstm-german-wikipedia
Tags: Text Generation, Transformers, Safetensors, German, xlstm, custom_code
License: cc-by-sa-3.0
1 contributor
History: 30 commits
Latest commit (9d21367, 3 months ago, by stefan-it): config: add mapping for AutoModelForSequenceClassification to own xLSTMForSequenceClassification
| File | Size | Last commit message | Last updated |
|---|---|---|---|
| .gitattributes | 1.52 kB | initial commit | 6 months ago |
| README.md | 3.75 kB | readme: fix revision of forked Helibrunna repo | 3 months ago |
| brat-logo.png | 57.8 kB | figure: add some new logo :p | 3 months ago |
| config.json | 730 Bytes | config: add mapping for AutoModelForSequenceClassification to own xLSTMForSequenceClassification | 3 months ago |
| configuration_xlstm.py | 3.08 kB | xlstm-config: temporarily introduce new hidden_size parameter | 3 months ago |
| generation_config.json | 69 Bytes | model: add generation config | 3 months ago |
| model.safetensors | 445 MB (LFS) | model: add newly trained xLSTM model (with grad clipping) | 3 months ago |
| modeling_xlstm.py | 9.85 kB | modeling: sync xLSTMForSequenceClassification with Patrick's codebase from https://github.com/HallerPatrick/helibrunna/blob/a1b377271867d5f23201ccacb55e017749aba487/model/modeling_xlstm.py | 3 months ago |
| special_tokens_map.json | 551 Bytes | tokenizer: add config and vocab | 3 months ago |
| tokenizer.json | 1.84 MB | tokenizer: add config and vocab | 3 months ago |
| tokenizer_config.json | 957 Bytes | tokenizer: add config and vocab | 3 months ago |
| training-loss.png | 201 kB | figure: add updated loss curve for training | 3 months ago |
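
The config.json commit message mentions a mapping from AutoModelForSequenceClassification to the repository's own xLSTMForSequenceClassification. The actual contents of config.json are not shown on this page, so the following is only a minimal sketch of how such a mapping is commonly expressed, assuming it follows the usual Transformers `auto_map` convention for custom code (an auto class name pointing to `module.ClassName` inside the repository). The `CONFIG_EXCERPT` string and the `resolve_auto_class` helper are hypothetical and exist only for illustration.

```python
import json
from typing import Tuple

# Hypothetical excerpt: the real config.json is not shown on this page.
# Assumes the standard "auto_map" convention used for custom code on the Hub.
CONFIG_EXCERPT = """
{
  "auto_map": {
    "AutoModelForSequenceClassification": "modeling_xlstm.xLSTMForSequenceClassification"
  }
}
"""


def resolve_auto_class(config: dict, auto_class: str) -> Tuple[str, str]:
    """Split an auto_map entry into (Python file name, class name).

    Illustrative helper only, not part of any library.
    """
    reference = config["auto_map"][auto_class]
    # "modeling_xlstm.xLSTMForSequenceClassification" -> module and class parts
    module_name, class_name = reference.rsplit(".", 1)
    return f"{module_name}.py", class_name


config = json.loads(CONFIG_EXCERPT)
module_file, class_name = resolve_auto_class(
    config, "AutoModelForSequenceClassification"
)
print(f"{class_name} would be loaded from {module_file}")
```

Under that assumption, loading the classification model with Transformers would look something like `AutoModelForSequenceClassification.from_pretrained("stefan-it/xlstm-german-wikipedia", trust_remote_code=True)`. The `trust_remote_code=True` flag is needed because the repository is tagged custom_code, and additional dependencies may be required (the README refers to a forked Helibrunna repo).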