Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
stefan-it
/
xlstm-german-wikipedia
like
7
Text Generation
Transformers
Safetensors
German
xlstm
custom_code
License:
cc-by-sa-3.0
Model card
Files
Files and versions
Community
Train
Use this model
83fd560
xlstm-german-wikipedia
1 contributor
History:
28 commits
stefan-it
readme: fix revision of forked Helibrunna repo
83fd560
verified
3 months ago
.gitattributes
1.52 kB
initial commit
6 months ago
README.md
3.75 kB
readme: fix revision of forked Helibrunna repo
3 months ago
brat-logo.png
57.8 kB
figure: add some new logo :p
3 months ago
config.json
639 Bytes
config: fix it
3 months ago
configuration_xlstm.py
3.08 kB
xlstm-config: temporarily introduce new hidden_size parameter
3 months ago
generation_config.json
69 Bytes
model: add generation confgi
3 months ago
model.safetensors
445 MB
LFS
model: add newly trained xLSTM model (with grad clipping)
3 months ago
modeling_xlstm.py
6.58 kB
xlstm: add configuration and modeling (own one)
3 months ago
special_tokens_map.json
551 Bytes
tokenizer: add config and vocab
3 months ago
tokenizer.json
1.84 MB
tokenizer: add config and vocab
3 months ago
tokenizer_config.json
957 Bytes
tokenizer: add config and vocab
3 months ago
training-loss.png
201 kB
figure: add updated loss curve for training
3 months ago