Commit History

model: add re-trained xLSTM model with grouped corpus for pretraining
9035a6b
verified

stefan-it commited on

readme: fix markdown
01d2cb6
verified

stefan-it commited on

readme: mention potential bug in pretraining (truncated Wikipedia articles are used)
ba44f7a
verified

stefan-it commited on

config: add mapping for AutoModelForSequenceClassification to own xLSTMForSequenceClassification
9d21367

stefan-it commited on

modeling: sync xLSTMForSequenceClassification with Patrick's codebase from https://github.com/HallerPatrick/helibrunna/blob/a1b377271867d5f23201ccacb55e017749aba487/model/modeling_xlstm.py
7b7eb08

stefan-it commited on

readme: fix revision of forked Helibrunna repo
83fd560
verified

stefan-it commited on

xlstm-config: temporarily introduce new hidden_size parameter
dbe6e99
verified

stefan-it commited on

readme: include some new logo :-)
5980dab
verified

stefan-it commited on

figure: add some new logo :p
2ebd9c5
verified

stefan-it commited on

readme: update information about final xLSTM model (one epoch over corpus)
48b8ed3
verified

stefan-it commited on

figure: add updated loss curve for training
aed4ef8
verified

stefan-it commited on

model: add newly trained xLSTM model (with grad clipping)
38db0e5
verified

stefan-it commited on

readme: cleanup configuration example
378829e

stefan-it commited on

readme: mention currently missing grad norm
c5e03ab

stefan-it commited on

readme: mention Tristan
d25058a

stefan-it commited on

readme: mention Tristan
c08abca

stefan-it commited on

readme: add more training details
38341af

stefan-it commited on

readme: add example usage
7acb992

stefan-it commited on

config: fix it
bdcccf0

stefan-it commited on

xlstm: add configuration and modeling (own one)
caecb8c

stefan-it commited on

readme: mention uploaded checkpoint
49473bf

stefan-it commited on

tokenizer: add config and vocab
92d0b20

stefan-it commited on

model: add generation confgi
1e51915

stefan-it commited on

model: add config and trained xLSTM model
c2b9dd2

stefan-it commited on

figure: add training loss overview
6ce722e

stefan-it commited on

readme: update
87b1360

stefan-it commited on

readme: finalize training section
37162e4
verified

stefan-it commited on

metrics: add final logs after 2 epochs
2164244

stefan-it commited on

model: add training log
962439a

stefan-it commited on

model: add best model
b8dd5fd

stefan-it commited on

readme: minor tweaks
70454bb
verified

stefan-it commited on

readme: add initial version
609e6b7
verified

stefan-it commited on

initial commit
2228461
verified

stefan-it commited on