Matthias Seeger
mseeger
·
AI & ML interests
None yet
Recent Activity
commented on
an
article
20 days ago
Everything About Long Context Fine-tuning
commented on
an
article
about 1 month ago
Open-R1: a fully open reproduction of DeepSeek-R1
commented on
an
article
about 1 month ago
Open-R1: a fully open reproduction of DeepSeek-R1
Organizations
None yet
mseeger's activity
Exact computations for multi-head latent attention
1
#9 opened about 1 month ago
by
mseeger
hidden_size % num_attention_heads != 0
#2 opened 3 months ago
by
mseeger