Lavanya KV
lkv
·
AI & ML interests
None yet
Organizations
lkv's activity
Very high loss compared to keras
6
#46 opened 4 months ago
by
tanimazsin130
Shutting down servers during fine-tuning
2
#73 opened 4 months ago
by
yjok0220
What is the max sequence length that model can compute if I use flash attention?
1
#20 opened 2 months ago
by
halfmoon039