Jens Roland

FreeHugsForRobots

AI & ML interests

None yet

Recent Activity

Organizations

None yet

FreeHugsForRobots's activity

New activity in mistralai/Mamba-Codestral-7B-v0.1 2 months ago
upvoted an article 5 months ago
Article

Our Transformers Code Agent beats the GAIA benchmark!

• 47
replied to DmitryRyumin's post 7 months ago
reacted to DmitryRyumin's post with 🔥 7 months ago
Post
2178
🔥🚀🌟 New Research Alert - xLSTM! 🌟🚀🔥
📄 Title: xLSTM: Extended Long Short-Term Memory

๐Ÿ“ Description: xLSTM is a scaled-up LSTM architecture with exponential gating and modified memory structures to mitigate known limitations. xLSTM blocks outperform SOTA transformers and state-space models in performance and scaling.

👥 Authors: Maximilian Beck et al.

📄 Paper: xLSTM: Extended Long Short-Term Memory (2405.04517)

๐Ÿ“ Repository: https://github.com/NX-AI/xlstm

📚 More Papers: more cutting-edge research presented at other conferences is available in DmitryRyumin/NewEraAI-Papers, curated by @DmitryRyumin

๐Ÿ” Keywords: #xLSTM #DeepLearning #Innovation #AI
  • 1 reply