- LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
  Paper • 2404.05961 • Published • 64
- Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
  Paper • 2404.07143 • Published • 104
- Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies
  Paper • 2404.08197 • Published • 27
- Pre-training Small Base LMs with Fewer Tokens
  Paper • 2404.08634 • Published • 34
vitalyr
AI & ML interests
None yet
Recent Activity
liked a model 4 days ago: henryen/OriGen
liked a Space about 1 month ago: PR-Puppets/PR-Puppet-Sora
Organizations
None yet
Collections: 2
Models: None public yet
Datasets: None public yet